whisper-large
This documentation applies to the following model:
#g1_whisper-large
The Whisper models are intended primarily for AI research into model robustness, generalization, and biases. They are also effective for English speech recognition. Using Whisper models to transcribe recordings made without consent, or in high-risk decision-making contexts, is strongly discouraged because of potential inaccuracies and ethical concerns.
The models are trained on 680,000 hours of audio and corresponding transcripts collected from the internet. Of this data, 65% is English audio with English transcripts, 18% is non-English audio with English transcripts, and 17% is non-English audio with matching non-English transcripts. The non-English data covers 98 languages in total.
If you don’t have an API key for the AI/ML API yet, feel free to use our Quickstart guide.
/v1/stt/create
#g1_whisper-large
strict, extended
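As an illustration of how the endpoint and model ID above fit together, here is a minimal sketch that builds a POST request for `/v1/stt/create` without sending it. Only the path and the model ID come from this page. The host `https://api.aimlapi.com`, the Bearer authorization header, and the `url` field for the audio source are assumptions, and `build_stt_request` is a hypothetical helper name, so check the API schema before relying on any of them.

```python
import json
import urllib.request

# Assumptions (not confirmed by this page): the host, the Bearer auth
# scheme, and the "url" field name for the audio source.
BASE_URL = "https://api.aimlapi.com"  # assumed host
ENDPOINT = "/v1/stt/create"           # path taken from this page
MODEL_ID = "#g1_whisper-large"        # model ID taken from this page


def build_stt_request(api_key: str, audio_url: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for the speech-to-text endpoint."""
    # "url" is a hypothetical field name for the audio to transcribe.
    payload = {"model": MODEL_ID, "url": audio_url}
    return urllib.request.Request(
        BASE_URL + ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_stt_request("<YOUR_API_KEY>", "https://example.com/audio.mp3")
    # Print the method and target URL; nothing is sent over the network here.
    print(req.get_method(), req.full_url)
```

To actually submit the request, pass the returned object to `urllib.request.urlopen` and parse the JSON body of the response. Keeping request construction separate from sending makes it easy to inspect the payload before spending API credits.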