Whisper API cut output short when input is large (smaller than 25 MB)

_j · February 7, 2024, 2:37am

First: I would see if it is the file size, or the audio length.

For transcriptions, you can send Opus audio using a voice codec. This is three hours at under 20MB:

ffmpeg -i audio.mp3 -vn -map_metadata -1 -ac 1 -c:a libopus -b:a 12k -application voip audio.opus

It’s more efficient for everybody, and limiting to voice bandwidth can improve the transcription.

Then: is it terminating at a silence? Too much silence would normally get you some hallucinations after a long period, not a premature finish, but the behavior may have changed.

Topic		Replies	Views
Whisper API, increase file limit >25 MB API whisper , feature-request	29	17686	June 19, 2024
Whisper API fails on "large" ogg files (still below 25MB) Bugs whisper	2	1151	April 15, 2024
Issue with speech-to-text MP3 size API whisper	6	989	April 26, 2024
WhisperAI API Not Recognizing Valid File Formats API whisper	5	4897	December 15, 2023
Whisper API Limits - Transcriptions API whisper	11	15358	December 18, 2023

Whisper API cut output short when input is large (smaller than 25 MB)

Related topics