Gpt-4o-transcribe truncates output after ~8-9 minutes even on short segments

tomfordrumm · August 26, 2025, 5:12pm

Hi everyone,

I’ve run into a frustrating issue with the gpt-4o-transcribe (and gpt-4o-mini-transcribe) models. No matter how I prepare my audio, the transcription output always gets truncated after about 8–9 minutes of audio.

Here’s what I’ve tried so far:

Converted the source video (.mkv) into clean audio chunks using ffmpeg.
Made sure each chunk is mono, 16kHz, normalized with loudnorm, and low/high-pass filtered for clarity.
Exported to .m4a (AAC) instead of MP3 to avoid VBR issues.
Limited file sizes to well under 25 MB.
Limited durations first to 1400s (~23 min), then much shorter 540s (~9 min), even down to 480s (8 min).
Sent requests with response_format=json instead of text.
Tried both gpt-4o-transcribe and gpt-4o-mini-transcribe.

Despite all of that, the API still only returns text for the first ~8-9 minutes of audio. The rest is cut off completely — no error message, just a truncated transcription.

What’s interesting:

The same chunks transcribed with whisper-1 return the full transcript as expected.
So the problem seems to be specific to the 4o models.

My questions:

Are there recommended best practices to avoid truncation (e.g. maximum safe segment length)?
Has anyone found a reliable workaround besides falling back to Whisper?
Or is this something the dev team is aware of and working on fixing?

Any insights would be really helpful. Thanks!

jeffvpace · August 26, 2025, 6:04pm

First of all, the stated “under 25 MB limit“ is a joke. Our tests indicate the limit to be roughly half of that (about 13.30 minutes for .mp3s at mono, 128bps, 22.05 kHz). This is the isssue that OpenAI needs to resolve.

We have not revert to Whisper because we think OpenAI will eventually get arround to increasing the real limit.

matthewkogan · August 28, 2025, 1:39pm

It could be that you’re hitting the 2,000 output tokens limit of the gpt-4o-transcribe model.

tomfordrumm · August 29, 2025, 10:37am

Ouch, seems like you are right. Just changed output format to the json and I see “output_tokens”:2048

Topic		Replies	Views
Gpt-4o-transcribe truncates the transcript API transcribe	15	1900	August 29, 2025
Gpt-4o-transcribe audio length limits API	4	2743	May 27, 2025
GPT4.0-Transcribe—MAX 1500 SECONDS? API api	3	245	July 4, 2025
Persistent Truncation Issues with GPT-4o-Transcribe – Has Anyone Fully Solved This? API gpt-4 , api , transcribe , gpt-4o , api-realtime	11	693	July 30, 2025
Audio file might be corrupted or unsupported Bugs api	1	322	May 20, 2025

Gpt-4o-transcribe truncates output after ~8-9 minutes even on short segments

Related topics