`gpt-4o-transcribe` drops the end of recordings

davidg707 · April 14, 2025, 12:03am

I’m noticing that the gpt-4o-transcribe model often drops the end of recordings.

It’s not that it truncates the response (like this post) to some time limit, it’s that it ignores the last phrase or sentence, even in very short recordings.

Here’s the output of three models for the same input.

gpt-4o-transcribe: Testing pineapples and cats, as well as cheetahs.
gpt-4o-mini-transcribe: Testing pineapples and cats, as well as cheetahs, which are cats.
DeepGram: Testing pineapples and cats as well as cheetahs, which are cats.

Note that these recordings are quite bad quality and that this happens about 1 in every 8 recordings I try. Most of my recordings are just a few sentences.

Another example:
gpt-4o-transcribe: This gives me a single byte string, doesn’t it?
gpt-4o-mini-transcribe: This gives me a single byte string, doesn’t it? I thought tobytes gave me an array of bytes, but not by version.
DeepGram: This gives me a single byte string, doesn’t it? I thought two bytes gave me an array of bytes, the NumPy version.

StephanBH · April 20, 2025, 2:46pm

I have run into this same issue

Topic		Replies	Views
Gpt-4o-transcribe truncates the transcript API transcribe	15	3043	August 29, 2025
GPT-4o-transcribe truncation issue after the diarize API update API gpt-4o-transcribe	0	218	October 29, 2025
Gpt-4o-transcribe truncates output after ~8-9 minutes even on short segments Bugs transcribe	3	510	August 29, 2025
Gpt-4o-mini-tts degraded - audio at end being cut off Bugs	2	270	June 26, 2025
Gpt-4o-mini-tts-2025-12-15 still truncates final sentences; 2025-03-20 is being deprecated Bugs	6	371	June 6, 2026

`gpt-4o-transcribe` drops the end of recordings

Related topics