I have been trying to use gpt-4o-transcribe-diarize with both WAV and Opus-encoded (WebM) files, but I always get back only the first ~32 seconds of the transcript, as if only the first chunk is processed on the backend. Any clue what I am doing wrong? I tried with and without `extra_body`, same result.
```js
try {
  stream = fh.createReadStream();
  const filePart = await toFile(stream, inputFileName || "input_audio");

  speakerRefs = await buildSpeakerReferences({
    interviewerPath: interviewerSamplePath,
    panelistPath: panelistSamplePath,
    fallbackPath: monoSamplePath,
    durationSec
  });

  const request = {
    model: "gpt-4o-transcribe-diarize",
    file: filePart,
    response_format: "diarized_json",
    chunking_strategy: "auto"
  };

  // Optional speaker hints, passed through extra_body
  if (speakerRefs.names.length) {
    request.extra_body = {
      known_speaker_names: speakerRefs.names,
      known_speaker_references: speakerRefs.references
    };
  }

  console.log("DEBUG0", JSON.stringify(request, null, 2));

  response = await openai.audio.transcriptions.create({ ...request });
  response._speaker_reference_debug = speakerRefs.debug;

  console.log("DEBUG1", JSON.stringify(response, null, 2));
} finally {
  if (stream) stream.destroy();
  await fh.close();
}
```