Invalid JSON returned from Audio/Whisper endpoints

SlyJack · May 23, 2024, 8:03pm

When using the Transcriptions and Translations endpoints from the Python API (openai.audio.transcriptions.create and openai.audio.translations.create), with response_format set to “verbose_json” or “json”, it returns something that is not valid JSON. The output begins with a custom data type, ie. “Transcription”, and contains parentheses, unescaped characters, single quotes, and other components that cause it to be invalid JSON.

Ex:

response = openai.audio.transcriptions.create(
    model="whisper-1", 
    file=audio_file,
    response_format="json"
)
# response
# Transcription(text='¿Cómo podría ser el mundo transformado? ¿Cómo vendría su reino? ... hasta el final, hasta lo último.')

response = openai.audio.translations.create(
    model="whisper-1", 
    file=audio_file,
    response_format="json"
)
# response
# Translation(text='How could the world be transformed? How would his kingdom come? ... until the end, until the end.')

Note: the text field has been abbreviated with “…” for space in the outputs.

Test was done with the following:

a Jupyter notebook
Python 3.12.1
OpenAI Python SDK, version 1.30.1

felixwang · June 28, 2024, 5:54am

I had the same issue. Surprised that this issue hasn’t been resolved for more than a month.

sahilashar · August 18, 2024, 8:16am

Has anybody heard if/when this will be fixed? Feels a bit weird having inconsistent response payload formats across APIs.

Topic		Replies	Views
OpenAI whisper model is generating '...' for non-english audios Bugs whisper	0	51	December 9, 2024
Whisper API respnse issue API whisper	5	2321	December 17, 2023
RealTime API Transcription errors Bugs realtime	7	1792	January 9, 2025
Incorrect Transcription - Arabic voice returns Hebrew text Bugs whisper	0	89	October 2, 2024
[Bug] Assistant API returns malformed JSON despite response_format=json_object Bugs assistant	2	75	June 21, 2025

Invalid JSON returned from Audio/Whisper endpoints

Related topics