Inconsistency in the "Speech to text" documentation

benjamin-feldman · June 2, 2025, 11:23pm

Hello,

I’m reporting a small inconsistency in the “Speech to text” documentation.

In the documentation, there is this excerpt:

from openai import OpenAI

client = OpenAI()
audio_file = open("/path/to/file/speech.mp3", "rb")

transcription = client.audio.transcriptions.create(
    model="gpt-4o-transcribe", 
    file=audio_file, 
    response_format="text"
)

print(transcription.text)

However, running this raises an AttributeError on the last line, because transcription is a plain str, which has no .text attribute. This is expected behavior: the implementation of audio.transcriptions.create returns a str when response_format equals "text".

A quick fix would be to change print(transcription.text) to print(transcription) in the documentation.
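Until the docs are corrected, here is a minimal sketch of a defensive workaround. The helper name transcription_text is hypothetical (it is not part of the openai library), and the two sample values are stand-ins for the return shapes described above, so no API call is made:

```python
from types import SimpleNamespace


def transcription_text(result):
    """Return the transcript whether `result` is a plain str or an object with a .text attribute."""
    # response_format="text" yields a plain str, as described in this post.
    if isinstance(result, str):
        return result
    # Assumption: any other result exposes the transcript as .text.
    return result.text


# Stand-ins for the two return shapes; these are not real API responses.
as_text = "hello world"
as_object = SimpleNamespace(text="hello world")

print(transcription_text(as_text))
print(transcription_text(as_object))
```

With this, the same print call works whichever response_format is passed.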

thx!
