Missing Documentation for WebSocket Realtime Transcription Mode

dnnkeeper · November 17, 2025, 9:07pm

The Realtime WebSocket API documentation does not mention how to establish a transcription‑only session. Attempting to use transcription models (whisper-1, gpt-4o-transcribe-latest, etc.) with the WebSocket endpoint like this: wss://api.openai.com/v1/realtime?model=whisper-1 results in errors:

Error: Model “whisper-1” is not supported in realtime mode.

The official docs for WebSockets connection only show example like this: wss://api.openai.com/v1/realtime?model=gpt-realtime

There is no mention of how to connect to transcription mode.

Developers have discovered that ?intent=transcription works, but this parameter is not documented.

Attempting to send session.update to change session type also fails because

Passing a transcription session update event to a realtime session is not allowed

It seems that without intent=transcription it is impossible to establish a realtime transcription session via WebSockets. Documentation should clearly explain how to start a transcription session via WebSocket.

Topic		Replies	Views
Realtime transcription model changes Deprecations whisper , realtime	2	661	May 27, 2025
Transcription config for `gpt-4o-mini-transcribe` doesn't work? Bugs	4	910	March 21, 2025
Gpt-realtime-whisper rejects turn_detection despite docs showing it as the canonical example Deprecations	0	169	May 14, 2026
WebRTC transcription guide seems to be broken Bugs	12	1138	April 1, 2025
Why does transcription_session.update cause an error? Realtime transcription API bug Bugs api-realtime	0	117	December 31, 2025

Missing Documentation for WebSocket Realtime Transcription Mode

Related topics