How do you handle user transcripts in real-time GPT-4o chats?

vdhavala · April 25, 2025, 10:23pm

OpenAI’s Realtime API is a voice-to-voice model, but it can optionally give you the user-side transcript by running the input audio through a separate transcription model. Can you use that? In your session update (`session.update`), enable input audio transcription and choose the transcription model. Then, at conversation time, listen for the `conversation.item.input_audio_transcription.completed` server event, which carries the user's transcript. The model's own spoken reply is transcribed separately and arrives in `response.audio_transcript.done`.

See details here https://platform.openai.com/docs/api-reference/realtime-server-events/response/audio_transcript/done
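Here is a rough sketch of both steps in Python, in case it helps. It only builds the JSON message and parses incoming events, so the WebSocket connection itself is left out. The helper names `build_session_update` and `handle_server_event` are made up for illustration, and `whisper-1` is just an assumed choice of transcription model; check the field names against the current API reference before relying on them.

```python
import json


def build_session_update(transcription_model="whisper-1"):
    """Build a session.update message that turns on user-side transcription.

    The model name is an assumption; use whichever transcription model
    your account supports.
    """
    return json.dumps({
        "type": "session.update",
        "session": {
            "input_audio_transcription": {"model": transcription_model},
        },
    })


def handle_server_event(raw_message, log):
    """Append (speaker, transcript) to log for the two transcript events."""
    event = json.loads(raw_message)
    kind = event.get("type")
    if kind == "conversation.item.input_audio_transcription.completed":
        # Transcript of what the user said (input audio).
        log.append(("user", event.get("transcript", "")))
    elif kind == "response.audio_transcript.done":
        # Transcript of what the model said (output audio).
        log.append(("assistant", event.get("transcript", "")))
    # Any other event type is ignored here.
    return log
```

You would send the string from `build_session_update()` over the socket once the session is open, then pass every incoming message to `handle_server_event` with a shared list. Note that the user transcript is produced asynchronously, so it can arrive after the model has already started responding.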
