What is the difference between realtime-transcription and speech-to-text for streaming the transcription of an ongoing audio recording?

inference · March 28, 2025, 12:39am

I’m trying to transcribe audio to text in real time, with microphone audio streamed over a WebSocket to OpenAI via the JavaScript SDK … I want to know the difference between https://platform.openai.com/docs/guides/realtime-transcription and https://platform.openai.com/docs/guides/speech-to-text for streaming the transcription of an ongoing audio recording.

TonyStark · March 29, 2025, 8:23pm

speech-to-text can stream its output, but it takes a complete audio file as input.

realtime can take streams as both input and output.
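
To make the "stream as input" side concrete, here is a rough sketch of how a chunk of microphone samples might be packaged before being sent over the WebSocket. It assumes the session accepts 16-bit little-endian PCM and that audio is appended with an `input_audio_buffer.append` event carrying base64 audio; the helper names `floatTo16BitPCM` and `buildAppendEvent` are made up for illustration, and resampling to whatever rate the session expects is left out. Check the realtime guide for the current event shapes before relying on this.

```javascript
// Sketch only: package microphone samples for a streaming-input session.
// Assumptions (not confirmed in this thread): 16-bit little-endian PCM input
// and an `input_audio_buffer.append` client event with base64-encoded audio.

// Convert Float32 samples in [-1, 1] (the Web Audio API format)
// into 16-bit signed little-endian PCM bytes.
function floatTo16BitPCM(samples) {
  const out = Buffer.alloc(samples.length * 2);
  for (let i = 0; i < samples.length; i++) {
    // Clamp first so out-of-range samples do not overflow the 16-bit range.
    const s = Math.max(-1, Math.min(1, samples[i]));
    const scaled = Math.round(s < 0 ? s * 0x8000 : s * 0x7fff);
    out.writeInt16LE(scaled, i * 2);
  }
  return out;
}

// Wrap one chunk of samples in a JSON event string ready to send.
function buildAppendEvent(samples) {
  return JSON.stringify({
    type: "input_audio_buffer.append",
    audio: floatTo16BitPCM(samples).toString("base64"),
  });
}
```

With an already-open socket (called `ws` here purely as a placeholder), each microphone chunk would go out as `ws.send(buildAppendEvent(chunk))`. With speech-to-text there is no such loop: the whole file is uploaded in one request and only the response is streamed back.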

inference · April 1, 2025, 9:27pm

My bad, I need to pay attention to details… the “ongoing recording” is using the same realtime transcription endpoint, so it’s the same.
