I’m trying to transcribe audio to text in real time, with microphone audio streamed over a WebSocket to OpenAI via the JavaScript SDK … I want to know the difference between https://platform.openai.com/docs/guides/realtime-transcription and https://platform.openai.com/docs/guides/speech-to-text for streaming the transcription of an ongoing audio recording.
speech-to-text can stream its output, but it takes a complete audio file as input.
realtime can take a stream as input and stream its output.
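To make the "stream as input" part concrete, here is a rough sketch of how microphone audio could be cut into chunks and wrapped as Realtime `input_audio_buffer.append` events before being sent over the socket. The helper names `chunkAudio` and `toAppendEvent`, the chunk size, and the fake audio bytes are made up for illustration, and it assumes raw PCM16 bytes running in Node.js. It only builds the messages, it does not open a connection.

```javascript
// Hypothetical helper: split a buffer of raw PCM16 bytes into fixed-size chunks.
function* chunkAudio(pcmBytes, chunkSize) {
  for (let offset = 0; offset < pcmBytes.length; offset += chunkSize) {
    yield pcmBytes.subarray(offset, offset + chunkSize);
  }
}

// Hypothetical helper: wrap one chunk as an "input_audio_buffer.append" event.
// The audio payload is sent base64-encoded.
function toAppendEvent(pcmChunk) {
  return {
    type: "input_audio_buffer.append",
    audio: Buffer.from(pcmChunk).toString("base64"),
  };
}

// Stand-in for bytes captured from the microphone (assumption: PCM16 data).
const fakeMic = new Uint8Array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9]);

// Each event would be sent as it is produced, e.g. ws.send(JSON.stringify(event)),
// where ws is an already-open WebSocket to the realtime endpoint.
const events = [...chunkAudio(fakeMic, 4)].map(toAppendEvent);
```

With speech-to-text there is no equivalent of this loop: the whole file is uploaded in one request and only the transcript comes back incrementally.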
My bad, I need to pay more attention to details… the “ongoing recording” section uses the same realtime transcription endpoint, so it’s the same thing.