Open Ai realtime api audio translation

Kencliff · February 25, 2025, 9:24am

Hello Open AI, I am using the open ai real time api for audio to audio translation. In my prompt (instructions), i added that in case of background music, let it be maintained will the speaker, but the output comes with plain voice, no background sound when existing.

Also, when i specify that it should translate the audio, automatically choosing a voice (man, woman, child, teen etc), it doesn’t work like that. It just selects 1 voice and voice and translate from start to end with that voice.

Please how can i walk around all of these? I’m using Node.js.

Topic		Replies	Views
OpenAI_RealTime_Questions API realtime , api-realtime , api-realtime-speech	1	320	February 20, 2025
Transcription errors in realtime API API realtime	3	242	December 1, 2025
How to get input_audio_transcription when i use openai realtime api API realtime , api-realtime , api-realtime-speech	2	948	November 16, 2025
Realtime API Audio Modality output API realtime , api-realtime , api-realtime-speech	7	1282	December 13, 2024
Openai.audio.translation.create bug? API api	2	136	November 20, 2024

Open Ai realtime api audio translation

Related topics