Streaming from Text-to-Speech API

joshuavogel79 · November 14, 2023, 8:26pm

@tonycamonte are you actually getting streaming audio, though?

From what I can tell, {spoken_response} generates the full audio output all in one go. I have a similar script, and it’ll play, sure, but put a print statement before buffer() and give it a good chunk of text, and you’ll see about 30 seconds of processing before it even tries to assign a value to the buffer. So the audio only starts playing after the whole clip has been generated, which isn’t really streaming.
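If you want something more precise than a print statement, here is a rough sketch of how I'd check it: time how long the first chunk takes to arrive versus the whole response. Everything in it is made up for illustration (measure_stream, fake_streaming, and fake_buffered are hypothetical names, and the two fake sources just sleep to simulate a TTS backend); swap in whatever iterable of audio bytes your own script produces.

```python
import time
from typing import Iterable, Iterator, Tuple


def measure_stream(chunks: Iterable[bytes]) -> Tuple[float, float, int]:
    """Return (seconds until first chunk, total seconds, total bytes)."""
    start = time.perf_counter()
    first_chunk_at = None
    total_bytes = 0
    for chunk in chunks:
        if first_chunk_at is None:
            # Time at which the first piece of audio became available.
            first_chunk_at = time.perf_counter() - start
        total_bytes += len(chunk)
    total = time.perf_counter() - start
    if first_chunk_at is None:
        # Empty source: no chunk ever arrived.
        first_chunk_at = total
    return first_chunk_at, total, total_bytes


def fake_streaming(n: int = 5, delay: float = 0.02) -> Iterator[bytes]:
    """Hypothetical source that yields each chunk as soon as it is 'generated'."""
    for _ in range(n):
        time.sleep(delay)
        yield b"\x00" * 1024


def fake_buffered(n: int = 5, delay: float = 0.02) -> Iterator[bytes]:
    """Hypothetical source that generates everything first, then hands out chunks."""
    time.sleep(n * delay)
    for _ in range(n):
        yield b"\x00" * 1024


if __name__ == "__main__":
    for name, source in (("streaming", fake_streaming()), ("buffered", fake_buffered())):
        first, total, size = measure_stream(source)
        print(f"{name}: first chunk after {first:.3f}s, done after {total:.3f}s, {size} bytes")
```

If the time to first chunk is close to the total time, you are getting the behaviour I described above: the audio is generated in full before anything reaches your buffer. With real streaming, the first chunk should show up well before the end.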
