Streaming from Text-to-Speech api

@tonycamonte are you actually getting streaming audio, though?

From what I can tell, {spoken_response} generates the full audio output all in one go. I have a similar script, and it’ll play, sure, but put a print statement before buffer() and give it a good chunk of text, and you’ll see that it’s 30 seconds of processing before it even tries to assign a value to the buffer.

6 Likes