Streaming text in and audio out?


Hey

Is there a way to stream text from a text model like gpt-3.5 directly into the TTS endpoint and stream the resulting audio back as output?

Even though streaming the audio output is possible, waiting for the entire text to finish before generating the audio stream results in too much latency.
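To make the idea concrete, here is a rough sketch of the kind of workaround I have in mind, assuming the TTS endpoint only accepts complete text per request: buffer the streamed text deltas and hand off each sentence as soon as it is complete. The function name `sentences_from_stream` and the sentence-boundary regex are my own illustration, not part of any API, and the naive split will also break on abbreviations like "e.g.".

```python
import re
from typing import Iterable, Iterator

# Naive sentence boundary: '.', '!' or '?' followed by whitespace.
# This is an assumption for illustration; real text needs a better splitter.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


def sentences_from_stream(deltas: Iterable[str]) -> Iterator[str]:
    """Yield complete sentences as soon as they appear in a stream of text deltas."""
    buffer = ""
    for delta in deltas:
        buffer += delta
        parts = _SENTENCE_END.split(buffer)
        # Every part except the last one is followed by a boundary, so it is complete.
        for sentence in parts[:-1]:
            if sentence:
                yield sentence
        # The last part may still be growing; keep it for the next delta.
        buffer = parts[-1]
    # Flush whatever is left once the text stream ends.
    if buffer.strip():
        yield buffer.strip()


if __name__ == "__main__":
    # Simulated deltas standing in for a streamed chat completion.
    fake_deltas = ["Hello wor", "ld. How are", " you? I am", " fine"]
    for sentence in sentences_from_stream(fake_deltas):
        # In a real pipeline each sentence would go to its own TTS request here.
        print(sentence)
```

Each yielded sentence could then be sent to the TTS endpoint on its own, so audio for the first sentence can start playing while the text model is still generating the rest. That still costs one TTS request per sentence, which is why I'm asking whether true text-in streaming is possible.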
