Send data stream to TTS API

pnhco · April 17, 2024, 1:42am

Hi,
I am learning to develop a feature as follows:

Users submit questions by voice. I use the Whisper API to convert speech to text.
Next, I send the above text to API Assist so that it responds based on the document I provide, then returns the data as a stream.
I use socketIO to send this stream data to the user, the same way ChatGPT is doing.

My difficulty right now is that in addition to the text answer, I want to have a voice answer, both will be answered at the same time, just like when you watch a movie with subtitles.

Is there any solution to send stream data from Assist API to TTS API? I know TTS can return stream data to play audio but I don’t see any documentation regarding receiving text stream data to TTS.

I’m using NodeJS. If you have any solution, can you give me a reference? Thanks everyone.

_j · April 17, 2024, 2:34am

There is no “at the same time”, unless you are doing your own parsing on the AI output a sentence at a time, and sending each for transcriptions. If streaming, it would instead be identifying the point where a complete section can be spoken, by intelligent identification of what is being built until sentence-length pieces are compete thoughts, get the audio, and hold back on the text display until the first chunk is complete. Then buffering what continues after.

The only way I can see to sync voice to a transcript reliably for realtime display (up to the level of coloring words as they are spoken) is to then send to whisper for time indexing, and play the transcript back as text at the same display rate as the timestamps.

pnhco · April 17, 2024, 2:56am

@_j Thank you for your reply, since I have little experience dealing with these issues, do you have any example code that I can refer to?

Topic		Replies	Views
Whisper Streaming Strategy API chatgpt , whisper , streaming	8	15604	June 30, 2025
ChatGPT API TTS streaming API api	3	5144	January 21, 2025
Is it possible to interact with the assistant API using ones voice similarly to the app? API	8	2481	August 18, 2024
Web Speech API with whisper API whisper	1	38	July 24, 2025
Streaming feature of assistant api nodeJS API assistants-api , streaming , assistants-streaming	0	163	August 12, 2024

Send data stream to TTS API

Related topics