Text completion with a voice response

Does anyone know (or maybe the OpenAI team could respond) how we can use the API to send a request to the completions endpoint and get an audio stream back as the response?

Is that possible, or will it be soon? (At least, does the task exist in the backlog?)

Current workaround (ugly and costly way)

Send a request to the completion API (3 minutes of waiting) :sleeping:
Get the response and send it to the audio API (more minutes) :sleeping:

So the client waits a long time before they can listen to the response :sleepy: A rough sketch of this sequential flow is below.
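
For illustration, here is a minimal sketch of that two-step workaround, assuming the official `openai` Python SDK (v1+); the model names, voice, prompt, and output filename are placeholders, not a recommendation:

```python
from openai import OpenAI

client = OpenAI()

# Step 1: wait for the *entire* text completion to finish.
completion = client.chat.completions.create(
    model="gpt-4o",  # placeholder model
    messages=[{"role": "user", "content": "Explain photosynthesis briefly."}],
)
text = completion.choices[0].message.content

# Step 2: only then send the whole text to the TTS endpoint.
speech = client.audio.speech.create(
    model="tts-1",   # placeholder TTS model
    voice="alloy",
    input=text,
)
speech.write_to_file("reply.mp3")  # the client has now waited for two full round trips
```

The problem is visible in the structure: nothing reaches the user's ears until both requests have fully completed.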

Better workaround:

  • stream the response, so you are getting tokens as they are generated,
  • start sending response sentences for TTS as soon as they are received,
  • buffer and assemble the audio stream, and start WebRTC playback once buffer underruns are unlikely (see the sketch after this list).
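
Here is a hedged sketch of that pipeline, again assuming the `openai` Python SDK; the sentence splitting is deliberately naive, and each sentence's audio is written to a file where a real app would push it into a playout buffer or WebRTC track instead:

```python
from openai import OpenAI

client = OpenAI()

def synthesize(sentence: str, index: int) -> None:
    # Send one sentence to TTS as soon as it is complete; in a real app this
    # chunk would be queued for playback rather than written to disk.
    speech = client.audio.speech.create(model="tts-1", voice="alloy", input=sentence)
    speech.write_to_file(f"chunk_{index}.mp3")

stream = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Explain photosynthesis briefly."}],
    stream=True,  # tokens arrive as they are generated
)

buffer, index = "", 0
for chunk in stream:
    if not chunk.choices:
        continue
    buffer += chunk.choices[0].delta.content or ""
    # Naive sentence-boundary detection: flush whenever ., ! or ? appears.
    while any(p in buffer for p in ".!?"):
        cut = min(i for i in (buffer.find(p) for p in ".!?") if i != -1) + 1
        sentence, buffer = buffer[:cut].strip(), buffer[cut:]
        if sentence:
            synthesize(sentence, index)
            index += 1

if buffer.strip():  # flush any trailing text without a terminator
    synthesize(buffer.strip(), index)
```

With this approach the first sentence can start playing while the model is still generating the rest, so perceived latency drops from minutes to roughly the time of the first sentence plus one TTS call.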