Getting audio stream from chat completion API

teshnizi · November 27, 2023, 7:12pm

Hi,

I want to receive the chat completion as an audio stream and play it for the user (like the voice feature in the OpenAI app).

One way to do it is to receive it as a stream of text and use the TTS API to turn it into an audio stream, but that means I’ll need to send multiple TTS requests for different chunks of the received text. The rate limit for the TTS API is 3 requests per minute (RPM), so that would not be feasible.

I think the ideal way would be just directly receiving an audio stream from the chat completion API. Does anyone have any tips?

Foxalabs · November 27, 2023, 8:22pm

Hi and welcome to the Developer Forum!

For the moment the new API endpoints are for evaluation only, and not intended for use in a commercial application.

You can reasonably assume that these rate limits will be lifted after the evaluation period is over, though I am not aware of how long that period will last.

_j · November 27, 2023, 8:36pm

That rate limit applies only while you’re on the free trial. Pay up, and it is increased.

The chat completions endpoint returns the text generated by the AI. There are no other features except for returning function-call language in a different manner.

The TTS endpoint accepts up to 4096 characters. That allows for almost all responses that aren’t multiple minutes of an AI reading text to you. You’d likely want to do additional system prompting to tell the AI “User receives text as audio, avoid output over 400 words” or similar.

It shouldn’t take much chunking of streamed output to simulate responsiveness. You can send the first two sentences off for TTS, and by the time that is read to the user, you can probably have encoded the rest.
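A minimal sketch of that chunking idea, in Python. It only handles the text side: it buffers streamed text pieces, releases the first two complete sentences as soon as they arrive, and then releases the rest in pieces that stay under the 4096-character limit. The function name `chunk_for_tts`, its parameters, and the simple "punctuation followed by whitespace" sentence rule are all assumptions made for illustration, not part of any OpenAI library.

```python
import re
from typing import Iterable, Iterator

# Assumed sentence boundary: '.', '!' or '?' followed by whitespace.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")

TTS_MAX_CHARS = 4096  # character limit mentioned above for the TTS endpoint


def chunk_for_tts(deltas: Iterable[str], first_sentences: int = 2,
                  max_chars: int = TTS_MAX_CHARS) -> Iterator[str]:
    """Yield text chunks that could each be sent as one TTS request.

    `deltas` is any iterable of streamed text pieces (hypothetical input,
    e.g. the text fragments of a streamed chat completion).
    """
    if max_chars < 1:
        raise ValueError("max_chars must be at least 1")

    buffer = ""
    first_sent = False
    for delta in deltas:
        buffer += delta
        if not first_sent:
            # Positions just past each confirmed sentence boundary.
            ends = [m.end() for m in _SENTENCE_END.finditer(buffer)]
            if len(ends) >= first_sentences:
                cut = ends[first_sentences - 1]
                yield buffer[:cut].strip()
                buffer = buffer[cut:]
                first_sent = True

    # Stream finished: emit whatever is left, respecting max_chars.
    rest = buffer.strip()
    while rest:
        if len(rest) <= max_chars:
            yield rest
            break
        # Prefer the last sentence boundary that fits; otherwise hard-cut.
        ends = [m.end() for m in _SENTENCE_END.finditer(rest[:max_chars])]
        cut = ends[-1] if ends else max_chars
        yield rest[:cut].strip()
        rest = rest[cut:].strip()
```

Each yielded chunk would then go to whichever TTS service you use, so the first audio can start playing while the remainder is still being generated and encoded. With this approach a typical reply costs two TTS requests rather than one per sentence.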

(Also, nobody says you have to use OpenAI’s TTS service…)

teshnizi · November 28, 2023, 12:24am

Ah, you’re right, it’s higher for other tiers. Thanks.
Any recs on good alternatives?

Harmen4096 · December 1, 2023, 4:42pm

I could use this streaming request functionality as well.

EricGT · December 25, 2023, 6:47pm

@teshnizi

Is it OK if a moderator closes this topic?
