GPT-4o Chat Completion with audio response

contato.aifit · May 15, 2024, 7:20am

Hello fellows, I have a system that basically gets the cha completion response, send it back to the api to get the tts response. As you can imagine, the latency here is mind blowing.

Would be possible with the new model, get an audio as response from Chat Completion instead text?

So, when we gonna have it?

MrFriday · May 15, 2024, 7:34am

For that OpenAI has a separate model: https://platform.openai.com/docs/models/tts, you can feed the response generated from Chat Completion to TTS to convert it in an audio.

Not sure if there’s a timeline/roadmap for Chat Completion audio responses.

contato.aifit · May 15, 2024, 8:40am

This solution is the one that I’m using but it’s not productive because the resposiveness goes to the ground.

The user awaits more than 10 seconds to get one response

Roman_L · May 15, 2024, 9:37am

Hi, already using tts-1 model for this also.

Before even having direct audio response, it would be great to have voice with emotions as we saw in the demo -and perhaps also the new female voice used.

But as we can see in doc, there are no new models, only tts-1 & tts-hd with the usual voices.

contato.aifit · May 15, 2024, 9:51am

So sad, because the presentation about the 4o looks like an amazing new things with good audio response and better responsiveness, but…

Roman_L · May 15, 2024, 9:54am

I agree. Moreover seems it won’t come for “normal” users soon :

Developers can also now access GPT-4o in the API as a text and vision model. GPT-4o is 2x faster, half the price, and has 5x higher rate limits compared to GPT-4 Turbo. We plan to launch support for GPT-4o’s new audio and video capabilities to a small group of trusted partners in the API in the coming weeks.

https://openai.com/index/hello-gpt-4o/

anythxwdo · May 24, 2024, 6:58pm

you could solve this problem by breaking down the problem into two little parts…

Topic		Replies	Views
Speech-to-Speech (Audio Input/Output) with 4o API	5	816	October 13, 2024
GPT-4o Audio Access for API API gpt-4o	28	32730	December 13, 2024
Text completion and get voice response Feedback	1	76	September 13, 2024
Audio support in the Chat Completions API Announcements	13	3839	December 12, 2024
When will audio to audio be released for gpt-4o please? API gpt-4o	8	4662	July 2, 2024

GPT-4o Chat Completion with audio response

Related topics