The “Advanced Voice Mode” will be a game changer for the AI-consumer interface I am developing once it becomes available to developers through the API. I searched this forum but can’t find any information on a release date. Does anyone have any insight into when it may be available through the API? Also, should we expect real-time streaming in both directions and the ability to interrupt, like in the demos? A rough sketch of the kind of client I have in mind is below.
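To illustrate what I mean by “streaming both ways with interruption,” here is a minimal full-duplex WebSocket sketch. The endpoint URL, message format, and the audio capture/playback helpers are all placeholders of my own, not anything OpenAI has published.

```python
# Hypothetical sketch of full-duplex voice streaming with barge-in.
# ENDPOINT, the "cancel_response" message, and the audio helpers are
# assumptions for illustration only -- no official API is implied.
import asyncio
import json

import websockets  # pip install websockets

ENDPOINT = "wss://example.invalid/voice"  # placeholder, not a real endpoint


async def capture_chunk() -> bytes:
    # Stand-in for real microphone capture (e.g. via sounddevice).
    await asyncio.sleep(0.02)
    return b"\x00" * 640  # ~20 ms of 16 kHz, 16-bit mono silence


async def play_chunk(chunk) -> None:
    # Stand-in for real speaker playback.
    await asyncio.sleep(0)


async def send_microphone_audio(ws, interrupted: asyncio.Event):
    """Continuously push captured audio chunks to the server."""
    while True:
        chunk = await capture_chunk()
        if interrupted.is_set():
            # Ask the server to stop the current response (barge-in).
            await ws.send(json.dumps({"type": "cancel_response"}))
            interrupted.clear()
        await ws.send(chunk)


async def receive_model_audio(ws, interrupted: asyncio.Event):
    """Play back model audio as it streams in."""
    async for message in ws:
        if interrupted.is_set():
            continue  # drop remaining audio once the user has interrupted
        await play_chunk(message)


async def main():
    interrupted = asyncio.Event()
    async with websockets.connect(ENDPOINT) as ws:
        # Upstream and downstream audio run concurrently, which is what
        # makes interruption mid-response possible.
        await asyncio.gather(
            send_microphone_audio(ws, interrupted),
            receive_model_audio(ws, interrupted),
        )


if __name__ == "__main__":
    asyncio.run(main())
```

The point is simply that both directions have to be open at the same time; whether the real API works this way is exactly what I’m asking about.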
There is no planned API release.
There is no information about it.
I noticed a note saying it will be available at the end of fall; it appears as a small notice when using the normal voice mode.
OpenAI hinted at API availability in their release post:
We plan to launch support for GPT-4o’s new audio and video capabilities to a small group of trusted partners in the API in the coming weeks.
That’s all we know so far.