When do you expect official release of Voice model API for gpt4o?

hkks1205 · June 13, 2024, 7:33am

Hi
Listening to voice messages and responding directly with a voice response in GPT-4o is amazing. I’m curious about when this will be officially released as an API.

supershaneski · June 13, 2024, 8:53am

there is the US presidential election this coming November. given the possibility that it could be used for nefarious reasons even with safeguards, i would bet that it will be released afterward.

AnatolyDyatlov · June 13, 2024, 7:16pm

If they intended to release it after November from the beginning, why did they say it would roll out in the “coming weeks” back in May? It wouldve been more accurate to say “in the coming months”

EdgeLord420 · June 19, 2024, 9:29pm

We have had the ability to create realistic sounding deepfakes of people’s voices and even videos of them for like, at least a year, and that’s just how long it’s been easily accessible to a layman who downloaded an app. People with more knowhow have been doing this for years.

The voice model of GPT-4o is not introducing any potentially democracy destroying tools that we haven’t already had for years. Plus, they said it’s coming out in the coming weeks for alpha testers and will be rolled out in the coming months for all Plus users. 6 months is not exactly what I’d describe as “the coming months”, What are you talking about?

jamsmatto · June 21, 2024, 5:22am

Well, the White House is currently whining about “cheap fakes” (which are actually real videos) circulating the internet, and I presume they’re implying that these “fake” videos (which are actually real) were made by AI. That will spook a lot of the higher-profile AI companies into being very gun-shy about releasing anything in the next six months. OpenAI has already entered a weird phase where they like to talk about all the stuff they have planned, but they don’t seem to want the general public to actually have access to it. So I wouldn’t count on actually getting anything interesting from OpenAI for a while.

everik717 · June 24, 2024, 2:22pm

This is a ridiculous response. Even after November, there will be other future elections, if this were the reason it was getting held back, it wouldn’t be released at all due to this fear.

matt.mckinney12 · June 24, 2024, 5:54pm

They were likely confusing the fact that OpenAI, and others, are waiting until after the election to release their next big model. This is true, but it has nothing to do with “deep fakes”; that’s just PR nonsense. It’s because the industry doesn’t want the society altering capabilities of the next big models to become a negative talking point of the presidential election and candidates arguing over who’s going to implement tougher restrictions for obvious reasons. The voice features of 4o has seemingly been delayed, but it’s not for this reason. 4o is what they wanted to get out now since they’re waiting until after the election to release 5, or what ever they decide to call it.

6f100bc13f004fe3238b · June 25, 2024, 12:23am

Apple is probably going to get the new voice mode for their new Siri AI. They made a deal with OpenAI and now ChatGPT plus subscribers won’t get it.

daydreaminboy · July 31, 2024, 3:52am

Open ai “released” the new voice feature back on May 13th. August rolls around and still no actual release, but we keep getting more videos of open ai employees playing with it. Glad they’re happy i guess.

daydreaminboy · August 14, 2024, 4:59am

For those tired of this, google actually just beat open ai to market. Check out “gemini live”, which is a new chat mode for gemini that is a direct competitor to gpt4o advanced voice. Though to be fair, it’s hard to be a competitor to something not actually released, but still. Open ai slow rolled this so long that google caught up and released first. Gg fellas.

rich11 · August 14, 2024, 8:27am

is there an API yet for Gemini Live?

finance11 · August 22, 2024, 7:17am

how can we use voice to voice in model is there any api to it for gemini live or gpt4o. please someone mention here.

Topic		Replies	Views
GPT-4o New Voice Model, API Release API	21	22255	July 23, 2024
Will the API for the New Voice Be Released Separately? API	4	3010	September 3, 2024
True multimodal gpt4-omni from OpenAI's May Release, when and what? Community gpt-4 , chatgpt , assistants-api	4	670	July 30, 2024
GPT-4o text to speech and speech to text API	19	19081	September 30, 2024
Speech to Speech via API vs. waiting for GPT-4o voice API gpt-4 , assistants-api , speech	2	352	August 9, 2024

When do you expect official release of Voice model API for gpt4o?

Related topics