TTS voices have a clear US accent

wwessels1 · April 2, 2024, 2:13pm

I see in the description on:
https://platform.openai.com/docs/guides/text-to-speech

“You can generate spoken audio in these languages by providing the input text in the language of your choice.”

I am trying to generate Dutch spoken text. This is one of the supported languages.
I think it is brilliant how close the generated voice actually comes to the real thing, but the US Accent is clearly audible. Especially with the ‘r’ and a bit in the ‘z’ sound.

I was wondering … how often are the models updated?
Is there any chance that the multi-language support will be improved anytime soon?

Would it be an idea to add the language, or locale (for Belgium/Dutch or Canadian/French for instance) to the API Call?

Or is there a newer version that I can call? 3.5? 4.0?

darcschnider · April 2, 2024, 2:24pm

There is beta testing for voice cloning in testing. So that could mean an update in the near future. It also means that people will be able to create voices to fit these cases.

wwessels1 · April 2, 2024, 2:27pm

Would be awesome!
I saw some posts from last year with a similar complaint, so users have been wanting this for ages
Thanks for your quick reply!

darcschnider · April 2, 2024, 2:28pm

no problem, haha hanging in the forums today. well waiting for my chatgpt caps to lift lol.

wwessels1 · April 2, 2024, 2:46pm

Oh wow, I just found that Nova is much better (at Dutch) than all the others!!!

darcschnider · April 2, 2024, 2:59pm

well looks like the voices are the same based on message I got, so that information I had seen more than once from asking ai’s was “hallucination” which I find funny that openai models know alot about the code and other things it can look up, including what it can do, but still has moments.

going to have to start asking it for links to data sources lol.

wwessels1 · April 2, 2024, 3:59pm

Great, so now we can add to Nova that she speaks Dutch much better than for instance Onyx

wwessels1 · April 2, 2024, 4:00pm

I shared my code to call TTS in a separate thread:

riccardo.vieri · April 8, 2024, 8:40am

The Italian TTs have a clear us accent. Can we improve that or use any suggestion?

tozhovez · November 25, 2024, 11:25pm

The TTS model works great, reads correctly and with expression.

However, the obvious American accent significantly limits its usability.
For instance, if the input text is in Hebrew, I would prefer the output to have the accent of a native Hebrew speaker.
Is it possible for the TTS model to generate audio without the American accent and with a clearer pronunciation of the specified language?

darcschnider · January 8, 2025, 2:56pm

I dont think its really a flaw, after all the models were probably trained by NA speakers. What you see would require someone with each accent to speak and be trained on to make the models model more diverse and natural.

From api perspective you have other options outside openai.

Examples of some fantastic models that may achieve what you are looking for:

Elevenlab’s They over a lot of models and model mixing as well training of new models. Well not as cheap as openai models they are on the same level and more for realistic and emotional ranges.

You could also try some of these transformer models

This may lead you down the path of creating or fine tuning a model of your own. If you have the hardware.

I think in time openai models will continue to evolve adding more options and styles in order to maintain competitive in a booming market.

sergeliatko · January 8, 2025, 8:45pm

I like that little Glendale touch when she speaks Armenian. (But yeah, would be awesome to get the natural sound).

Topic		Replies	Views
Any plans for releasing an API for TTS? API api , tts	28	5793	November 9, 2023
Can I choose the TTS language? API tts	28	15985	March 30, 2024
Did OpenAI just make a new AI Voice? API	7	2896	May 16, 2024
New model, tts-2, any news on it? (new voice mode) API tts	9	1840	February 21, 2025
TTS API Speed and Quality Issues API api , tts	5	3631	February 6, 2024

TTS voices have a clear US accent

Related topics