[Text to Speech API] Chinese TTS unreliable and unusable

aweb1 · November 7, 2023, 7:51am

Example:

I ask the API to TTS this text:

这部电影我已经看完了。你的作业完成了吗？我们已经吃完晚饭了。

After trying about 100 times, I get the following results:

Half of the time, the audio is unusable: gibberish/glitched audio, almost like if it was a weird mix of english and chinese at the same time
20% of the time, the result is decent but it is missing some words
30% of the time, the result is good.

Tested with tts-1-hd quality and alloy voice.

I’d like to keep this thread up to keep track of the progress if any in the future.

Can we specify the language in the API request at some point in the future? Will it help?

aweb1 · November 9, 2023, 11:25am

The issue is still present, makes the API useless for production.

vb · November 9, 2023, 11:34am

Hi!

This is more of a self-help community where developers as the users of OpenAI services converse with each other.
Also, all of the new features are still in beta and one should be aware of the risks for a production service even when using them for a language like English, for example.

aweb1 · November 9, 2023, 1:02pm

I thought some openAI staff was reading some of the posts! I guess I posted in the wrong place. Thanks for the info

tom.tyiu · January 19, 2024, 8:23pm

Chinese is too fast, you need to slow the audio speed from 1 to 0.95%
Your audio TTS

dignity_for_all · January 20, 2024, 7:25am

This phenomenon seems to occur in languages other than English, likely Asian languages.

It appears that using tts-1 instead of tts-1-hd results in fewer issues. In Japanese, there were hardly any such issues with tts-1.

I hope this helps some of you!

Topic		Replies	Views
TTS API Speed and Quality Issues API api , tts	5	1205	February 6, 2024
Huge problems with TTS API Bugs tts	2	207	April 28, 2024
Can I specify the language of TTS voices with a bit ambiguous input? API tts	1	109	April 23, 2024
Getting metallic voice at slower speeds on speech API API api , tts	2	1067	November 14, 2023
Is a high latency for a response with a prompt in other languages than English normal? API	2	387	October 26, 2023

[Text to Speech API] Chinese TTS unreliable and unusable

Related Topics