Can I specify the language of TTS voices with a bit ambiguous input?

soc.nwa · April 23, 2024, 6:38am

Hi all! I am now playing with the TTS APIs. I would like to generate voices from Japanese input sentences.

But as we know, Japanese has a lot characters in common with the Chinese sentences. Sometimes, in fact, they looks exactly same. I found that the TTS API then outputs sometimes a mix of Japanese and Chinese or even solely Chinese, while I want a Japanese output. Is there any way to tune the parameters a bit more to handle this problem? Or this is what we have and we need to wait a bit for future additional functionalities?

Thanks in advance for answering this!

vb · April 23, 2024, 7:32am

Hi!

Currently it is not possible to set the language for the TTS endpoint.

I would give it a shot with a workaround:
Add a short sentence to the beginning of your text that tells the model that the language is Japanese only.

‘The following text is Japanese.’

Then add a additional pause between this first sentence and the actual text:.

After retrieving the speech from the API split the audio at the first occurrence of silence, for example using pydub.

It would be better if we could prompt the model to use a specific language explicitly.

aprendendo.next · August 24, 2024, 12:08pm

Is there a way to force a pause or longer delay between sentences?

Topic		Replies	Views
How to hint the language used for the Text-to-speech (TTS) in GPTs? Plugins / Actions builders gpt-4 , chatgpt , custom-gpt , gpts	3	2552	June 5, 2024
Is it possible to specify output language in text-to-speech? API speech	2	216	December 2, 2024
How to Fine-Tune Pronunciation with OpenAI's Text-to-Speech API? API tts	1	303	March 6, 2025
Can I choose the TTS language? API tts	28	16249	March 30, 2024
Possible to specify input language for ASR API calls? Getting Inconsistent results using translations endpoint API	0	333	November 29, 2023

Can I specify the language of TTS voices with a bit ambiguous input?

Related topics