Dropping Numbers With TTS API while Generating Speech

dixit.malia · March 5, 2024, 6:56am

I’m utilizing a Text-to-Speech (TTS) API to generate speech from text that includes phone numbers and other numerical content. However, after generating the speech, some of the numbers are missing. I’ve experimented with various formats and types of input, such as:

1234567890
1-2-3-4-5-6-7-8-9-0
one two three four five six seven eight nine zero
1 2 3 4 5 6 7 8 9 0

Despite trying these variations, the speech output consistently omits certain numbers. For instance, instead of saying ‘1234567890,’ it might say ‘12345670,’ and so on. I’ve generated batches of 10 files each time, and the error rate ranges from 50% to 80%, meaning that out of 10 files, 5 to 8 files are missing the numbers.

Could anyone provide insights on how to resolve this issue?

maiconsanson · March 18, 2024, 4:55pm

Not only numbers but sometimes it cuts off some syllables too.
Have you tried formatting the numbers with commas, periods, or ellipses?
Or even with a break line like \n?

Also, splitting a long text into smaller parts could help in a more accurate result.

johncain194 · March 18, 2024, 5:00pm

Use other providers for more complex/longer solution, especially with the current situation of “degraded GPT4”, like Claude 3 (the highest tier)
Use other TTS (preferably your own) and train your own.
Lastly, use simpler prompts and simpler words/arrangement of numbers. Mix and match, treat the AI like a child.

dixit.malia · March 19, 2024, 5:01am

As stated in the discussion, I actually did separate words and numbers using commas, periods, and so on.
and Secodonly Rewriting the sentence in smaller chunks will affect how it is used in terms of delay and voice quality.

Topic		Replies	Views
Text To Speech (tts-1) dropping numbers when reading numbered lists Bugs api , tts	3	2085	January 14, 2025
Issue with Incomplete Audio Output Using OpenAI's tts-1 Model API tts	2	964	May 31, 2024
[Realtime API] Audio Output Numbers Wrong Bugs realtime	3	364	March 17, 2025
Huge problems with TTS API Bugs tts	4	2152	May 27, 2024
Why is realtime model so bad at understanding sequences of numbers? API realtime	17	1694	April 28, 2025

Dropping Numbers With TTS API while Generating Speech

Related topics