New TTS model (gpt-4o-mini-tts) Ignoring Speed parameter

bret1 · March 28, 2025, 6:56pm

I am passing the correct speed value to the API but it is being ignored. The voices always play back at the default speed regardless of which speed value is specified. This appears to affect only gpt-4o-mini-tts, not tts-1.

sharjeelfaiq · May 1, 2025, 11:32am

Hi. Did you get this issue resolved? I’m also working with gpt-4o-mini-tts API and unable to configure the desired speed, even though the speed value is correctly passed to the API in the request payload.

sps · May 1, 2025, 2:17pm

I can confirm that I was also able to reproduce this on my side. Thank you @bret1 and @sharjeelfaiq for reporting this issue.

sharjeelfaiq · May 1, 2025, 2:36pm

Do you have any idea how soon the issue is expected to get resolved?

aprendendo.next · May 1, 2025, 2:42pm

I think the new model just works differently; it’s not a bug.

Actually, the tts-1 model has a bug: it creates the slower version by just stretching the audio, which loses bitrate.

The new model will actually generate the speech as a person talking slower or faster.

For example, put the speed in the instructions: “speak very very fast” or “speak a bit slow and paused”. The model will then generate the speech at normal quality regardless of the speed.
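Here is a minimal sketch of what such a request body might look like. It only builds the JSON for the `/v1/audio/speech` endpoint and does not send it; the helper name `build_speech_request` and the `alloy` default voice are my own choices for illustration.

```python
import json


def build_speech_request(text, pace_instruction, voice="alloy"):
    """Build a request body that carries pacing in `instructions`.

    Hypothetical helper: no `speed` field is included, since the pace
    is described in words instead.
    """
    return {
        "model": "gpt-4o-mini-tts",
        "input": text,
        "voice": voice,
        "instructions": pace_instruction,
    }


payload = build_speech_request("Hello there.", "Speak very, very fast.")
print(json.dumps(payload, indent=2))
```

The same body can be passed to whichever HTTP client or SDK the app already uses.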

I actually liked this improvement a lot, as the old model would sound strange and metallic when the speed was changed from normal.

bret1 · May 1, 2025, 2:48pm

Yes, but it would be nice if the speed values documented in the API were still supported (perhaps translated to a prompt behind the scenes). This would maintain compatibility with existing TTS apps and wouldn’t require a paradigm shift.

And regardless, I did try experimenting with changing the speed via prompt and it was very inconsistent. It didn’t always work. Another issue with the new voices is that they can sound like completely different speakers from one API call to the next.

aprendendo.next · May 1, 2025, 2:51pm

Certainly, you are right on that. I guess they should change the docs to reflect that.

Meanwhile, I think it would have to be done programmatically inside the app.
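A rough sketch of that in-app translation, assuming the app already has a numeric speed in the 0.25 to 4.0 range used by the `speed` parameter. The function name, the thresholds, and the phrases are all made up for illustration and would need tuning by ear; nothing here guarantees the model follows the instruction consistently.

```python
def speed_to_instruction(speed: float) -> str:
    """Map a numeric speed to a pacing phrase for `instructions`.

    Hypothetical mapping: the cut-off points and wording are guesses.
    """
    if not 0.25 <= speed <= 4.0:
        raise ValueError("speed must be between 0.25 and 4.0")
    if speed < 0.75:
        return "Speak slowly, with clear pauses between phrases."
    if speed < 0.95:
        return "Speak a bit slower than normal."
    if speed <= 1.05:
        return "Speak at a natural, conversational pace."
    if speed <= 1.5:
        return "Speak a bit faster than normal."
    return "Speak very fast."


for value in (0.5, 1.0, 2.0):
    print(value, "->", speed_to_instruction(value))
```

Bucketing the value into a few phrases keeps the prompt wording fixed per bucket, so at least the instruction text itself does not vary between calls.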

bret1 · May 1, 2025, 2:58pm

I tried that. I was either prompting them wrong or the model doesn’t follow instructions well. It was very inconsistent. And it also seemed to increase the likelihood of the voices sounding like different speakers between API calls.

aprendendo.next · May 1, 2025, 3:19pm

Yeah, I noticed that too, and sometimes there are still some strange pauses. There is a lot of room for improvement.

For my use cases, which are not that demanding, it was a good update overall.

But if you need consistency, the tts-1 model is still better.

Hopefully they will provide us with a fix soon.

gokulraya · May 1, 2025, 6:36pm

Thanks for raising this! I’ve flagged this to the team to look into. We’ll update back once we have more info.

gokulraya · May 2, 2025, 4:38pm

Update:
The speed parameter is not supported for gpt-4o-mini-tts currently. This was a bug in our documentation which has been updated. Thanks again for flagging this!
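For apps that share one code path across TTS models, a small guard along these lines could drop the unsupported field before the request is sent. This is only a sketch: the set `MODELS_WITHOUT_SPEED` and the helper `normalize_tts_params` are hypothetical names, and the set reflects only what this update states.

```python
# Models for which `speed` is not supported, per the update above.
MODELS_WITHOUT_SPEED = {"gpt-4o-mini-tts"}


def normalize_tts_params(params: dict) -> dict:
    """Return a copy of the request params without an unsupported `speed`.

    Hypothetical helper: the input dict is left unmodified.
    """
    cleaned = dict(params)
    if cleaned.get("model") in MODELS_WITHOUT_SPEED:
        cleaned.pop("speed", None)
    return cleaned


new_model = normalize_tts_params(
    {"model": "gpt-4o-mini-tts", "input": "Hi", "voice": "alloy", "speed": 1.5}
)
old_model = normalize_tts_params(
    {"model": "tts-1", "input": "Hi", "voice": "alloy", "speed": 1.5}
)
print(new_model)
print(old_model)
```

With this, tts-1 requests keep their `speed` value while gpt-4o-mini-tts requests go out without it.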

bret1 · May 4, 2025, 2:32pm

You’re welcome. What about the inconsistency of the voice between API calls? And its inconsistent direction following in terms of pitch and speed? I wanted to offer the new voices in my TTS app but cannot due to the inconsistency, which is unfortunate because I’m sure they’re great in other respects.

aprendendo.next · May 4, 2025, 2:45pm

Without disregarding the issue, I think adding more instructions makes the output a little more stable, as seen on openai.fm
