"Speak faster" instructions that work for Real Time API?

erik16 · October 8, 2024, 4:14am

The standard pace of each of the voices is slower than most people talk. When you’re in a conversation and you ask it to speak faster, it will do so for the next sentences.

However, I’ve not been able to get it to speak faster from the beginning, regardless of the words I use in the instructions - except the over the top “kid on sugar” example, that has a moderate effect.

Anyone who’s had success with specific instructions?

jeffreyhuhao · October 9, 2024, 1:56pm

I have same issue too, the real time api sounds very unnatural and slow, at least compared to advanced voice mode

erik16 · October 16, 2024, 3:50am

Just bumping this up one more time. Considering the likes it got, @jeffreyhuhao and I must not be the only ones running into this. Did anyone find an answer?

josh.r.tabor · November 22, 2024, 2:28am

Same problem here. Advanced voice mode does it with the same prompt.

jill7 · December 9, 2024, 9:45pm

I also am looking for a way to make the voice faster! Ideally it could be controlled by the end user via a slider → param input, but would settle for faster across the board.

anon25271712 · December 10, 2024, 1:22am

oh, i did that once, but it made voice squeaky and making it slower made it deep, something to do with the frequency, i had a github pr that i closed a while back that did that

dannymac · January 21, 2025, 5:49pm

Dealing with the same here. Anyone find a solution?

divya.r · January 22, 2025, 3:18pm

We’re experiencing the same issue and haven’t found a useful solution yet. I tried supplying additional system messages like “speak faster than normal” during response.create as custome context, but that didn’t seem to help much either. In general, my observation is that as the conversation progresses, it tends to lose its ability to consistently follow the supplied instructions.

j.wischnat · January 22, 2025, 3:30pm

This is the samplerate. While this does make the playback faster and slower, typically this is probably not the approach that most people here want.

I think we want it to generate the voice speaking faster natively, without needing to speed it up by manipulating the audio ourselves.

I haven’t tested this yet, but the newer voices should do this just fine, right?
Maybe you can get a good outcome by having the prompt as

You feel uneasy and have to speak incredibly fast as the current situation is very stressful and youre on a deadline to finish it.

Gaslighting AI, if you will.

Cheers!

Ancient · February 6, 2025, 3:59pm

Anyone get the speed figured out?

grantlicomm100 · April 14, 2025, 10:25pm

Facing the same problem here. I tried these prompt elements:

Speak faster
Speak as if the matter is urgent
Speak at normal conversational pace
None of these make any of the voices sound conversational, they all sound slow and robotic.

Anybody found a good solution yet?

Topic		Replies	Views
Is it possible to adjust the volume or speed of audio in the realtime api? API api , realtime , api-realtime , api-realtime-speech	4	808	February 6, 2025
New TTS model (gpt-4o-mini-tts) Ignoring Speed parameter Feedback api , tts	12	233	May 4, 2025
Huge problems with TTS API Bugs tts	4	1891	May 27, 2024
Assistant API Performance is Very Slow API plugin-development , api	10	5288	March 7, 2024
Realtime API voice too slow Bugs realtime , api-realtime-speech	1	204	April 10, 2025

"Speak faster" instructions that work for Real Time API?

Related topics