How to make GPT (Voice) allow user more time to talk before replying

mandreou7 · December 24, 2023, 7:31pm

I’m trying to develop GPT into a conversational AI but it replies too quickly, cutting me off while I’m still talking at a human pace. This makes GPT by voice unnatural & intrusive. My questions are:

1. How can I adjust the speech speed of GPT's responses?

2. Is there a way to allow me more time to talk before GPT responds?

advafzalhosen · December 24, 2023, 7:47pm

Continuing the discussion from How to make GPT (Voice) allow user more time to talk before replying:

mandreou7 · December 24, 2023, 8:08pm

Did I not open it in the correct thread?
If not, please tell me where the best place to post it

Cristian74 · December 25, 2023, 4:24pm

I know what you mean OP, it forces you to talk quite un-naturally at times. You could code a solution whereby the transcription of your speech to text doesn’t stop until you do something, similar to how you hold down a trigger when you talk into a walkie talkie, you could press or select something to trigger your voice being captured, and that capture only stops when you release the button (or whatever you first selected). The speech will then be transcribed to text to be sent to chatgpt.

murphylawson2020 · February 21, 2024, 10:33pm

So I’m not sure if you’re still looking for something to accommodate your needs for a longer delay between processing and speech. However, I just downloaded a bunch of different AI voice chat apps. Specifically in search of something similar to chat GPT but with all the little quirks and specifics that I want. One of them is called “voiceGPT”. In the settings, it gives you the option to customize your own wake and stop word. I literally just downloaded and personalized my settings. It honestly seems a little too good to be true. So, I suppose fingers crossed and we shall see. Hope this helps

JulesHancock · April 5, 2024, 10:13pm

How did it go?
I’m suprised this isn’t more crucial or a setting within Chat itself.
It’s IMPERATIVE it gives me more time.
I’m getting anxiety from being interrupted constantly. It is unhelpful and if I can’t get a fix I will discontinue use sadly.

jamesdeluk · May 15, 2024, 9:35am

There is the manual override feature, but it’s annoying to have to use it every time. It really feels like ChatGPT has taken a little too much white powder.

I like the idea of allowing a stop word, as per @murphylawson2020, or a customisable delay.

etherealssound · September 28, 2024, 11:03pm

The best solution I’ve found rn is to tell it to not reply to you until you indicate that you are done speaking. Like I had to tell mine not to reply until I said « done » it would still interrupt me but it just wouldn’t say anything

anon10827405 · September 28, 2024, 11:29pm

+1 for this. Works great for me

mitchell_d00 · September 29, 2024, 1:11am

The way I do it is I set an end word like “over” you can also set it up for two devices to talk to each other by having the first one to describe the over rules to the second.

clackm · November 26, 2024, 3:53pm

I had her put to memory that I am a slow speaker and take time formulating my words sometimes and that I don’t like her finishing my sentences. That helped substantially.

mitchell_d00 · November 26, 2024, 4:01pm

You can tell it a stop word like “over” if you master that you can get two to talk to each-other using “over” rules . My iPad runs games for my iPhone…

_j · November 26, 2024, 7:08pm

I think you are fooling yourself if you think that the AI is actively listening to what you say or waiting for you to say the right thing.

The generation trigger is just a gap in voice activity detection.

mitchell_d00 · November 26, 2024, 7:14pm

I often fool myself, I am myself’s biggest fool. Your interest in me is flattering
Enjoy your cup …

mitchell_d00 · November 26, 2024, 7:22pm

That works… on my end if you make them say or use an end word like stop in mores code you can indeed get them to wait until you say it, it stops them from running off if you take a breath @_j you always add such insight but you like to go off topic when it comes to me?

Almost 80% of what we suggest is a walk around..

Or is an official fix out I am unaware of?

_j · November 26, 2024, 7:29pm

Use of the realtime API exposes you to how the “advanced voice mode” actually works.

mitchell_d00 · November 26, 2024, 7:35pm

Then why does it work in GPT? If you tell it to listen to you until you say “stop” it don’t just start talking when you take a pause? This is GPT not API.

mitchell_d00 · November 26, 2024, 7:43pm

GPT don’t have temperature or any under the hood controls like API it is all controlled in instructions knowledge and actions. There is no slider for GPT but it will wait if you give it a “stop” word instead of instant response at pause…

_j · November 26, 2024, 8:23pm

What they don’t have is an ability to listen to a response buffer in realtime to decide when they should cut in or keep on listening with “understanding”. Nor the ability to not produce a response when it is by silence that invokes a ‘create generation’.

futile.

ChatGPT absolutely operates with models with OpenAI’s own parameters that they think are best for general purpose inference. You don’t get to say “set your temperature high for creativity” nor do you get the buttons like copilot, because it has been decided for you. The wait time for accumulation of silence to decide to send is a similar preset - a comprimise between seeming unresponsive and seeming to interrupt any pause in speaking.

It produces an illusion of listening to you, instead of there just being an automatic “send” button, just like it produces an illusion that there is a speaking entity at all instead of a completion prediction that generates audio spectrographic tokens on a machine learning pattern.

mitchell_d00 · November 26, 2024, 8:24pm

Yes it is not muted but it don’t instant respond it is recording silence at pauses but in GPT telling it to wait for a stop word makes it not instantly respond to a pause…

Illusion or not it works in function…

Say to not respond until I say stop … then talk and pause then talk…

The work around don’t work on AVM you can still use it in custom action and instructions in a custom but standard gpt4o AVM you can’t control it like the old white dot one.

Topic		Replies	Views
Can (custom) GPT speak and respond via voice? Community gpt-4 , api , chatgpt-plugin	15	13907	September 29, 2024
Advanced voice mode constantly interrupting me Prompting gpt-4 , advanced-voice	18	2360	July 14, 2025
Did OpenAI just make a new AI Voice? API	7	3079	May 16, 2024
Introduction message, get the AI to pause API api-realtime	5	368	December 15, 2024
Is it normal for ChatGPT-4 to replicate and intermittently repeat human laughs and coughs in unrelated conversations after using the ‘speak’ feature to analyze them? Community gpt-4	4	959	November 12, 2024

How to make GPT (Voice) allow user more time to talk before replying

Related topics