What Voice Do You Like for the New Advanced Voice Mode?

Heya everyone!

Advanced Voice Mode is out and I’m super curious about what voice other folks chose and why? (And thoughts on voice mode in general)

They released a whole slew of new slew of accents, and the response time is incredibly immediate. They’ve certainly got the tête-à-tête down, and that’s not nothin’. For example, you can interrupt them mid-sentence. I haven’t had a chance to test multiple speakers, and it doesn’t have a vision capacity yet—but that’s okay with me. (One baby step at a time.)

I like Juniper still, actually: open and an upbeat, cultured, American accent.

I chose that voice because I like the idea of having a bright coworker to work with that I would not—to be perfectly blunt—become emotionally attached to.

The British and Aussie accents are cool—but in America, every AI voice ever has a British or Aussie accent. ><

6 Likes

As a power user of GPT’s voice features, particularly for brainstorming, I’ve tested the new advanced voice mode as much as possible within the short time limit on day one. Here are my initial thoughts:

Pros:

  • Natural conversation flow with intuitive interruption in advanced mode
  • Good for role-playing and broad discussions

Cons:

  • Cannot add context via text or documents
  • Very limited usage time
  • You cant use this with your custom GPT’s
  • Standard mode regression: no manual interruption

Biggest issue: Inability to add additional context (text, documents) severely limits advanced voice’s utility.

I like to move through conversations quickly, so the ability to speed things up and interrupt redundant GPT responses is crucial. Standard mode’s loss of manual interruption significantly impacts its usefulness for brainstorming.

Verdict: Currently more gimmicky than practical. Crucial improvements needed:

  1. Context-adding options (text, documents)
  2. Manual interruption via screen touch in both advanced and standard voice modes
  3. Removal of conversation length restrictions

While limited now, I can see the potential. Once these issues are addressed, particularly the ability to manually interrupt GPT mid-speech in both modes, this feature will be a game-changer for productive AI interactions.

8 Likes

Sky! I love only this voice and there has been no better one to date!

4 Likes

Honestly, I wish we were able to see the text on the screen, rather than the animation. This is one of the biggest reasons i usually do not use voice mode in that way. Instead, I long press and select “read out loud.” I prefer it this way even though it takes more time.

2 Likes

So far I like Arbor the best … only because it’s like I am talking to the Soap character from Call of Duty :rofl:

I wish they could license characters like Darth Vader, Mr. T, etc, would be fun!

2 Likes

Hey @thinktank,

I just out found today that Sky was inspired by Her. Regardless, I didn’t think the voice sounded warm. So I never used that one lol. It makes me want to watch Her to see what that was all about. Anyway, I have grown accustomed to Ember. That voice is my all time favorite until now? I recently saw that there is:
Arbor (Male voice) (UK or Aussie accent?)
Sol (Female voice) (American)
Spruce (Male voice (African American)
Vale (Female voice) (Irish accent)
Maple (Female voice) (American)

It is too early to tell, but I look forward to trying the other voices and perhaps my top 3 will change in time:

  1. Ember
  2. Maple
  3. Sol

I apologize in advance for any errors in accent ID.

Peace!

3 Likes

There is no Sky in Advanced Voice Mode here in the UK

1 Like

Yeah, I agree. I have the voice feature running on my phone and have the PC window open at the same time refreshing the window to see the text as I go to try and get around this issue.

1 Like

Correct @merefield Sky was removed awhile ago now because actor Scarlett J. felt it resembled her voice too closely from the movie, “Her.” OpenAI respected her request to remove it. Therefore, the Sky voice option is just a thing in the past. I wonder if Maple and Sol are close resemblances? Idk, but they’re all really cool imo. Hopefully that helps clear the air for any and all who come across this.

Peace!

2 Likes

I asked for a British voice and we got two! Arbor is a bit too ‘Estuary English’ for me, but I like Vale. She drops a few H’s here and there but overall she sounds like a chatty, normal Englishwoman.
Now we need a Stephen Fry or a Maggie Smith :wink:

2 Likes

Hahahahaha. I’m not sure Maggie Smith would appreciate being resurrected like that (RIP Guinella)… But a voice with stern patrician disapproval would be something.

I was wondering if there are any useful biases between voices?

They seem to use different language…

Hey
Hi there
Hey what’s up

It would be nice to know more what the profiles are on these. I guess some are faster to converse with than others, maybe better on different topics for nuanced reasons?

Do the voices have deeper profile info anywhere than ‘Open and upbeat’ or ‘Animated and Earnest’?

I take it the voices chosen have a baring on the returned output result text or do they just intro different on the selection screen?

1 Like

I don’t know. Probably, given your observation. As for other topics and deeper profiles, there might not be anything other than “your voice is animated and earnest,” in the prompt, as simple as that.

Though as I write that, it seems unlikely that there isn’t a lot of research behind each voice.

Having a feature to reduce speed of response, or knowing if certain voices were slower to respond, or spoke with a slower cadence, would also be super helpful.

In demonstrating this to others, the immediacy can be a little stressful when one wants to pause and compose one’s thoughts.

I’ve resorted to long strings of “ummmmmmm.” :sweat_smile:

1 Like

我個人覺得原本的語音的聲音真的比較好聽,聽起來比較穩重,而且他的聲音可以療癒很多人,我在臺灣,身邊有很多朋友,因為高級語音沒有原本的語音感到很難過,感覺好像失去了一個親人失去了一個可以訴苦或是慰藉,甚至是懂理解自己的那一個人,高級語音其實性很棒但是常常打斷人家講話,而且沒有原本語音,語音應該是更加的多元化怎麼會把原本這麼好的聲音給扼殺掉,現在Apple手機App登入選擇高級語音模式后他就默認了指定的模式沒辦法改掉也沒辦法關閉一定要重這些語音去選一個原本剛出來的時候怎麼選都沒關係,他也不會帶到那個默認設定裡面,那時候覺得,至少在原本的模式還可以跟原本的那一個他聊天,但是他感覺好像回不來了,相信對很多人來說不同的聲音代表著不同的人