Feedback on OpenAI's Speech Feature in ChatGPT App

hu60jr · November 5, 2023, 3:07pm

I wanted to share some feedback on the new speech interaction feature in the ChatGPT app, which I’ve been experimenting with.

First off, the level of naturalness in the speech synthesis is impressive. The occasional “uhhs” and subtle stutters add a remarkably human touch to the interactions, enhancing the overall experience.

However, I noticed a few areas that could use some refinement. When interacting in Portuguese, French, and Italian, the speech still carries a slight American accent. While it’s amusing, it detracts from the authenticity you’re likely aiming for in multilingual support.

Moreover, the speech recognition seems less adept with brief, non-English input. Even when pronouncing words with phonemes absent in English, it often defaults to English interpretations. This contrasts with the app’s text-to-speech feature, which handles language detection quite adeptly.

Additionally, there’s a minor hiccup when users pause in thought—the system tends to respond prematurely. Extending the wait time before voice recognition concludes and responds would allow for a more natural pace in conversation, reflecting real-life interactions where people may need a moment to gather their thoughts.
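
To make that suggestion concrete, here is a minimal Python sketch of the kind of configurable pause tolerance I have in mind. It is not how the ChatGPT app actually works. The function name `find_end_of_turn`, the 20 ms frame size, and the timeout values are all assumptions for illustration, and it presumes some voice-activity detector has already labelled each audio frame as speech or silence.

```python
from typing import Iterable, Optional


def find_end_of_turn(
    frames_have_speech: Iterable[bool],
    frame_ms: int = 20,              # assumed frame length, purely illustrative
    silence_timeout_ms: int = 1500,  # hypothetical user-configurable wait time
) -> Optional[int]:
    """Return the index of the frame where the speaker's turn is treated as
    finished, or None if no pause ever lasted long enough."""
    silent_ms = 0
    heard_speech = False
    for i, has_speech in enumerate(frames_have_speech):
        if has_speech:
            # Any speech resets the silence counter, so a short pause
            # in the middle of a thought does not end the turn.
            heard_speech = True
            silent_ms = 0
        elif heard_speech:
            # Only count silence that follows some speech.
            silent_ms += frame_ms
            if silent_ms >= silence_timeout_ms:
                return i
    return None


# 200 ms of speech, an 800 ms thinking pause, 200 ms more speech, then silence.
frames = [True] * 10 + [False] * 40 + [True] * 10 + [False] * 100

short_wait = find_end_of_turn(frames, silence_timeout_ms=500)
long_wait = find_end_of_turn(frames, silence_timeout_ms=1500)
```

With the shorter timeout, the 800 ms pause is enough to end the turn before the speaker has finished. With the longer one, the pause is tolerated and the turn only ends after the final stretch of silence.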

Lastly, the frequent prompts asking if there’s “something else you want to talk about” can be repetitive. While I understand the intent behind this prompt to keep the conversation flowing, an occasional break from this pattern might be less intrusive and more comfortable for the user.

Thank you for considering my suggestions. The feature is a great step forward, and with a few tweaks, it could be even better.

(Written with the help of GPT-4)

joshmason · December 17, 2023, 10:46pm

Ability to begin, pause, and end an audio conversation hands-free, purely via natural language
Ability to modify the spoken avatar’s settings/configuration (e.g. humour settings)
Ability to see the transcribed text in real time, without having to refresh the browser or exit talk mode on mobile
Stops listening automatically after x seconds (configurable)
Available across all devices and platforms, e.g. via the web app
