Voice correspondences between ChatGPT and Realtime API

f10w · December 5, 2024, 3:39pm

In the ChatGPT app, with the advance voice mode, there are the following voices available:

Arbor - Easygoing and versatile
Breeze - Animated and earnest
Cove - Composed and direct
Ember - Confident and optimistic
Juniper - Open and upbeat
Maple - Cheerful and candid
Sol - Savvy and relaxed
Spruce - Calm and affirming
Vale - Bright and inquisitive

If I’m not mistaken, the advanced voice mode is built with the Realtime model:

There are 8 voices available for use with the Realtime API, which offers the following voices:

alloy
echo
shimmer
ash
ballad
coral
sage
verse

My question: Is there a correspondence between these two lists and why are they different?

Thank you in advance for your reply!

jpoel · February 11, 2025, 12:35am

The advanced voice mode is definitely not built ontop of the realtime api. The performance of advanced voice mode is far superior.

f10w · February 11, 2025, 11:00am

How do you know that? It makes sense that the AVM is built on top of the realtime API (except that some voices, such as Sol, are not released to the public).

In the same spirit, ChatGPT should also be built on top of the API. If your chat app is not as good as ChatGPT, that doesn’t mean ChatGPT is not built on top of the API.

Topic		Replies	Views
Realtime voices not the same as ChatGPT App API	3	534	October 21, 2024
Voice differences between Realtime API and Text-to-Speech API realtime , api-realtime	1	1298	January 8, 2025
Realtime API nerfed vs Advanced Voice Mode? Feedback realtime	10	2137	February 11, 2025
Real Time API Voices Are Worse Than The Voice on ChatGPT Feedback	2	501	February 14, 2025
Advanced Voice Mode for API API	22	19089	October 5, 2024

Voice correspondences between ChatGPT and Realtime API

Related topics