These voices are now in the changelog and committed to the Python library.
Was:

Supported voices are alloy, echo, fable, onyx, nova, and shimmer.

Now:

Supported voices are alloy, ash, ballad, coral, echo, sage …
From the Python commit: the new list is enforced and allowed only as of openai-1.53.0.

Now operational (and do the others continue?)
Voice samples:

- alloy
- ash (new)
- ballad (new)
- coral (new)
- echo
- sage (new)
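For reference, the shape of a call that requests one of these voices: a minimal sketch of building the request kwargs for `client.chat.completions.create(...)`, assuming the `gpt-4o-audio-preview` model and openai-python >= 1.53.0. The voice set below is only the set of voices mentioned in this post, not the SDK's full enforced list.

```python
# Sketch: assemble a Chat Completions request that asks for spoken output.
# Assumes model "gpt-4o-audio-preview" and openai>=1.53.0; KNOWN_VOICES is
# just the voices discussed in this post, not the SDK's authoritative list.
NEW_VOICES = {"ash", "ballad", "coral", "sage"}
KNOWN_VOICES = {"alloy", "echo", "fable", "shimmer"} | NEW_VOICES

def build_audio_request(voice: str, user_text: str) -> dict:
    """Return kwargs for client.chat.completions.create(**kwargs)."""
    if voice not in KNOWN_VOICES:
        raise ValueError(f"voice {voice!r} is not among the voices discussed here")
    return {
        "model": "gpt-4o-audio-preview",
        "modalities": ["text", "audio"],           # ask for transcript + speech
        "audio": {"voice": voice, "format": "wav"},
        "messages": [{"role": "user", "content": user_text}],
    }

kwargs = build_audio_request("ash", "Say hello in one sentence.")
```

Pass the result to `client.chat.completions.create(**kwargs)`; when audio is actually produced, the base64 WAV arrives on `choices[0].message.audio.data`.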
Despite the voices claiming dialects or accents in some of the statements above, unprompted, this is not a quality that can be drawn out of the model by prompting.
The prior voices still seem to work; some new samples:

- fable
- shimmer
Prompted tune-ups to style
I note that you can try all you want, but almost all attempts to get textual Chat Completions calls to answer in a voice beyond the first message fail, returning 'audio_tokens': 0 – and some produce no voice from the start, despite it being a promised modality.
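A quick way to spot that silent-failure mode is to check the usage details on the response. A sketch, assuming the usage object is read as a dict; the sample payload here is made up to mirror the `'audio_tokens': 0` result described above.

```python
# Sketch: detect the failure mode where a turn returns text only and usage
# reports 'audio_tokens': 0. The dict mirrors the Chat Completions usage
# object's completion_tokens_details; the sample payload is fabricated.
def spoke_aloud(usage: dict) -> bool:
    """True only if the completion actually produced audio tokens."""
    details = usage.get("completion_tokens_details", {})
    return details.get("audio_tokens", 0) > 0

usage_from_failed_turn = {"completion_tokens_details": {"audio_tokens": 0}}
print(spoke_aloud(usage_from_failed_turn))  # → False
```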
Prompt as clear as can be:

```
You are Shimmer, a GPT-4 large language model trained by OpenAI.
Knowledge cutoff: 2023-04
Current date: 2024-10-31
Image input capabilities: Enabled
Voice output capability: Enabled

# Responses

## voice

Important: You have multimodal voice capability, and you use voice exclusively to respond.
- Remember: text -> text -> text ... = assistant voice audio response always!
- Earlier assistant chat is a transcript of what was spoken aloud.
```
OpenAI also blocks conditioning speech with prerecorded assistant audio to activate or maintain a voice. Within a chat, you are not allowed to 0-shot the model into speaking differently – or into speaking at all. The only thing they offer is an expiring audio "id" from a prior turn that you send back to replay the same voice OpenAI stored, with no true statefulness across the total input turns. You will not be able to return to a voice chat.
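The message shape for that continuation, as a sketch: the prior assistant turn is referenced by its stored audio id rather than by a text transcript. The id and texts here are placeholders, and the id expires server-side as noted above.

```python
# Sketch: continue an audio chat the only way offered – by echoing back the
# expiring audio id from the prior assistant turn. "audio_abc123" and the
# message texts are placeholders, not real values.
def follow_up_messages(prior_audio_id: str, next_user_text: str) -> list[dict]:
    return [
        {"role": "user", "content": "Say hello."},
        # Reference the stored audio instead of supplying a transcript:
        {"role": "assistant", "audio": {"id": prior_audio_id}},
        {"role": "user", "content": next_user_text},
    ]

msgs = follow_up_messages("audio_abc123", "Now say goodbye.")
```

Once that id expires, there is nothing to send back, which is why you cannot return to an old voice chat.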