Ballad voice inconsistent tone/personality changes in Realtime API (mini) — anyone else?

We’re running a voicebot product built on top of the OpenAI Realtime API (currently using gpt-realtime-mini). We have noticed that the Ballad voice is changing tone mid-conversation or between calls — shifting from its expected calm/warm style to something more neutral or flat, without any change on our end.

What we’re seeing:

  • Same system prompt, same session config

  • Ballad sounds noticeably different across calls — sometimes warm/expressive, sometimes flat

  • Other voices (e.g. Alloy, Echo) seem more consistent in the same setup

Setup:

  • Realtime API via SIP integration

  • Sessions initiated programmatically with identical parameters

  • No changes to voice, instructions, or turn_detection config between sessions

Questions:

  1. Has anyone else noticed tone/personality drift in the Ballad voice specifically?

  2. Is there any known variability in voice characteristics tied to model updates or infrastructure changes on OpenAI’s side?

  3. Is there a way to “anchor” voice personality more reliably through the system prompt?

We’re considering removing Ballad from our available voice options for this use case, but wanted to check if this is a known issue or something with a workaround before doing so.

Any input appreciated.

1 Like