I’m trying to generate speech using gpt-4o-mini-tts with onyx voice and the following instructions:
Talk as a 38 year old Male with strong Boston accents.
Delivery: Natural conversational with strong Boston accents
Emotion: reserved
However, it doesn’t seem to work.
Anyone has a trick to make it sound like someone from north east US?
Thanks!
Accents can be tricky with TTS models. In my experience, breaking the voice instructions into very simple, direct traits and avoiding age or emotional overload sometimes helps. Also worth experimenting with phonetic hints or example phrases typical to the region.