Gpt-4o-mini-tts voice inconsistency between requests

evdodima · April 7, 2025, 9:44am

I use TTS models for generating multi speaker dialogs. Each replica is generated in different request using same “instructions” for each speaker. However voices are really inconsistent despite same instructions and the resulting audio sounds like there are many different speakers.

This makes this API unusable for my specific use case, so switching back to TTS-1

anon1374209 · April 8, 2025, 4:12am

have you tried improving the prompt?

Topic		Replies	Views
Is it possible to add a "seed" to the gpt-4o-mini-tts model? mismatch between requests Bugs tts	0	63	April 23, 2025
Gpt-4o-mini-tts produces unusable results Bugs tts	4	400	April 19, 2025
Huge problems with TTS API Bugs tts	4	1891	May 27, 2024
TTS is unpredictable and often really wrong for non-English requests API tts	7	994	January 15, 2025
Realtime API: Voice Pitch Change in mid conversation Bugs api , realtime	0	112	February 16, 2025

Gpt-4o-mini-tts voice inconsistency between requests

Related topics