When I use 40-mini TTS, I’m consistently getting errors/hallucinations in the output.
I using relatively large text inputs to create an audio output (for a story), and the result is pretty clean for maybe the first 70%, but I’ll then get weird long gaps in the audio, or in some cases short gaps where it then fills in with hallucinated audio.
Has anybody else seen this issue with TTS? Any tips on eliminating these artifacts, my outputs are unusable because of it.