At the moment, I am interested in information on the following questions:
-
Updating the API whisper on V3?
The current version of V2 is not as accurate and contains hallucinations. -
Expanding the capabilities for speech control in the API TTS? (pauses, emotions, etc.)
-
Voice cloning