AMA on the 17th of December with OpenAI's API Team: Post Your Questions Here

Nothing to share yet on V3 Whisper in the API. But for both audio understanding and TTS, do check out the new GPT-4o mini audio preview model. It’s got state of the art speech understanding and you can prompt the model directly to control how it hears and speaks! For example, give it a prompt like "Say the following in a somber tone, and make sure to pause your speech appropriately: "

6 Likes