What is 1M output tokens for audio in the Realtime Pricing?

tejassharma08 · January 6, 2025, 6:28pm

The TTS models and other ones calculate it at per minute of audio, but I don’t understand what “1M output tokens” means for audio in the realtime API pricing.

For text, of course it’s obvious, but for audio, is it saying each word that is generated like the text that it converts to audio counts as a token?

tejassharma08 · April 5, 2025, 9:08pm

In case anyone was wondering, support said it was very variant and the best way to find out is just by testing and exerperimenting

Topic		Replies	Views
Confusion Between Per-Minute Audio Pricing vs. Token-Based Audio Pricing API realtime	3	8976	December 30, 2024
Openai gpt-4o-mini-tts price API	1	285	June 4, 2025
WebRTC gpt-4o-audio cost per minute of conversation? API gpt-4o-audio-preview	2	1535	March 11, 2025
Realtime API pricing questions: text input and audio tokens API realtime	7	546	December 6, 2025
How do I calculate the usage cost when using the GPT-4o-mini-TTS model? API assistants-api	5	1225	May 18, 2025

What is 1M output tokens for audio in the Realtime Pricing?

Related topics