I’m getting this usage block from the WebRTC realtime API and I want to calculate total cost based on it: { "total_tokens": 821, "input_tokens": 789, "output_tokens": 32, "input_token_details": { "text_tokens": 313, "audio_tokens": 476, "cached_tokens": 640, "cached_tokens_d…

Help me understand the realtime usage block

sps December 18, 2024, 4:40pm 5

Text input that hits the cache costs 50% less. Audio input that hits the cache costs 80% less.
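As a rough sketch of how those two discounts feed into the input side of the bill, something like the following could work. The function name `input_cost`, its parameters, and the per-million-token rates are all placeholders for illustration, not real prices. It also assumes that cached tokens are already counted inside `text_tokens` and `audio_tokens`, and that you know how the cached total splits between text and audio. That split is cut off in the usage block quoted above, so the numbers in the example call are made up.

```python
# Discounts as stated in the reply above.
TEXT_CACHE_DISCOUNT = 0.50   # cached text input costs 50% less
AUDIO_CACHE_DISCOUNT = 0.80  # cached audio input costs 80% less


def input_cost(text_tokens, audio_tokens, cached_text, cached_audio,
               text_rate, audio_rate):
    """Estimate input cost in USD.

    text_rate and audio_rate are USD per 1M input tokens (hypothetical
    parameters; take real values from the current price list).
    Assumes cached_* counts are subsets of text_tokens / audio_tokens.
    """
    if cached_text > text_tokens or cached_audio > audio_tokens:
        raise ValueError("cached tokens cannot exceed total tokens")

    # Uncached tokens are billed at the full rate.
    uncached_text = text_tokens - cached_text
    uncached_audio = audio_tokens - cached_audio

    # Cached tokens are billed at (1 - discount) of the full rate.
    billable_text = uncached_text + cached_text * (1 - TEXT_CACHE_DISCOUNT)
    billable_audio = uncached_audio + cached_audio * (1 - AUDIO_CACHE_DISCOUNT)

    return (billable_text * text_rate + billable_audio * audio_rate) / 1_000_000


# Example call: 313 text and 476 audio input tokens come from the usage block;
# the 256/384 cached split and both rates are invented for illustration.
estimate = input_cost(313, 476, cached_text=256, cached_audio=384,
                      text_rate=10.0, audio_rate=100.0)
print(f"estimated input cost: ${estimate:.6f}")
```

Output tokens would be added on top at their own text and audio rates; they are left out here because the quoted block does not show the output breakdown.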

Here is the announcement regarding prompt caching on the Realtime API:

