Realtime API Details (Costs, Usage, etc.)

Here are some other cost tests around interruption…

I first created a new project and asked the model to count to 100, then interrupted it by saying "stop." I noticed that it had streamed 1–38 back to the client but had only spoken 1–5 when I said stop.
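When the user interrupts like this, the client is expected to tell the server what was actually heard. A minimal sketch of the two client events involved, assuming the Realtime API's documented `response.cancel` and `conversation.item.truncate` events (the item ID and playback position below are hypothetical values, not taken from these tests):

```python
import json

def build_interrupt_events(item_id: str, audio_end_ms: int) -> list[dict]:
    """Build the two client events sent on interruption: cancel the
    in-flight response, then truncate the assistant's audio item to
    the portion the user actually heard."""
    return [
        {"type": "response.cancel"},
        {
            "type": "conversation.item.truncate",
            "item_id": item_id,        # hypothetical assistant item ID
            "content_index": 0,
            "audio_end_ms": audio_end_ms,
        },
    ]

# e.g. the user said "stop" 2.5 seconds into playback
events = build_interrupt_events("item_abc123", 2500)
payloads = [json.dumps(e) for e in events]  # what goes over the WebSocket
```

The cancel stops further generation; the truncate is what keeps the server-side conversation history aligned with what was spoken rather than what was streamed.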

Next I created a new project and asked the model to count to 100 again. It first tried to count by 10s, so I interrupted with "no, by 1s". This time I let it generate all 100 numbers but stopped it manually using the stop button.

My assumption is that output tokens are based on generated tokens, not necessarily spoken tokens. Since the model generates faster than it speaks, you can expect to pay for more tokens than were actually spoken when an interruption occurs. To verify that, I ran another test…
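One way to check this is to read the `usage` block on the server's `response.done` event, which reports what the model generated regardless of how much audio was played. A sketch with a hypothetical payload (the token counts are made up; the field layout follows my understanding of the Realtime API's usage object, which splits output tokens into text and audio):

```python
# Hypothetical response.done payload -- token counts are invented.
event = {
    "type": "response.done",
    "response": {
        "usage": {
            "total_tokens": 1200,
            "input_tokens": 150,
            "output_tokens": 1050,
            "output_token_details": {"text_tokens": 120, "audio_tokens": 930},
        }
    },
}

usage = event["response"]["usage"]
# These counts reflect what was *generated*, not what was spoken before
# an interruption -- which is why an interrupted turn can still bill
# for far more audio than the user heard.
generated_output = usage["output_tokens"]
generated_audio = usage["output_token_details"]["audio_tokens"]
```

Comparing `generated_audio` against the playback position at the moment of interruption is the direct way to measure the gap described above.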

I asked the model to count to 100 again and, as before, it wanted to go by 10s, so I interrupted and asked for 1s. This time I stopped it at 24 and then asked how many it had counted to. It said 26, which implies the last delta of tokens never got sent to the client when I interrupted, but it is still in the conversation history server-side.
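Keeping that server-side history in sync with what the user heard is what the truncate event's `audio_end_ms` field is for: the client reports how many milliseconds of audio were actually played. A sketch of computing that value from played samples, assuming the API's default 24 kHz PCM16 mono output (the sample counts here are illustrative):

```python
SAMPLE_RATE_HZ = 24_000  # assumed default PCM16 output rate

def audio_end_ms(samples_played: int, sample_rate: int = SAMPLE_RATE_HZ) -> int:
    """Milliseconds of audio the user actually heard, suitable for
    conversation.item.truncate's audio_end_ms field."""
    return int(samples_played / sample_rate * 1000)

# e.g. playback stopped after 60,000 mono samples
stopped_at = audio_end_ms(60_000)  # 2.5 seconds in
```

Without sending that truncation, the server's copy of the turn keeps everything it generated, which matches the "it said 26" behavior observed here.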

As one last test I asked the model to count to 100 again, but this time I opened the log and watched the deltas stream in. The audio streams in alongside the text, so when the text finishes chunking in, the audio finishes shortly after. I stopped playback while the model was on 47, but both the text and audio had long since finished streaming in. Another indication that playback rate is completely separate from audio generation rate.
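The gap between streaming rate and playback rate can be made concrete with a toy simulation (all rates below are made up for illustration): deltas arrive several times faster than real time, while playback drains at 1x, so most of the response sits buffered client-side long before it is heard.

```python
# Toy model: each "tick" is 200 ms of wall-clock time.
# The server delivers up to 5 chunks per tick (faster than real time);
# playback consumes exactly 1 chunk per tick (real time).
TOTAL_CHUNKS = 50  # a ~10-second response

buffered = delivered = played = 0
backlog_when_done_streaming = None

for tick in range(1, TOTAL_CHUNKS + 1):
    # server side: stream in up to 5 chunks until the response ends
    for _ in range(5):
        if delivered < TOTAL_CHUNKS:
            delivered += 1
            buffered += 1
    # client side: play back one chunk per tick
    if buffered:
        buffered -= 1
        played += 1
    if delivered == TOTAL_CHUNKS and backlog_when_done_streaming is None:
        backlog_when_done_streaming = buffered  # streaming just finished
```

In this toy run, streaming completes at tick 10 with 40 chunks still unplayed, while playback does not finish until tick 50, mirroring the log behavior above where the deltas finished long before I pressed stop at 47.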