I don't understand the pricing for the realtime API

It’s even more confusing when you consider that audio in is not advanced audio, but its a transcription by Whispers (so cheap). I think what we are really paying for is the fact that this conversation is cached to make it so fast.