Real Time API Voices Are Worse Than The Voice on ChatGPT

I really don’t understand this. Why are the real-time voices in ChatGPT and in the API so different? I’m not talking about the voice characters. Their speaking abilities are completely different. The voice in the API does not sound like a real person’s voice. It can’t adjust its tone for questions and the like. The gap is like the one between GPT-3 and GPT-4. It’s huge!

I don’t understand why they don’t just release the same thing that we use in the ChatGPT app.