GPT-4o Audio Access for API

https://platform.openai.com/docs/guides/text-to-speech

Are you looking for this? Or do you want your players to be able to talk in realtime? Then realtime api might be right. Which is pretty expensive though. So caching would be neccessary.