Input audio format documentation?

Hi,

I noticed that gpt-4o audio has been released. I looked through the getting-started guide here. Is there any other way to pass audio to the model? Can someone point me to the right place to look?