How can I interpret audio messages sent in conversations?

I use the OpenAI Assistant in my chatbot. How can I get it to interpret audio messages sent in conversations?