gpt-4o-audio-preview -> audio as input and text as output

I cannot use a general-purpose library to convert the audio to the API-compatible format.

How do I get wav_bytes into the same format as wav_data, which is the format OpenAI's API accepts?
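
For reference, here is a minimal sketch of one possible approach using only the Python standard library. It assumes that wav_data is the raw bytes of a complete WAV file (RIFF header plus samples) and that the audio I have is raw 16-bit mono PCM at 16 kHz; both are assumptions, as are the helper names pcm_to_wav_bytes and build_input_audio_part, which are made up for illustration. The message shape follows the input_audio content part used with the Chat Completions API.

```python
import base64
import io
import wave


def pcm_to_wav_bytes(pcm: bytes, sample_rate: int = 16000,
                     channels: int = 1, sample_width: int = 2) -> bytes:
    """Wrap raw PCM samples in a WAV container held in memory.

    The defaults (16 kHz, mono, 16-bit) are assumptions; they must match
    how the PCM data was actually recorded.
    """
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wf:
        wf.setnchannels(channels)
        wf.setsampwidth(sample_width)  # bytes per sample, 2 = 16-bit
        wf.setframerate(sample_rate)
        wf.writeframes(pcm)
    # The wave module does not close a file object it did not open,
    # so the buffer is still readable here.
    return buf.getvalue()


def build_input_audio_part(wav_bytes: bytes) -> dict:
    """Base64-encode WAV bytes into an input_audio content part."""
    encoded = base64.b64encode(wav_bytes).decode("utf-8")
    return {
        "type": "input_audio",
        "input_audio": {"data": encoded, "format": "wav"},
    }


# Example: one second of silence as stand-in PCM data (hypothetical input).
pcm = b"\x00\x00" * 16000
wav_bytes = pcm_to_wav_bytes(pcm)
audio_part = build_input_audio_part(wav_bytes)
```

The resulting audio_part would go into the content list of a user message, next to any text part, in a request to gpt-4o-audio-preview. If the source audio uses a different sample rate, channel count, or sample width, those parameters need to change accordingly.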