simple-openai has been updated to support Audio on Chat Completions API.
You can take a look at the following demo code to see async speech-to-speech interactions with a model (audio in, audio out):