Realtime model over sip: Advice for logging agent audio packets

Hey all,
First time posting.
I’ve set up my realtime agent and it works fine with sip. However, I have a requirement that the calls need to be recorded, or at least have some playback capability.
I thought the agent was going to be logging audio packets and subsequently i set up a redis streams message broker service to handle these. As it turns out, i can only seem to get the agent audio packets on interrupted agent utterances. I can get user ones just fine currently.

There are multiple options.. you can let the sip provider (twillio?) do the recording I suppose. As far as I remember they have two parallel channels so that should be possible.

Or you add a SBC/Media Proxy between OpenAI and the provider and do the recording over there.

Ahh, cheers for the response! I was hoping there was a way of transcoding audio packets so i get two birds with one stone with my log saving application. Its seeming like more of a headache than getting the media proxy up between them.

Hmm well, there is whisper and many other STT models.