I’m having the same issue today. It looks like this wasn’t fixed yet.
I wish they would expand the capabilities of whisper API. At this point it looks like I’ll have to run a whisper model on a remote instance or something.
Just a few extra parameters would help a lot.
Perhaps there’s a way to perform some VAD locally before saving/sending the audio recordings. I’ll have to look into that