Are GPT-4o-transcribe and the audio models ready to use via the API?

Hi, has anyone started using the speech-to-text model GPT-4o-transcribe via the API yet?

I understand this is a conversational model, but I only want to use it for speech to text. Any suggestions on alternative approaches, and any best-practice tips?
Thank you

Thank you @1uc4s_m4theus

The real reason for wanting to do this is the increased accuracy and real-time streaming. Both are weak with the whisper model.

Welcome to the community @saby

Yes, gpt-4o-transcribe can be used directly over the API for transcriptions, and it produces much higher-quality transcriptions than whisper-1.

It can be used for streaming transcriptions for both recorded audio and live-streaming audio.
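
For anyone who wants a starting point, here is a minimal Python sketch of both modes for a recorded file. It assumes the official `openai` package is installed and `OPENAI_API_KEY` is set. The helper names (`transcribe_file`, `iter_deltas`, `stream_file`) and the file name `meeting.wav` are made up for illustration, and the event type string reflects my understanding of the streaming response, so check it against the current API reference. Live microphone audio goes through a different interface and is not shown here.

```python
from typing import Iterable, Iterator


def transcribe_file(path: str, model: str = "gpt-4o-transcribe") -> str:
    """Send one recorded audio file and return the full transcript text."""
    from openai import OpenAI  # third-party: pip install openai

    client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment
    with open(path, "rb") as audio:
        result = client.audio.transcriptions.create(model=model, file=audio)
    return result.text


def iter_deltas(events: Iterable) -> Iterator[str]:
    """Yield only the partial-text fragments from a stream of events."""
    for event in events:
        # Assumption: partial text arrives as "transcript.text.delta" events
        # carrying a `delta` string; other event types are skipped.
        if getattr(event, "type", None) == "transcript.text.delta":
            yield event.delta


def stream_file(path: str, model: str = "gpt-4o-transcribe") -> Iterator[str]:
    """Transcribe a recorded file, yielding text as the API produces it."""
    from openai import OpenAI

    client = OpenAI()
    with open(path, "rb") as audio:
        events = client.audio.transcriptions.create(
            model=model, file=audio, stream=True
        )
        yield from iter_deltas(events)


if __name__ == "__main__":
    # "meeting.wav" is a placeholder file name.
    for fragment in stream_file("meeting.wav"):
        print(fragment, end="", flush=True)
    print()
```

Keeping `iter_deltas` separate from the network call makes it easy to try the event handling with fake events before spending API credits.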

Thanks for the reply, @sps!

It's up to you to evaluate…