Can ChatGpt-4-o Function evaluate a Audio

lokeshchoudhary · June 1, 2024, 12:34pm

can anyone help me build this function I want to evaluate a class of student audio. so I can give marks based on the gpt evaluation.

anon22939549 · June 1, 2024, 3:12pm

Not yet possible, likely before the end of the year.

anon22939549 · June 1, 2024, 9:48pm

This is not currently true. OpenAI has not released the audio input capabilities of gpt-4o.

anon22939549 · June 2, 2024, 12:13am

You do realize what you are describing is not what we are discussing, yes?

Whisper is a speech-to-text model, so the output of that is going to be… <drumroll> text.

The model isn’t evaluating the audio.

Evaluating the audio would include things like,

All whisper does is make a best guess at the words which might be present in an audio file.

Perhaps this class is, for instance, an ESL class. Being able to tell if the speaker is correctly pronouncing things correctly would be important.

Or maybe it’s a drama class and we need to evaluate if the student’s vocal performance is compelling?

So, again, there is no currently released model from OpenAI which can evaluate audio.

msveshnikov · June 2, 2024, 9:49am

Only Gemini Pro 1.5 currently can evaluate audio

Topic		Replies	Views
Can GPT-4o analyze audio like it does with pictures? Community gpt-4	2	1640	July 30, 2024
API referances . gpt4.o Human like speak API gpt-4	1	486	July 19, 2024
Gpt-4o or whisper for kids speech Community whisper , audio	4	1254	July 12, 2024
Enabling Audio Access for GPT-4o via API API gpt-4	0	394	September 5, 2024
When will API support image/audio as input and output? API gpt-4 , chatgpt , api	1	1765	October 9, 2023