WhisperAI API Not Recognizing Valid File Formats

So I am creating a simple website that takes in phone recordings and transcribes them with the WhisperAI API. I ran into a peculiar problem where m4a audio files of phone recordings are not recognized as m4a by the API:
error 400 "Invalid file format. Supported formats: ['m4a', 'mp3', 'webm', 'mp4', 'mpga', 'wav', 'mpeg']"

Stranger yet, if I convert the file to mp4 (or any other format) and then convert it back to m4a, it is recognized as a valid file.

Here are some of my theories:

  1. The website uses the Next.js framework and axios, which might corrupt the audio file when sending it directly from the front end.
    counter: but then why did it work after converting the file back to m4a? And why didn't my function for filtering out invalid file formats catch it?

    const acceptedTypes = [
      'audio/mpeg', 'audio/mp3', 'audio/mp4',
      'audio/m4a', 'audio/wav', 'audio/webm',
    ];

    const { getRootProps, getInputProps } = useDropzone({
      accept: 'audio/mpeg, audio/mp3, audio/mp4, audio/m4a, audio/wav, audio/webm',
      onDrop: (acceptedFiles) => {
        if (acceptedFiles.length > 0) {
          const file = acceptedFiles[0];
          if (acceptedTypes.includes(file.type)) {
            try {
              // send the file to the API
            } catch (error) {
              console.error('Error setting audio file:', error);
              setError('Error setting audio file: ' + error.message);
            }
          } else {
            console.error('Invalid file type:', file.type);
            setError('Invalid file type: ' + file.type);
          }
        } else {
          console.error('No file selected');
          setError('No file selected');
        }
      },
    });
  2. The recording produced by the call-recording application (a Korean app called 후후 통화녹음, "WhoWho Call Recording") is itself problematic. I'm not sure how I could check that, since I have zero knowledge of how audio files are supposed to be structured.
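One quick check for theory 1 is to make sure the file's original name (and its .m4a extension) survives the upload, since the Whisper endpoint appears to rely on the filename in the multipart body to identify the format. A minimal sketch, assuming a Next.js API route at /api/transcribe (the route and helper name are hypothetical):

```javascript
// Hypothetical helper: build the multipart body with the file's original
// name so the ".m4a" extension reaches the server intact.
function buildUpload(blob, filename) {
  const form = new FormData();
  form.append('file', blob, filename); // keep "recording.m4a", not an anonymous blob
  return form;
}

// usage inside onDrop (assumed route):
// await axios.post('/api/transcribe', buildUpload(file, file.name));
```

If the upload reaches the backend with a generic name like "blob", the extension-based format check on the API side can fail even when the bytes are fine.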

I’m not sure how to proceed from here so I would really appreciate any sort of feedbacks. Thanks in advance!


I am encountering the same problem. Curiously, this actually worked as recently as a month ago, but trying it again today it no longer works as expected, which suggests OpenAI updated something. I also did what you tried (converting to mp3 and then back to m4a). I noticed that in the converted m4a output, the duration is displayed correctly when played in an audio player, but in the file saved directly from the Web Audio API it is missing. So perhaps this is related to the moov atom issue in m4a audio files. I am still trying to find a way to solve this.
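To check the moov-atom theory without deep audio knowledge, one can list the top-level boxes ("atoms") of the m4a file: in a streaming-unfriendly file, mdat comes before moov. A rough Node sketch (the helper name is mine, and real files may need more robust parsing):

```javascript
// Rough sketch: list the top-level box (atom) types of an MP4/M4A buffer.
// MP4 boxes start with a 4-byte big-endian size followed by a 4-char type.
function listAtoms(buf) {
  const atoms = [];
  let offset = 0;
  while (offset + 8 <= buf.length) {
    let size = buf.readUInt32BE(offset);
    const type = buf.toString('latin1', offset + 4, offset + 8);
    if (size === 1) {
      // 64-bit extended size stored right after the type field
      size = Number(buf.readBigUInt64BE(offset + 8));
    } else if (size === 0) {
      size = buf.length - offset; // box extends to end of file
    }
    if (size < 8) break; // corrupt header, stop rather than loop forever
    atoms.push(type);
    offset += size;
  }
  return atoms;
}
```

If the output shows mdat before moov (or no moov at all), re-muxing with ffmpeg's `-movflags +faststart` option together with `-c copy` (a copy, not a re-encode) should move the moov atom to the front.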


Hm. I will have to look into that too. But based on your response, at least now I know it's something specific to m4a and OpenAI. Thank you for sharing! Let me know if you find any leads and I'll keep you updated on my side.

In my case, I'm generating the audio from a base64-encoded string and manually setting the file type to ogg, but the API responds that it does not understand the audio format:

  const buffer = Buffer.from(media.data, 'base64')
  const audio = await toFile(buffer, media.filename, { type: 'ogg' })

Is that the expected mime type?


As far as I know, ogg is not supported.

The following input file types are supported: mp3, mp4, mpeg, mpga, m4a, wav, and webm.
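If the buffer really holds Ogg data, it needs transcoding to one of the supported formats first. If it is actually one of the supported formats, passing a full MIME type and a matching extension to toFile may help, since 'ogg' alone is not a MIME type. A minimal sketch (the helper name and mapping table are my own, not part of the openai SDK):

```javascript
// Hypothetical helper: map a supported extension to a full MIME type
// before calling toFile(buffer, filename, { type: ... }).
const WHISPER_MIME = {
  mp3: 'audio/mpeg',
  mp4: 'audio/mp4',
  mpeg: 'audio/mpeg',
  mpga: 'audio/mpeg',
  m4a: 'audio/m4a',
  wav: 'audio/wav',
  webm: 'audio/webm',
};

function mimeFor(filename) {
  const ext = filename.split('.').pop().toLowerCase();
  const mime = WHISPER_MIME[ext];
  if (!mime) throw new Error(`Unsupported audio format: .${ext}`);
  return mime;
}

// const audio = await toFile(buffer, media.filename, { type: mimeFor(media.filename) });
```

Failing fast on an unsupported extension at least turns the opaque 400 from the API into a clear client-side error.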