Having a similar issue with Safari on macOS 12.6.3. Audio from Chrome can be submitted without issue, as long as it is saved first. If I transmit the blob directly via my Flask app, I get the "Invalid file format" error regardless of whether I use Chrome or Safari. Taking my app to Windows to see if the issue persists.
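One possible explanation for the saved-file versus direct-blob difference, not confirmed anywhere in this thread, is that an in-memory blob carries no filename, so the format cannot be inferred from an extension. A minimal sketch, assuming the upload client reads the file object's `name` attribute; `named_buffer` is a hypothetical helper, and the filename you pass should match the real encoding of the bytes:

```python
import io


def named_buffer(blob: bytes, filename: str) -> io.BytesIO:
    """Wrap raw audio bytes in a file-like object that carries a filename.

    Assumption: the client library uses the `name` attribute of the
    file object to decide which format it is sending.
    """
    buf = io.BytesIO(blob)
    buf.name = filename  # BytesIO instances accept extra attributes
    return buf
```

In a Flask handler this would wrap the bytes read from the request before handing them to the transcription call, instead of writing them to disk first.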
Same here,
I want to test Whisper from a Databricks notebook, but it doesn't recognize the audio files.
Hopefully we get a fix because I spent quite some time testing different things none of which worked.
Tried changing the value of the "major brand" bytes at the beginning of the "ftyp" box from [105,115,111,54] to [105,115,111,109], since [105,115,111,109] is also listed in the compatible brands. This passes the Whisper type check.
As a result, Whisper now sometimes returns a good result and sometimes returns an empty string.
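The byte swap described above can be sketched as follows. This assumes the file starts with a standard 32-bit-size `ftyp` box (size, type, major brand, minor version, then compatible brands); `patch_major_brand` is a hypothetical helper name. As noted in the post, passing the type check this way did not give reliable transcriptions, so treat it as a diagnostic rather than a fix.

```python
import struct

ISO6 = bytes([105, 115, 111, 54])   # b"iso6"
ISOM = bytes([105, 115, 111, 109])  # b"isom"


def patch_major_brand(data: bytes, old: bytes = ISO6, new: bytes = ISOM) -> bytes:
    """Return a copy of `data` with the ftyp major brand changed from old to new.

    The input is returned unchanged if it does not start with an ftyp box,
    if the major brand is not `old`, or if `new` is not already listed
    among the compatible brands.
    """
    if len(data) < 16 or data[4:8] != b"ftyp":
        return data
    (box_size,) = struct.unpack(">I", data[0:4])
    if box_size < 16 or box_size > len(data):
        return data
    major = data[8:12]
    # Compatible brands start after the 4-byte minor version field.
    compatible = [data[i:i + 4] for i in range(16, box_size, 4)]
    if major != old or new not in compatible:
        return data
    return data[:8] + new + data[12:]
```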
Safari doesn't actually generate wav files. It uses an mp4 encoding. Either way, OpenAI's servers don't like it.
Your options are mostly to re-encode on the client side with a polyfill or on the server side with something like pydub. There are also other transcription options out there that don't have this specific problem.
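A rough sketch of the server-side re-encoding option. Note that this calls the ffmpeg command-line tool directly rather than pydub (pydub relies on ffmpeg for non-wav formats anyway), and it assumes `ffmpeg` is installed and on the PATH. The helper names, the mono 16 kHz settings, and the `_whisper.wav` suffix are illustrative choices, not requirements stated in this thread.

```python
import subprocess
from pathlib import Path
from typing import List


def build_ffmpeg_command(src: str, dst: str, sample_rate: int = 16000) -> List[str]:
    """Build an ffmpeg command that converts src to mono audio at sample_rate."""
    return ["ffmpeg", "-y", "-i", src, "-ac", "1", "-ar", str(sample_rate), dst]


def reencode_to_wav(src: str) -> str:
    """Re-encode src to a new wav file next to it and return the new path."""
    source = Path(src)
    # Use a distinct name so an input that is already .wav is never overwritten.
    dst = str(source.with_name(source.stem + "_whisper.wav"))
    subprocess.run(build_ffmpeg_command(src, dst), check=True)
    return dst
```

The returned path can then be opened and uploaded in place of the original Safari recording.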
I have a very strange issue: up to yesterday, the OpenAI Whisper API was accepting files recorded with Voice Memos on iOS without any issues. Today they're not working at all, and I keep getting a file format error saying that my file format is not supported.
Could this be something they changed on their side? I didn't change anything in my code.
Hi all! We did indeed make a change yesterday to try to support more files as described in this thread. Please let me know if that has helped with things!
@omarsultan - sorry to hear about this. I've tried out some voice memo files on my end and they all seem to be working - can you give any more details so I can repro? Maybe iOS version number, file extension, etc?
It turns out that what was causing the crash was some preprocessing I was doing to the file because it initially didn't work. But now it just works out of the box with audio files recorded from the Voice Memos app on iOS 16.
I still get this error when uploading an MP3 - can you advise?
openai.error.InvalidRequestError: Invalid file format. Supported formats: ['m4a', 'mp3', 'webm', 'mp4', 'mpga', 'wav', 'mpeg']
I'm using the record plugin in a Flutter app (record | Flutter Package) to capture audio. The generated audio files were working with the Whisper API until recently. I thought the problem was in how I was sending the request (e.g. MIME types), but I can now verify with two audio files (Android and iOS) that if I send them with the provided curl call, the Android-generated file goes through, but the iOS-generated file is rejected as an invalid format (both files are audio/mp4). Is there any way I can provide the files for testing and improving the API, or do you recommend I transcode the iOS files (as others have done)? What's the best way to approach this?
I figured it out... I just changed the file extension on iOS to the one used by voice notes on iOS (m4a) rather than mp4, even though the file is of type "audio/mp4". On Android it works with the .mp4 extension. I'm using the record Flutter package.
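For anyone handling this on a server instead of in the Flutter app, the same rename can be sketched in Python. `upload_filename` is a hypothetical helper; the thread only confirms that the iOS recordings needed the .m4a extension, so applying the rename to every audio/mp4 upload (including Android files, which worked as .mp4) is an assumption.

```python
from pathlib import PurePosixPath


def upload_filename(original_name: str, mime_type: str) -> str:
    """Pick the filename to send with the upload.

    Files of type audio/mp4 that end in .mp4 are renamed to .m4a;
    everything else keeps its original name. The bytes are not touched.
    """
    path = PurePosixPath(original_name)
    if mime_type == "audio/mp4" and path.suffix.lower() == ".mp4":
        return str(path.with_suffix(".m4a"))
    return original_name
```

Only the name sent with the request changes here; no transcoding takes place.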