Hello there, I’m having a weird issue!
I’ve been building a prototype service that uses MediaRecorder to record voice in the browser, then uses the Python OpenAI client to transcribe that audio with Whisper.
The weird part is that the .mp4 file recorded in a Chromium-based browser works perfectly, while the one recorded in Safari (both mobile and desktop) can’t be processed properly. Here is the ffprobe output for each:
iOS/Safari:
Input #0, mov,mp4,m4a,3gp,3g2,mj2, from '1_recorded_audio_1701849225592.mp4':
Metadata:
major_brand : iso5
minor_version : 1
compatible_brands: isomiso5hlsf
creation_time : 2023-12-06T07:53:40.000000Z
Duration: 00:00:04.95, start: 0.000000, bitrate: 188 kb/s
Stream #0:0[0x1]: Audio: aac (LC) (mp4a / 0x6134706D), 44100 Hz, mono, fltp, 185 kb/s (default)
Metadata:
creation_time : 2023-12-06T07:53:40.000000Z
handler_name : Core Media Audio
vendor_id : [0][0][0][0]
Chrome:
Input #0, matroska,webm, from '1_recorded_audio_1701849299527.mp4':
Metadata:
encoder : Opera
Duration: N/A, start: 0.000000, bitrate: N/A
Stream #0:0(eng): Audio: opus, 48000 Hz, mono, fltp (default)
Oh… maybe I got it.
While I was writing this post, I looked at the ffprobe output again: MediaRecorder on iOS/Safari saves the audio as AAC in an MP4 container, while the Chrome recording is actually Opus in a WebM container, even though I named both files .mp4.
I think this difference is the main issue. I’ll leave the thread up for the community in case somebody has the same problem.
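In case it helps anyone debugging the same thing, here is a minimal sketch for checking on the server which container a recording really is, instead of trusting the .mp4 name. It only looks at the first bytes: WebM/Matroska files start with the EBML magic 1A 45 DF A3, and MP4 files carry an "ftyp" box tag at byte offset 4. The function names sniff_container and upload_name are my own (hypothetical), and this only tells you the container; it does not prove whether the container or the AAC codec is what breaks transcription.

```python
def sniff_container(data: bytes) -> str:
    """Guess the container format from the first bytes of a recording."""
    # WebM/Matroska files begin with the EBML header magic number.
    if data[:4] == b"\x1a\x45\xdf\xa3":
        return "webm"
    # ISO base media files (MP4/M4A) have an 'ftyp' box tag at offset 4.
    if data[4:8] == b"ftyp":
        return "mp4"
    return "unknown"


def upload_name(stem: str, data: bytes) -> str:
    """Build a file name whose extension matches the real container.

    Hypothetical helper: falls back to '.bin' when the format is not recognised.
    """
    container = sniff_container(data)
    extension = container if container != "unknown" else "bin"
    return f"{stem}.{extension}"
```

With this, the Chrome recording above would be named with a .webm extension and the Safari one with .mp4, so at least the name matches what ffprobe reports before the file is passed on.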
Any advice is welcome!