Yes, it has been noted and reported. The only workaround is to be persistent and error tolerant with retries, until the issue gets so bad that the fine tuning model you paid to train is essentially useless.
It’s barely been a week of API fine-tuning gpt-4o models failing, with multiple reports and immediate replication ability. The last time the exact symptom yet complete outage happened on another class of AI models a month ago, it took two weeks.
Here is the current ongoing issue that you can join in the party on:
Apparently randomly breaking API developer applications over and over again without support response is going to be the new modus operandi.