Chatgpt 4o-mini fine-tuning fails.Internal error

e.kartal115 · December 18, 2024, 8:27pm

Hello,
I am encountering the following error:
The job experienced an error while training and failed; it has been re-enqueued for retry.

After this error, the fine-tuning process completely fails.

I am trying to fine-tune the model with German, French, and Turkish data.

I have already trained the AI with English data, and now I am training it with only 10% of the data in each of the languages mentioned above. My goal is to ensure the model works well with these languages.
Initially, I created a single file containing data in all three languages and started fine-tuning. It failed.
Then, I broke it down into three separate files and started simultaneous fine-tuning jobs.
- The fine-tuning for French and German succeeded, but Turkish failed.
I examined the Turkish data, but everything seems fine.

Because both French and German fine-tuning succeeded, I thought I could fine-tune the model trained with French data using the German data, or vice versa.

However, this attempt failed again (I tried both vice versa scenarios).

This behavior is quite puzzling, and I am unsure of what steps I should take next.

davidjosephind · December 19, 2024, 3:56am

Fine-Tune jobs are failing a lot for me as well. I think it is an issue on OpenAI’s end.

This was not an issue I have faced before, but now most fine-tuning jobs are failing for me as well with the same error message.

sinmu8191 · December 19, 2024, 7:05am

I have the same problem, I can only try once every 2-3 hours

matthew.walz · December 20, 2024, 1:52pm

Same thing has been happening to me now for GPT-4o. Nearly every fine-tuning job I run is failing. The strange thing is I can fine-tune the exact same file with GPT-3.5-turbo and it works fine. This tells me something is going on with OpenAI.

dwmann5 · December 22, 2024, 4:36pm

I am having the same issue with gpt4o-mini and a .jsonl file I have used before. Is it possible that the .jsonl format has changed since the fine tuning process now has two options (supervised and direct preference optimization)? I assume that the older format is supervised which is the default. The file I submitted passed the validation stage and the training began with metrics showing, but then it failed with the internal error a few minutes later.

dwmann5 · December 22, 2024, 4:50pm

Update - my gpt-3.5-turbo job is failing now too.

Update - the gpt4o-mini job finally went thru (!) after many retries and hours. Obviously an internal OpenAI issue.

aliatoui119 · January 1, 2025, 2:34pm

I’m experiencing the same issue and have tried multiple times since yesterday. Has anyone been able to resolve it or identify the source of the problem?

vb · January 2, 2025, 10:48pm

A fix has been deployed.
You can try to run your jobs again and please report in the topic below, if you should run into further issues.

vb · January 4, 2025, 10:48pm

This topic was automatically closed 2 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
The Job Failed Due to an Internal Error \| Fine-tuning gpt4o-mini API fine-tuning	14	594	January 4, 2025
Fine tuning fail on gpt-4o-mini-2024-07-18 API fine-tuning , fine-tuning-problems	12	445	March 25, 2025
Fine Tuning, job failed due to an internal error API fine-tuning-problems	3	767	January 20, 2025
"The job experienced an error while training and failed, it has been re-enqueued for retry." API fine-tuning-problems	5	93	January 20, 2025
Gpt-4o vision fine tuning jobs failing Bugs	9	127	March 25, 2025

Chatgpt 4o-mini fine-tuning fails.Internal error

Related topics