Internal Error during fine-tuning

Lokum · January 26, 2026, 11:20am

Hi, while fine-tuning a dataset with “gpt-4.1-nano-2025-04-14” I encounter an internal error . The files are validated without a problem but then the failure occurs. I wonder what might be the reason. My training file has only 100 and the validation file 50 lines of data. I tried batch size=1 first, then 4 but both have failed.

VeitB · January 26, 2026, 1:59pm

Hi and welcome to the Community!

I was able to reproduce this issue.
My fine-tuning job with gpt-4.1-nano-2025-04-14 ended just as just described:
The job failed due to an internal error. after several attemtps.
Job-ID: ftjob-vVd67BHKa8hb4TQlM2nO61P4

Flagging this to the team.

codesandtechs · January 25, 2026, 12:32pm

Can someone help me out here? I am getting the same errors - “The job experienced an error while training and failed, it has been re-enqueued for retry.“ and sometime first an “Internal error”. I tried various acceptable jsonl formats of model training and sizes but no effect. I tried at least a dozen times. I also saw once that the OpenAI platform is currently upgrading and this cause the error. Please help I am working on a mission critical project.

waseem · January 26, 2026, 1:19pm

I am getting same error……. still no fix

waseem · January 26, 2026, 11:48am

Hi everyone,

I started a supervised fine-tuning job using gpt-4o-mini-2024-07-18 with a very small JSONL dataset (only a handful of training examples).

*Can someone explain why a fine-tuning job with a very small dataset is taking this long to complete?

Thank you*

rob19 · January 26, 2026, 3:32pm

I guess I’m glad I’m not the only one!

I have been experiencing failed fine-tuning jobs since Thursday with a process that I have used successfully hundreds of times in the past. The base model is gpt-4o-mini-2024-07-18. In most cases, the job retries multiple times over the course of several hours and then finally fails with “The job failed due to an internal error.”

Over the past several days, I’ve had 13 fine-tuning jobs fail and 3 succeed.

I’ve raised an issue with OpenAI Support and have been told, “We will continue taking ownership of the underlying issue and ensure you have clear paths to keep moving forward.” but no helpful guidance or improvement.

a3jeu · January 26, 2026, 6:58pm

I have this same error

cflucas97 · January 26, 2026, 9:01pm

+1 since last week timeline as well for finetuning gpt-4o-mini-2024-07-18 model.

deadbeef · January 27, 2026, 1:18pm

What’s going on?
The status page indicates everything is peachy, but I’m still not able to fine-tune.
For reference, I am fine-tuning with base model gpt-4o-mini-2024-07-18.

Yesterday, I kept getting errors and re-queueing of jobs.
Today, my job appears to be stuck at “Files validated, moving job to queued state”.

edit: It finally started after more than 30 minutes, and proceeded to err/time out after another 30 minutes and get re-enqueued again. 3 hours later, the job ultimately failed.

deadbeef · January 28, 2026, 8:27am

This morning, fine-tuning seems to work again.
Thank you, whoever fixed the whatever!

Topic		Replies	Views
"The job experienced an error while training and failed, it has been re-enqueued for retry." API fine-tuning-problems	5	439	January 20, 2025
Fine tuning fail on gpt-4o-mini-2024-07-18 API fine-tuning , fine-tuning-problems	12	879	March 25, 2025
Chatgpt 4o-mini fine-tuning fails.Internal error API chatgpt	7	704	January 2, 2025
API fine tune constantly gives me "We're having trouble accessing your files right now. Please try again later." Bugs fine-tuning-problems , files-api	16	469	March 24, 2026
The Job Failed Due to an Internal Error \| Fine-tuning gpt4o-mini API fine-tuning	13	1150	January 2, 2025

Internal Error during fine-tuning

Related topics