Fine-tuning server errors/restarts

I am trying to create a fine-tune model, and keep running into server errors, or the job restarting without any error in the events.

The server error is:
“Server error. Returning to queue for retry”

Is there any way for me to find more information on this error? It appears to be on OpenAI’s end as on one occasion I received the error, the job re-enqueued and completed successfully.

An example instance of the event log:

This issue has been happening all day for me. Any help is appreciated.

2 Likes

+1 on this. Any help would be much appreciated.

+2 on this! How to know the API key works successfully?

Were you able to fine-tune successfully before?

Might try reaching out on their new help system…

https://help.openai.com/en/

Good luck! Please let us know how it turns out.

+1 . I am also facing this issue while fine tuning for Hindi input-output pairs.