Hi
I fine-tuned GPT-4o on 1,000 samples in a supervised setup and it completed without any issue. Then I tried DPO on the same 1,000 preferred responses paired with 1,000 non-preferred responses.
All of this data (preferred and non-preferred) was AI-generated. I'm baffled because my data has no obvious issues, and neither the training file nor the validation file was flagged when they were checked before fine-tuning started. Yet the job fails at the end. I'm really wondering how to get any help on this. Thanks so much
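In case it's relevant, here is a minimal sketch of the kind of local sanity check one could run on a DPO JSONL file before uploading it. It assumes the documented preference format where each line is a JSON object with `input`, `preferred_output`, and `non_preferred_output` keys; the function names are just for illustration.

```python
import json

# Keys assumed from the documented DPO preference-format JSONL.
REQUIRED_KEYS = {"input", "preferred_output", "non_preferred_output"}

def check_dpo_line(line: str) -> list[str]:
    """Return a list of problems found in one JSONL line (empty if OK)."""
    problems = []
    try:
        record = json.loads(line)
    except json.JSONDecodeError as e:
        return [f"invalid JSON: {e}"]
    missing = REQUIRED_KEYS - record.keys()
    if missing:
        problems.append(f"missing keys: {sorted(missing)}")
    # Both completions should be non-empty lists of chat messages.
    for key in ("preferred_output", "non_preferred_output"):
        msgs = record.get(key)
        if not isinstance(msgs, list) or not msgs:
            problems.append(f"{key} should be a non-empty list of messages")
    return problems

def check_dpo_file(path: str) -> dict[int, list[str]]:
    """Map 1-based line numbers to their problems; empty dict means clean."""
    issues = {}
    with open(path, encoding="utf-8") as f:
        for i, line in enumerate(f, start=1):
            if line.strip():
                found = check_dpo_line(line)
                if found:
                    issues[i] = found
    return issues
```

This only catches structural problems (malformed JSON, missing or empty fields), not whatever the platform checks at the end of a run, but it at least rules out the file itself.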