hi all,
I am a research student trying to fine-tune babbage-002 with 800 datapoints. After the fine-tune failed, I got the following warning:
"This file failed moderation safety checks. The OpenAI Moderation API identifies fine tuning examples that violate our content policies."
My question is: my data contains original user posts (prompts) with language that is helpful for the mental-health root-cause classification job (completions). How can I pass this stage?
Any advice please. I am really stuck at this point.
There is no way forward unless you contact support for permission, because the file was flagged by the OpenAI Moderation safety checks.
What you can do is pull out the examples and send them to the moderations endpoint yourself, recording the scores and flags for each one.
From that analysis you may discover which types of inputs or desired responses are triggering the check, and which others might contribute to a total "reject" score.
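A minimal sketch of that idea, assuming the `openai` Python SDK, an `OPENAI_API_KEY` in the environment, and a training file in the prompt/completion JSONL format used for babbage-002 fine-tuning (the file name `train.jsonl` and the helper names here are my own, not anything official):

```python
# Run each fine-tuning example through the Moderation endpoint and
# record which ones are flagged, plus their per-category scores.
import json

def texts_from_example(record):
    """Collect the text fields of one prompt/completion JSONL record."""
    return [record.get("prompt", ""), record.get("completion", "")]

def moderate_file(path, client):
    """Return (line_number, flagged, category_scores) for each example."""
    results = []
    with open(path) as f:
        for line_no, line in enumerate(f, start=1):
            record = json.loads(line)
            resp = client.moderations.create(input=texts_from_example(record))
            for r in resp.results:
                results.append((line_no, r.flagged, r.category_scores.model_dump()))
    return results

if __name__ == "__main__":
    from openai import OpenAI
    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    for line_no, flagged, scores in moderate_file("train.jsonl", client):
        if flagged:
            top = max(scores, key=scores.get)
            print(f"line {line_no}: flagged, top category {top} = {scores[top]:.3f}")
```

Even for examples that are not flagged, the raw `category_scores` are worth keeping: sorting by the highest score can show which borderline examples are likely pushing the file over the rejection threshold.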
Thank you, will do so… let's see if I can get around it, as most of my content has such conversations, which are people's real-life experiences contributing to their mental health.