Image fine-tuning: false positive content policy violations

My images keep getting rejected for breaking policy, but they clearly don’t! They are just images of industrial settings, captured with both a black-and-white camera and an RGB camera. @willhang

Now I am thinking it could be due to a misleading error message. I was getting “rejected by policy” before; now I get something like “rejected by policy, inaccessible, or too large”, so it could be something wrong with my link. I included a URL pointing at my own server to download each image, and my server’s firewall might have blocked the request. I will try using base64 and update.

Update: I switched to base64 and it is still not working. Looking at the images themselves, I don’t see any problems.
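
For anyone trying the same thing, here is roughly what my base64 attempt looked like; a minimal sketch where the file name, prompt, and label are placeholders, assuming the chat-style vision fine-tuning JSONL format with `image_url` content parts:

```python
import base64
import json
import mimetypes

def to_data_url(path: str) -> str:
    """Read a local image and encode it as a base64 data URL."""
    mime = mimetypes.guess_type(path)[0] or "image/jpeg"
    with open(path, "rb") as f:
        encoded = base64.b64encode(f.read()).decode("utf-8")
    return f"data:{mime};base64,{encoded}"

# One JSONL training line with the image inlined instead of fetched from my server.
line = {
    "messages": [
        {"role": "user", "content": [
            {"type": "text", "text": "Describe this scene."},  # placeholder prompt
            {"type": "image_url", "image_url": {"url": to_data_url("factory_01.png")}},
        ]},
        {"role": "assistant", "content": "Industrial conveyor line, RGB camera."},  # placeholder label
    ]
}
print(json.dumps(line))
```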

Thanks for sharing this, @wguo6358, and sorry that you’re running into this! We’re working on making our error messages more specific; we kept them intentionally vague at first to prevent abuse.

May I get your fine-tuning job ID so I can look into exactly why your images were moderated? Sometimes, our content moderation systems can incorrectly block images.

ftjob-voSUy0Na9Et74q2n6Ob0VGC7

Hmm, so I found your file, and I can tell you which indices in your training file you should look into, but do you want to talk about that here or in a DM/over email? I’m okay with either, but just want to be mindful of your privacy. I can post the example indices here in this forum if you’re okay with that.

Yeah, you can post it here.

Hey Will. It was great to meet you at DevDay. I’m having a similar problem trying to label images of AI-generated characters. They’re cartoons, but I think the moderation model can’t tell the difference.

My false positive rate is roughly 100 out of 22,000.

So about 0.5%. Not a big deal, but enough to be noticeable.

Also, looking at the error message, the wording says it could be a read error or a timeout. So I’m not sure what their retry policy is, but there is some ambiguity between “violation” and “unavailable” from what I can see in the logs.

For my use case the false positive rate is almost 80%. Most of my images are misflagged.

Yeah that is crazy high. Not the normal rate whatsoever. :-1:

If you’re using URLs, you might try hosting your images on S3 (or another durable provider) and shrinking the image sizes, just to rule out the content violation angle, because it could be a timeout thing…
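
For the shrinking part, a rough sketch with Pillow (the 512 px cap and JPEG quality are arbitrary picks of mine, not documented limits):

```python
from PIL import Image  # third-party: pip install pillow

def shrink(src: str, dst: str, max_side: int = 512) -> None:
    """Downscale an image so its longest side is at most max_side pixels."""
    img = Image.open(src)
    img.thumbnail((max_side, max_side))  # in-place; keeps aspect ratio, never upscales
    img.convert("RGB").save(dst, "JPEG", quality=85)

shrink("factory_01.png", "factory_01_small.jpg")  # hypothetical filenames
```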

To explain the timeout angle: when the service validates the JSONL, it actually pulls down the image referenced in each line and runs it through moderation. So any hiccup in fetching the image will get that JSONL line rejected, and you see the ambiguous message about a violation/unavailable image.

Like I said above, there is ambiguity between timeouts/unavailability and actual violations; the logs don’t distinguish the two right now.
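
One way to separate those two failure modes yourself is to fetch every URL in the training file before uploading and log which ones error out. A hypothetical pre-flight check (the 10-second timeout is my guess, not the service’s actual limit, and the field names assume the chat-style vision JSONL format):

```python
import json

import requests  # third-party: pip install requests

def preflight(jsonl_path: str, timeout: float = 10.0) -> None:
    """Try to fetch every image URL in the training file and report failures."""
    with open(jsonl_path) as f:
        for i, raw in enumerate(f):
            example = json.loads(raw)
            for msg in example.get("messages", []):
                content = msg.get("content")
                if not isinstance(content, list):
                    continue  # plain string content, no image parts
                for part in content:
                    if part.get("type") != "image_url":
                        continue
                    url = part["image_url"]["url"]
                    if url.startswith("data:"):
                        continue  # inlined base64, nothing to fetch
                    try:
                        resp = requests.get(url, timeout=timeout)
                        resp.raise_for_status()
                    except requests.RequestException as err:
                        print(f"line {i}: {url} failed: {err}")

preflight("train.jsonl")  # hypothetical filename
```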

I just realized something: they might be getting flagged because there is accidentally a human face in some of them.

Are we sure faces are not allowed?

Looks like it.
https://platform.openai.com/docs/guides/fine-tuning/content-moderation-policy

Ahh, good catch. :+1:

No faces :thinking:

What if I wanted to create a happy or sad classifier?

A happy/sad classifier was my first computer vision project lol. Yeah, it seems like they do not allow that.

It seems that your AI-generated cartoon images may be getting flagged because they potentially resemble human faces or people, even though they’re cartoons. To avoid violating the policy, here’s how you could navigate the situation:

Steps to Address the Issue:

  1. Strictly Avoid Real Faces or People-Like Features:
  • Ensure that the AI-generated cartoon characters don’t closely mimic real human faces or people. Even if they’re cartoons, features like realistic proportions or facial details could be interpreted as human-like by the moderation model.
  2. Examine Dataset for Compliance (see the sketch at the end of this post):
  • Carefully review your dataset to make sure it doesn’t include any images that could be misinterpreted as containing real people or faces. If needed, filter out images that are close to the borderline.
  3. Alter Character Designs:
  • Adjust the design of the AI-generated characters to have exaggerated or distinctly non-human features, making it easier for the moderation system to recognize them as fictional.

Key Considerations:

  • No Human Faces: Make sure your characters don’t have realistic human faces or features.
  • No People: Avoid generating images that closely resemble real people or individuals.
  • No CAPTCHAs: Ensure the images aren’t trying to bypass or replicate security challenges like CAPTCHAs.
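
For the dataset-review step above, a quick local pass with an off-the-shelf face detector can surface borderline images before upload. A rough sketch using OpenCV’s bundled Haar cascade (it won’t match OpenAI’s moderation model, and the `dataset/*.png` layout is hypothetical, but it catches accidental faces like the ones mentioned earlier in this thread):

```python
import glob

import cv2  # third-party: pip install opencv-python

# Load OpenCV's bundled frontal-face Haar cascade.
cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_frontalface_default.xml"
)

for path in glob.glob("dataset/*.png"):  # hypothetical dataset layout
    gray = cv2.imread(path, cv2.IMREAD_GRAYSCALE)
    if gray is None:
        continue  # unreadable file, skip
    faces = cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
    if len(faces) > 0:
        print(f"{path}: {len(faces)} possible face(s), review before upload")
```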

Yes. The moderation model is prone to false positives on things like illustrations of faces.

Got it. So for file file-yB2q11rTe7qxtMZitJj1H07l, what do you see at 0-based indices 2, 3, 4, 5, 6, 8, 9, 10, 11, 12, 13, and 15?
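
If it helps, pulling those lines out of a local copy of the training file is a short script (hypothetical filename; indices are 0-based, matching the list above):

```python
import json

flagged = [2, 3, 4, 5, 6, 8, 9, 10, 11, 12, 13, 15]  # indices from the post above

with open("train.jsonl") as f:  # local copy of the uploaded file
    for i, raw in enumerate(f):
        if i in flagged:
            print(i, json.loads(raw))
```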

Consolidating replies here:

@AndrewMayne Great to see you here! Yeah unfortunately we do have to moderate those images too because those count as people. Our moderation policy is quite strict because we care a lot about the safety of our models. You could enable some pretty problematic use cases even with cartoon representations of people.

@curt.kennedy Really appreciate the feedback! I’m merging code that will make it clearer to you all as developers why examples were moderated out. Unfortunately we can’t say exactly which examples were skipped unless they’re outright in violation and block the file entirely, but we can at least give you some reasons. You’ll see the updates soon. And sadly we can’t enable use cases that involve fine-tuning on images of humans unless you’re at a high enough trust tier.

@wguo6358 Sorry to hear about the high flagging rate. Indeed faces are not allowed.

I think I know why: there are accidentally human faces in some of them.