How to avoid being blocked when trying to filter potentially harmful content?

kruemelmonster77 · March 18, 2025, 9:47am

Hi all,

we need to process user comments for content moderation.

Unfortunately the moderation API is not sufficient as it lacks certain criterias we use to mark “negative” content.

Therefore we use the completions API with a system prompt like “Please provide a score between 1 and 10 to the following text and rate down if following criterias are matched [criterias follow].”

Now we received a warning from the OpenAI team because we are submitting harmful content.

The problem is: we need to submit harmful content as we want to filter it out.

Are there any possibilities to avoid being blocked by OpenAI?

Regards

Topic		Replies	Views
User Content Review and Analysis API gpt-4	4	608	February 7, 2024
Tips for "filtering" content submitted by user message Community	3	2498	April 2, 2023
Chat Input without moderation and risks for the account API	0	562	May 7, 2023
Spam checking illegal content? API	8	311	October 3, 2024
Prevent illegal activities? API chatgpt , api	5	974	December 20, 2023

How to avoid being blocked when trying to filter potentially harmful content?

Related topics