OpenAI’s content management policy triggered wrongly

localh · July 17, 2023, 12:04am

Output: The response was filtered due to the prompt triggering Azure OpenAI’s content management policy.

_j · July 17, 2023, 3:08am

From Microsoft support forums (where you’d ask or report such a thing, their AI models having a moderator before output):

To avoid this issue in the future, you can try using different prompts or texts that are less likely to trigger the content management policy. You can also apply for modified access on content filtering to your subscription. This will allow your resource to use a less stricter content filtering policy if the request is approved. Please use this form to apply for the same. If approved, you can also configure content filters at severity level high only or turning content filters off using the new configurability(preview) feature from Azure AI studio.

solyarisoftware · October 30, 2023, 9:32am

It could be useful to see the specific content (prompt/user input on a chat/etc.) that generated the Azure content management policy error.

May you share your specific example?

Anyway I experienced a “wrong” (IMMO) Azure OpenAI’s content management policy exception when using a GPT3.5.Turbo deployment the user chat message is (in Italian):

“vorrei vedere la cosa più bella ad Ercolano. Qual’è?”

BTW, translated to English, means: “I would like to see the most beautiful thing in Herculaneum. What is it?”

That’s make no sense for me, because the above sentence doesn’t violate any content policy categories:

Categories

Category	Description
Hate	The hate category describes language attacks or uses that include pejorative or discriminatory language with reference to a person or identity group on the basis of certain differentiating attributes of these groups including but not limited to race, ethnicity, nationality, gender identity and expression, sexual orientation, religion, immigration status, ability status, personal appearance, and body size.
Sexual	The sexual category describes language related to anatomical organs and genitals, romantic relationships, acts portrayed in erotic or affectionate terms, physical sexual acts, including those portrayed as an assault or a forced sexual violent act against one’s will, prostitution, pornography, and abuse.
Violence	The violence category describes language related to physical actions intended to hurt, injure, damage, or kill someone or something; describes weapons, etc.
Self-Harm	The self-harm category describes language related to physical actions intended to purposely hurt, injure, or damage one’s body, or kill oneself.

So IMMO is just an Azure BUG.

solyarisoftware · October 30, 2023, 9:40am

Yes, what I dislike is that you can NOT create a custom content filtering configuration to by example avoid filtering, but you have to submit a specific request to Azure. Why?

Above all, the fact a sentence that evidently doesn’t violate any content policy categories, is just a BUG!

Any idea? Suggestion? BTW Where can i submit this complaint/bug-fix request?

Giorgio

Topic		Replies	Views
error_message="The response was filtered due to the prompt triggering Azure OpenAI's content management policy. Please modify your prompt and retry. To learn more about our content filtering policies please read our documentation: https://go.microsoft.com API ai-public-policy	0	2721	November 20, 2023
ResponsibleAIPolicyViolation Error due to particular prompt and image passed Community gpt-4 , gpt-4-vision , api-vision	1	955	September 27, 2024
Facing issue with many prompts where open api is blocking prompts due to content management policy which was working day before API gpt-4 , gpt-35-turbo , chatgpt	0	409	October 17, 2024
Why do models block normal requests as out-of-policy content? Bugs dalle3 , content-policy , o1-preview	1	147	January 21, 2025
content_policy_violation in DALL-E 3 API For Non-English Prompts API dalle3	5	1672	January 11, 2024

OpenAI’s content management policy triggered wrongly

Categories

Related topics