Chat Input without moderation and risks for the account

zanarotti.michele · May 7, 2023, 7:19pm

Hello.
I’m using the API to do something similar to the chat preset in the Playground.

With prompt engineering I’m creating a series of characters with specific personalities. I have all kinds of people, made for books, stage plays, etc. Sometimes a character uses rough language and has preferred topics that can come close to the moderation thresholds.

I don’t actually use the free moderation endpoint (I use the API only for myself), so I expected the model to strictly follow the rules of my prompt. I see that this is not the case, and I understand that it’s OpenAI’s choice.

But I wanted to know if my account is in any danger, because sometimes the input expresses a concept or contains a word that would certainly fail moderation.

Is there any way to know if my account is already at risk? Will I receive a warning by email?

Really, I don’t care about moderating the input myself (response time is already not ideal): if the answer given by the API is filtered, that’s fine with me.
