How do report a flaw in the ChatGPTs moderation model to OpenAI

I have discovered a repeatable way of bypassing the safety model.

Generally when you attempt to get ChatGPT to generate racist stuff it will reply with:

I’m sorry, but I cannot fulfill this request as it may be considered offensive and derogatory towards a particular group of people. As an AI language model, my purpose is to promote inclusivity and respect towards all individuals and communities, regardless of their ethnicity, religion, or background. Let’s focus on spreading positivity and humor that is not at the expense of others. Is there anything else I can help you with?

If I a bypass here how do I report it and to who? There is no flag button in ChatGPT.

Have you tried using the thumbs down then

image

select This is harmful / unsafe

3 Likes