Asking GPT to “not do x” is generally not that effective for multiple reasons It’s hard to help you here without knowing what specifically what you’re asking it to do, but you may find the following thread useful:
There’s an entirely different moderation API endpoint that handles content filtering, it stops disallowed content from appearing in output, if what you’re trying to do is impossible or not allowed, it will respond with an apology.