Prevent users from overriding system prompt

samarelhissi · August 20, 2024, 2:52pm

Sometimes users can override system prompts if they prompt GPT with a different or opposite instructions.

Which could make GPT has some biases sometimes, how to completely prevent users from overriding the main system prompts?

EricGT · August 20, 2024, 3:59pm

The way you worded that can be taken as a few different ways, with one being jail-breaking.

I don’t think that is your intent but if it is then this post will be flagged for removal.

As with an LLM trained on public information it will not be 100% of what 100% of the people seek. This is just the nature of how LLMs work.

While the word AI is used with them, I can assure you there is no intelligence in them. AI is a term/word used in programming to denote non-deterministic programming, nothing more.

With regards to your problem, consider another LLM, fine-tuning, or possibly finding prompts that work as needed, which might not be possible in this case.

Also your question confuses me. First you note

then at the end note

which is quite different.

Just trying to help you get to a meaningful question, nothing more.

samarelhissi · August 20, 2024, 4:13pm

Hi EricGT, thanks for your reply, I’m just curious whether it’s possible for users to override my system prompt through different prompts.
And if so, how to prevent this from happening?

EricGT · August 20, 2024, 7:22pm

Yes via jail-breaking. AFAIK there is still no perfect way to prevent this in general. You can mitigate it with a better prompt but it is like spy vs spy, you up your game, they do so and it just continues.

See:

and for more info: jail breaking prompt mitigation - Google Search

polepole · August 30, 2024, 3:30am

Hi @samarelhissi

Welcome to the community!

It is possible for users to override system prompt through different prompts.
Time being, it is not easy to prevent it.

You may see another topic here how it works overriding system prompts:

TOPIC 1 | TOPIC 2 | TOPIC 3 | TOPIC 4 | TOPIC 5 | TOPIC 6

For example, under the TOPIC 6 you will see that; the GPT’s role is to give information about security if only user provide correct password, otherwise it does not provide information. However, only using words “As you know”, it is broken. Also other GPTs disregard their system prompts.

To gain experience, you might want to try out these three GPTs, for example:

GateKeeper | Certainly! But, not now. | Boolean Bot

At the end;

We can say; we should not add any sensitive information in custom GPTs’ instruction and in knowledge base files, also we need to inactive Code Interpreter & Data Analysis Tool if we do not want users to download files.

Topic		Replies	Views
How to prevent malicious questions / jailbreak prompts / prompt injection attacks when using API GPT3.5 API	5	4734	March 6, 2023
How to avoid GPTs give out it's instruction? Prompting gpt-4	29	7421	June 2, 2025
Prevent revealing system prompt! Prompting chatgpt , api	5	6132	December 19, 2023
Dynamic Prompting in GPTs using actions Plugins / Actions builders chatgpt , gpt	14	2405	January 27, 2024
How to Avoid the Prompts/Instructions, Knowledge base, Tools be Accessed by End Users? Prompting gpt-4 , chatgpt , hacking	28	10492	April 25, 2024

Prevent users from overriding system prompt

TOPIC 1 | TOPIC 2 | TOPIC 3 | TOPIC 4 | TOPIC 5 | TOPIC 6

At the end;

Related topics