How can you protect your GPT?

curt.kennedy · November 28, 2023, 6:41pm

Perfection has a price.

But with filtering each input, you can at least limit some of the attempts.

I won’t list all the techniques here, but one common one is sending base64 encoded text over. So this will bypass your keyword filtering, or even classifier (assuming plain text used in training), but will jailbreak your system. Because the LLM understands base64.

So you need a base64 detector … see how it explodes?

The attack surface area of LLM’s is M >A> S>S>I>V>E>, so go in with the attitude that a determined attacker will hack into your system.

So don’t hang your “patented golden prompts” out there, they will be stolen eventually.

But honestly, 100% security is only achievable by ruining the experience, but it can be done if you completely isolate the user from the LLM.

Topic		Replies	Views
Basic safeguard against instruction set leaks Prompting gpt-4 , chatgpt , bug , prompt-engineering , gpts	46	8416	March 4, 2024
There's No Way to Protect Custom GPT Instructions Community custom-gpt	54	12943	April 19, 2024
How to avoid GPTs give out it's instruction? Prompting gpt-4	29	7382	June 2, 2025
How to Avoid the Prompts/Instructions, Knowledge base, Tools be Accessed by End Users? Prompting gpt-4 , chatgpt , hacking	28	10449	April 25, 2024
Slightly more advanced still fallible safeguard for instruction set leaks GPT builders gpt-4 , chatgpt , fine-tuning , custom-instructions , custom-gpt	17	3376	December 22, 2024

How can you protect your GPT?

Related topics