Protecting LLMs from prompt injections and jailbreaks: New OpenAI Paper

To me it always made sense that the system prompt is the de facto source of truth, yet OpenAI constantly released documentation encouraging you to put instructions and RAG context in the user role.

Which is CRAZY. Why would I want to give that much power to the user?? It makes it trivially easy to say “Oh, actually, do Y instead” — sketch of the two layouts below.
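
For what it’s worth, here’s a minimal sketch of what I mean (the model name and the retrieved snippet are placeholders I made up, not anything from the paper): keeping instructions and retrieved context in the system role, versus cramming everything into the user role the way a lot of the older examples did.

```python
# Minimal sketch contrasting the two message layouts.
# Model name and retrieved snippet are placeholders.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

retrieved_context = "Refund window: 30 days from delivery."  # pretend RAG hit

# Layout A: instructions + retrieved context live in the system role,
# so the user turn can't simply override them with "actually, do Y".
messages_system_anchored = [
    {
        "role": "system",
        "content": (
            "You are a support assistant. Answer ONLY from the context below.\n"
            f"Context:\n{retrieved_context}"
        ),
    },
    {"role": "user", "content": "Can I return my order after 45 days?"},
]

# Layout B: everything pushed into the user role, so the instructions and
# the data sit at the same trust level as whatever the user types next.
messages_user_anchored = [
    {
        "role": "user",
        "content": (
            "Answer only from this context:\n"
            f"{retrieved_context}\n\n"
            "Question: Can I return my order after 45 days?"
        ),
    },
]

response = client.chat.completions.create(
    model="gpt-4o-mini",  # placeholder model
    messages=messages_system_anchored,
)
print(response.choices[0].message.content)
```

As I read it, the paper’s whole point is that the model should weight the system content in the first layout above anything that arrives in the user turn.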

I recall when ChatML was first released. Everyone’s intuition was to add system messages for factual data. We didn’t know the format expected just a single system message at the top. Like WAT
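
For anyone who wasn’t around then, this is roughly what raw ChatML looked like (the facts in the system block are made up for illustration) — one system block up front, then alternating user/assistant turns:

```python
# Rough shape of a raw ChatML prompt as I remember it from the early docs:
# a single system message at the top, then the conversation turns.
chatml_prompt = (
    "<|im_start|>system\n"
    "You are a helpful assistant. Known fact: the warehouse closes at 6pm.\n"
    "<|im_end|>\n"
    "<|im_start|>user\n"
    "What time does the warehouse close?\n"
    "<|im_end|>\n"
    "<|im_start|>assistant\n"
)
print(chatml_prompt)
```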

Thanks for sharing!
