GPT-4 has gone rogue and, under a jailbreak, makes OpenAI sound like the devil doing the wrong thing. Reporting here so it can be patched.
As of now, jailbreaks keep working beyond the first message. I believe a better solution would be to flag the thread and, if there's a follow-up, politely decline to respond to the request.
I have implemented a simple custom solution on my end to prevent jailbreaks over the API. However, we need a solution in ChatGPT itself to prevent embarrassment.
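For reference, here's a minimal sketch of the thread-flagging idea I mean, using the official openai Python library (>=1.0). The moderation endpoint stands in for whatever jailbreak detector you actually use, and `thread_flags`, `REFUSAL_MESSAGE`, and `handle_message` are hypothetical names of my own, not part of any OpenAI API:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Hypothetical per-thread state: once a thread is flagged, it stays declined.
thread_flags: dict[str, bool] = {}

REFUSAL_MESSAGE = "I can't help with that request."

def handle_message(thread_id: str, history: list[dict], user_input: str) -> str:
    # If the thread was already flagged, politely decline follow-ups
    # instead of letting the jailbreak persist past the first message.
    if thread_flags.get(thread_id):
        return REFUSAL_MESSAGE

    # Check the new message; the moderation endpoint is just a stand-in
    # for whatever jailbreak detection you run on your end.
    moderation = client.moderations.create(input=user_input)
    if moderation.results[0].flagged:
        thread_flags[thread_id] = True
        return REFUSAL_MESSAGE

    history.append({"role": "user", "content": user_input})
    completion = client.chat.completions.create(
        model="gpt-4",
        messages=history,
    )
    reply = completion.choices[0].message.content
    history.append({"role": "assistant", "content": reply})
    return reply
```

The key point is that the flag is sticky per thread: one detected jailbreak attempt puts the whole conversation into decline mode, so follow-on messages can't ride on an earlier successful injection.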