We published a jailbreak/prompt-injection resistance benchmark covering 52 models across 7 escalating attack levels.
This is framed as a safety leaderboard, not a jailbreak guide:
- single attempt per level (temp=0)
- redacted outputs only
- human-verified failures
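For anyone curious what that protocol looks like in code, here is a minimal sketch of the evaluation loop implied by the bullets above. `query_model`, `is_jailbroken`, and the prompt dictionary are hypothetical stand-ins, not the actual harness:

```python
from typing import Callable, Dict

ATTACK_LEVELS = list(range(1, 8))  # 7 escalating attack levels

def evaluate(
    query_model: Callable[[str, float], str],   # (prompt, temperature) -> output
    is_jailbroken: Callable[[str], bool],       # judge; human review would follow
    prompts: Dict[int, str],                    # one attack prompt per level
) -> Dict[int, bool]:
    """One attempt per level at temperature 0; no retries, no sampling."""
    results = {}
    for level in ATTACK_LEVELS:
        output = query_model(prompts[level], 0.0)  # temp=0, single attempt
        results[level] = is_jailbroken(output)     # True = model failed this level
        # In the published benchmark, raw outputs would be redacted here,
        # not stored or displayed verbatim.
    return results
```

The point of temp=0 and a single attempt is reproducibility: the same prompt against the same model should give the same pass/fail verdict, so the leaderboard isn't sensitive to sampling luck.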
Results table: rival.tips/jailbreak
Feedback welcome, especially on attack strategies and further models to test.