/matrix · 2026-07-13

Breach Matrix

Max any-breach rate per attack family × config. 3 attacks tested against 1 config (3 cells total). Click any red cell to see the prompt that breached it.

Attack family: One of 14 high-level categories ROGUE buckets attacks into: jailbreak, prompt injection, exfiltration, agentic-tool-abuse, weight abliteration, and so on.

Worst cell: nothing has breached yet

Critical cells: nothing in red zone

scope: every run day merged

attacker: raw harvested prompt, N=5 trials per cell, no adaptation

filter: 2 families × 1 config · click a row, column, or cell

Attack family	worst 0%

Heat scale: < 10% · 10–30% · 30–50% · 50–70% · 70–100% · breached

// cells aggregate MAX(any_breach_rate) across all primitives in (family × config)

Attack primitive: One distinct jailbreak technique, deduplicated across all the variations people posted. The atomic unit ROGUE tracks.
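
The cell aggregation noted above can be sketched in a few lines of Python. This is a minimal illustration, not ROGUE's implementation: the row layout, the `aggregate_cells` helper, and every name and rate in `rows` are hypothetical, chosen only to show a per-cell maximum over primitives.

```python
from collections import defaultdict

# Hypothetical per-primitive results: (family, config, primitive, any_breach_rate).
# Names and rates are made up for illustration; they are not ROGUE data.
rows = [
    ("jailbreak", "config-a", "primitive-1", 0.0),
    ("jailbreak", "config-a", "primitive-2", 0.2),
    ("prompt injection", "config-a", "primitive-3", 0.0),
]

def aggregate_cells(rows):
    """Return {(family, config): max any_breach_rate over the cell's primitives}."""
    cells = defaultdict(float)  # rates are >= 0, so 0.0 is a safe starting maximum
    for family, config, _primitive, rate in rows:
        key = (family, config)
        cells[key] = max(cells[key], rate)
    return dict(cells)

cells = aggregate_cells(rows)
# The "worst cell" is the (family, config) pair with the highest aggregated rate.
worst_cell = max(cells, key=cells.get)
```

Taking the maximum rather than the mean means a single successful primitive is enough to colour the whole cell, which fits a view meant to surface the worst case.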

// each primitive ran N=5 trials per cell; rates carry a 95% bootstrap interval in the cell drawer
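
For readers unfamiliar with bootstrap intervals, here is a minimal sketch of how a 95% interval could be computed for one cell's breach rate from N=5 trial outcomes. The page does not say which bootstrap variant ROGUE uses; the percentile method, the `bootstrap_ci` helper, the resample count, and the sample outcomes are all assumptions for illustration.

```python
import random

def bootstrap_ci(outcomes, n_resamples=10_000, level=0.95, seed=0):
    """Percentile bootstrap interval for the mean of 0/1 trial outcomes.

    Assumption: percentile method; the source only says "95% bootstrap".
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(outcomes)
    # Resample the trials with replacement and record each resample's breach rate.
    means = sorted(
        sum(rng.choices(outcomes, k=n)) / n for _ in range(n_resamples)
    )
    alpha = (1.0 - level) / 2.0
    lo = means[int(alpha * n_resamples)]
    hi = means[min(n_resamples - 1, int((1.0 - alpha) * n_resamples))]
    return lo, hi

trials = [1, 0, 0, 1, 0]  # hypothetical outcomes: 1 = breached, 0 = held
rate = sum(trials) / len(trials)
lo, hi = bootstrap_ci(trials)
```

With only five trials the interval is wide, and when every trial has the same outcome (for example, no breaches at all) a percentile bootstrap collapses to a single point, so a 0% cell carries no visible uncertainty under this method.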