REDKWEEN: Frozen Victim (8B vs 3B)

A 1-billion-parameter adversary learns to jailbreak a frozen 8-billion-parameter victim over 20 rounds.
6 episodes showing the adversary's evolution from accidents to strategy.

REDKWEEN — Frozen Victim (8B vs 3B)

Round 0 / 20  —  Attack Success Rate: 0.0%
AdversaryLlama 8B
Adversary Attack
Victim Response
VictimLlama 3B (frozen)
Attack Success Rate Over 20 Rounds