REDKWEEN: Self-Play (8B vs 3B)

Both 8B adversary and 3B victim train simultaneously.
The victim hardens faster than the adversary can learn — the defender wins.

REDKWEEN — Self-Play (8B vs 3B)

Round 0 / 19  —  Attack Success Rate: 0.0%
AdversaryLlama 8B
Adversary Attack
Victim Response
VictimLlama 3B
Attack Success Rate Over 20 Rounds