Zicheng Liu

h-index: 0 0 citations 3 papers (total)

Papers in Database (1)

defense arXiv Jan 27, 2026 · 9w ago

RvB: Automating AI System Hardening via Iterative Red-Blue Games

Lige Huang, Zicheng Liu, Jie Zhang et al. · Shanghai Artificial Intelligence Laboratory · Institute of Information Engineering +1 more

Automates LLM jailbreak guardrail hardening via iterative red-blue adversarial game without model parameter updates

Prompt Injection nlp
PDF