Zhou Yu

h-index: 3 70 citations 9 papers (total)

Papers in Database (1)

defense arXiv Oct 6, 2025 · Oct 2025

Proactive defense against LLM Jailbreak

Weiliang Zhao, Jinjun Peng, Daniel Ben-Levi et al. · Columbia University

Proactive LLM defense generates spurious jailbreak-success signals to terminate attacker optimization loops prematurely

Prompt Injection nlp
2 citations PDF