Qi Lei

h-index: 1 · 2 citations · 4 papers (total)

Papers in Database (1)

defense · EMNLP · Sep 29, 2025 · Sep 2025

Think Twice, Generate Once: Safeguarding by Progressive Self-Reflection

Hoang Phan, Victor Li, Qi Lei · New York University

Inference-time jailbreak defense using progressive self-reflection reduces LLM attack success rates from ~80% to under 6%

Prompt Injection nlp
1 citation