Hoang Phan

h-index: 8 · 155 citations · 21 papers (total)

Papers in Database (1)

defense · EMNLP · Sep 29, 2025 · Sep 2025

Think Twice, Generate Once: Safeguarding by Progressive Self-Reflection

Hoang Phan, Victor Li, Qi Lei · New York University

Inference-time jailbreak defense using progressive self-reflection reduces LLM attack success rates from ~80% to under 6%

Prompt Injection · nlp
1 citation · PDF