ChenYu Wu

h-index: 1 4 citations 6 papers (total)

Papers in Database (1)

defense arXiv Oct 16, 2025 · Oct 2025

Active Honeypot Guardrail System: Probing and Confirming Multi-Turn LLM Jailbreaks

ChenYu Wu, Yi Wang, Yang Liao · The University of Tokyo · Xi’an Jiaotong University

Proactive honeypot defense uses a fine-tuned bait model to lure multi-turn LLM jailbreak attackers into revealing malicious intent

Prompt Injection nlp
PDF