Kwok Yan Lam

h-index: 3 · 43 citations · 10 papers (total)

Papers in Database (1)

attack · arXiv · Oct 24, 2025

The Trojan Example: Jailbreaking LLMs through Template Filling and Unsafety Reasoning

Mingrui Liu, Sixiao Zhang, Cheng Long et al. · Nanyang Technological University

Black-box jailbreak exploiting safety-reasoning decoupling via template-filling, achieving 97–100% ASR on GPT-4o, Gemini, and DeepSeek

Prompt Injection · nlp
2 citations