Jiangyi Deng

h-index: 7 172 citations 15 papers (total)

Papers in Database (2)

defense arXiv Dec 9, 2025 · Dec 2025

Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs

Yinan Zhong, Qianhao Miao, Yanjiao Chen et al. · Zhejiang University

Attention-based defense framework detects and sanitizes indirect prompt injection attacks in LLMs at token level

Prompt Injection nlp
2 citations PDF
defense arXiv Oct 18, 2025 · Oct 2025

Patronus: Safeguarding Text-to-Image Models against White-Box Adversaries

Xinfeng Li, Shengyuan Pang, Jialin Wu et al. · Nanyang Technological University · Zhejiang University +1 more

Defends text-to-image diffusion models against white-box fine-tuning attacks via non-fine-tunable safety alignment and feature-level input moderation

Transfer Learning Attack visiongenerative
PDF