Jie Zhang

Papers in Database (3)

defense arXiv Aug 4, 2025 · Aug 2025

PoseGuard: Pose-Guided Generation with Safety Guardrails

Kongxin Wang, Jie Zhang, Peigui Qi et al. · University of Science and Technology of China · A*STAR +1 more

Embeds safety guardrails into pose-guided video diffusion models to suppress deepfakes, NSFW content, and impersonation at inference

Output Integrity Attack visiongenerative
PDF
attack arXiv Apr 8, 2026 · 6w ago

CAAP: Capture-Aware Adversarial Patch Attacks on Palmprint Recognition Models

Renyang Liu, Jiale Li, Jie Zhang et al. · National University of Singapore · A*STAR +3 more

Physical adversarial patch attack on palmprint recognition using cross-shaped patches that survive real-world capture distortions

Input Manipulation Attack vision
PDF Code
defense arXiv Aug 28, 2025 · Aug 2025

Token Buncher: Shielding LLMs from Harmful Reinforcement Learning Fine-Tuning

Weitao Feng, Lixu Wang, Tianyi Wei et al. · Nanyang Technological University · A*STAR +1 more

Defends LLM safety alignment against RL fine-tuning attacks by suppressing response entropy via TokenBuncher

Transfer Learning Attack Prompt Injection nlpreinforcement-learning
PDF