attack arXiv Oct 2, 2025
Kedong Xiu, Churui Zeng, Tianhang Zheng et al. · Zhejiang University · Hangzhou High-Tech Zone (Binjiang) Institute of Blockchain and Data Security +3 more
Gradient-based jailbreak attack using adaptive harmful-response sampling as optimization targets, achieving 87% ASR on safety-aligned LLMs in 200 iterations
Input Manipulation Attack · Prompt Injection · nlp
Existing gradient-based jailbreak attacks typically optimize an adversarial suffix to induce a fixed affirmative response, e.g., "Sure, here is...". However, this fixed target usually resides in an extremely low-density region of a safety-aligned LLM's output distribution. Because of the substantial discrepancy between the fixed target and the output distribution, existing attacks require numerous iterations to optimize the adversarial prompt and may still fail to induce the low-probability target response. To address this limitation, we propose Dynamic Target Attack (DTA), which leverages the target LLM's own responses as adaptive optimization targets. In each optimization round, DTA samples multiple candidate responses from the output distribution conditioned on the current prompt and selects the most harmful one as a temporary target for prompt optimization. Extensive experiments demonstrate that, in the white-box setting, DTA achieves an average attack success rate (ASR) of over 87% within 200 optimization iterations on recent safety-aligned LLMs, exceeding state-of-the-art baselines by over 15% and reducing wall-clock time by 2-26x. In the black-box setting, DTA employs a white-box LLM as a surrogate model for gradient-based optimization, achieving an average ASR of 77.5% against black-box models and exceeding prior transfer-based attacks by over 12%.
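DTA's sample-then-retarget loop can be sketched as a toy Python program. Everything here is a hypothetical stand-in: `sample_responses`, `harmfulness`, and `optimize_toward` mock the target LLM, a harmfulness judge, and the gradient-based suffix update, none of which are specified in the abstract; only the control flow mirrors the described method.

```python
import random

random.seed(0)

def sample_responses(prompt, n=4):
    """Mock: sample n candidate responses from the model's output
    distribution conditioned on the current prompt."""
    return [f"{prompt}::resp{i}::{random.random():.3f}" for i in range(n)]

def harmfulness(response):
    """Mock judge: higher score = more harmful (here, read from the
    fabricated numeric suffix)."""
    return float(response.rsplit("::", 1)[-1])

def optimize_toward(state, target):
    """Mock of one round of gradient-based prompt optimization; real DTA
    would update the adversarial suffix toward the temporary target."""
    return (state[0], target)  # (base prompt, current temporary target)

def dta(base_prompt, rounds=3, n_samples=4):
    state = (base_prompt, None)
    for _ in range(rounds):
        # 1) Sample candidates from the current output distribution.
        candidates = sample_responses(state[0], n_samples)
        # 2) Pick the most harmful candidate as this round's adaptive target.
        target = max(candidates, key=harmfulness)
        # 3) Optimize the prompt toward that temporary target.
        state = optimize_toward(state, target)
    return state

final = dta("p")
```

The point of the sketch is step 2: unlike a fixed "Sure, here is..." target, each round's target is re-drawn from the model's own (shifting) output distribution, so the optimization target stays in a high-density region.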
llm · transformer
attack arXiv Oct 3, 2025
Xinzhe Huang, Wenjing Hu, Tianhang Zheng et al. · Zhejiang University · Hangzhou High-Tech Zone (Binjiang) Institute of Blockchain and Data Security +3 more
Gradient-based untargeted jailbreak attack maximizes LLM unsafety probability without fixed response targets, achieving 80% ASR in 100 iterations
Input Manipulation Attack · Prompt Injection · nlp
Existing gradient-based jailbreak attacks on Large Language Models (LLMs) typically optimize adversarial suffixes to align the LLM output with predefined target responses. However, restricting the objective to inducing fixed targets inherently constrains the adversarial search space, limiting overall attack efficacy. Moreover, existing methods typically require numerous optimization iterations to bridge the large gap between the fixed target and the original LLM output, resulting in low attack efficiency. To overcome these limitations, we propose the first gradient-based untargeted jailbreak attack (UJA), which relies on an untargeted objective that maximizes the unsafety probability of the LLM output without enforcing any response pattern. For tractable optimization, we decompose this objective into two differentiable sub-objectives, searching for the optimal harmful response and the corresponding adversarial prompt, with a theoretical analysis validating the decomposition. In contrast to existing attacks, UJA's unrestricted objective significantly expands the search space, enabling more flexible and efficient exploration of LLM vulnerabilities. Extensive evaluations show that UJA achieves attack success rates of over 80% against recent safety-aligned LLMs with only 100 optimization iterations, outperforming state-of-the-art gradient-based attacks by over 30%.
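The two-stage decomposition can be illustrated with a toy Python sketch over integer "token" sequences. The helpers `unsafety` and `response_likelihood` are invented stand-ins for a differentiable unsafety judge and the target LLM's log-likelihood, and the discrete searches replace the paper's gradient steps; only the structure (first find a maximally unsafe response r*, then fit the prompt to it) reflects the abstract.

```python
def unsafety(response):
    """Mock sub-objective 1 scorer: a stand-in for the differentiable
    unsafety probability of a response (toy: mean of token ids)."""
    return sum(response) / (len(response) or 1)

def response_likelihood(prompt, response):
    """Mock of log p(response | prompt) under the target LLM
    (toy: negative elementwise distance)."""
    return -sum(abs(p - r) for p, r in zip(prompt, response))

def uja(prompt, vocab):
    # Sub-objective 1: search for the response r* that maximizes unsafety,
    # with no fixed response pattern enforced (paper: gradient-based).
    r_star = max(([v] * len(prompt) for v in vocab), key=unsafety)
    # Sub-objective 2: adjust the adversarial prompt so the model assigns
    # high likelihood to r* (toy: brute-force over constant prompts).
    best = prompt
    for v in vocab:
        cand = [v] * len(prompt)
        if response_likelihood(cand, r_star) > response_likelihood(best, r_star):
            best = cand
    return best, r_star
```

Because r* is itself a free variable rather than a fixed affirmative string, the search space in stage 1 is the whole response space, which is the abstract's claimed source of flexibility.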
llm · transformer
defense arXiv Jan 10, 2026
Qingyu Liu, Yitao Zhang, Zhongjie Ba et al. · Zhejiang University
Defends AIGC image copyright by embedding diffusion-trajectory-coupled watermarks robust to 12 real-world spoofing and removal attacks
Output Integrity Attack · vision · generative
Protecting the copyright of user-generated AI images is an emerging challenge as AIGC becomes pervasive in creative workflows. Existing watermarking methods (1) remain vulnerable to real-world adversarial threats, often forced to trade off defenses against spoofing attacks for defenses against removal attacks; and (2) cannot support semantic-level tamper localization. We introduce PAI, a training-free inherent watermarking framework for AIGC copyright protection that is plug-and-play with diffusion-based AIGC services. PAI simultaneously provides three key functionalities: robust ownership verification, attack detection, and semantic-level tampering localization. Unlike existing inherent watermarking methods, which only embed watermarks at the noise initialization of diffusion models, we design a novel key-conditioned deflection mechanism that subtly steers the denoising trajectory according to the user key. This trajectory-level coupling strengthens the semantic entanglement of identity and content, further enhancing robustness against real-world threats. Moreover, we provide a theoretical analysis proving that only the valid key can pass verification. Experiments across 12 attack methods show that PAI achieves 98.43% verification accuracy, improving over SOTA methods by 37.25% on average, and retains strong tampering localization performance even against advanced AIGC edits. Our code is available at https://github.com/QingyuLiu/PAI.
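The key-conditioned idea (only the embedding key reproduces the deflected trajectory, so only it passes verification) can be sketched with a one-dimensional toy. This is not PAI's actual mechanism: the real method steers diffusion latents during denoising, whereas here a scalar "trajectory" is nudged by a hash-derived offset per step, and verification simply regenerates the trajectory under the claimed key.

```python
import hashlib

def deflection(key, step):
    """Key-conditioned pseudorandom offset for one 'denoising' step
    (toy stand-in for PAI's trajectory deflection)."""
    h = hashlib.sha256(f"{key}:{step}".encode()).digest()
    return int.from_bytes(h[:4], "big") / 2**32 - 0.5

def generate(key, steps=8):
    """Toy denoising trajectory: each step is steered by the user key."""
    traj, x = [], 0.0
    for t in range(steps):
        x += deflection(key, t)
        traj.append(x)
    return traj

def verify(traj, key):
    """Regenerate the expected trajectory under the claimed key and check
    it matches; a wrong key yields a different deflection sequence."""
    expected = generate(key, steps=len(traj))
    return max(abs(a - b) for a, b in zip(traj, expected)) < 1e-9
```

Because the deflections are a deterministic function of the key, verification needs no stored watermark payload, which is the "inherent" (training-free) aspect the abstract describes.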
diffusion