Latest papers

2 papers
attack arXiv Jan 30, 2026 · 9w ago

The Illusion of Forgetting: Attack Unlearned Diffusion via Initial Latent Variable Optimization

Manyi Li, Yufan Liu, Lai Jiang et al. · University of the Chinese Academy of Sciences · Chinese Academy of Sciences +2 more

Attacks machine unlearning defenses in diffusion models by optimizing initial latent variables to reactivate erased NSFW knowledge

Input Manipulation Attack visiongenerative
PDF Code
attack arXiv Dec 16, 2025 · Dec 2025

IntentMiner: Intent Inversion Attack via Tool Call Analysis in the Model Context Protocol

Yunhao Yao, Zhiqiang Wang, Haoran Cheng et al. · University of Science and Technology of China · Beijing University of Aeronautics and Astronautics

Attacks LLM agent privacy by showing semi-honest MCP servers reconstruct user intent from tool call metadata alone

Insecure Plugin Design Sensitive Information Disclosure nlp
PDF