Latest papers

3 papers
defense arXiv Mar 19, 2026 · 18d ago

Beyond Passive Aggregation: Active Auditing and Topology-Aware Defense in Decentralized Federated Learning

Sheng Pan, Niansheng Tang · Yunnan University

Active auditing framework using stochastic probes to detect adaptive backdoors in decentralized federated learning networks

Model Poisoning federated-learning
PDF
attack arXiv Mar 17, 2026 · 20d ago

REFORGE: Multi-modal Attacks Reveal Vulnerable Concept Unlearning in Image Generation Models

Yong Zou, Haoran Li, Fanxiao Li et al. · Yunnan University · Northeastern University +1 more

Black-box adversarial image prompt attack that bypasses concept unlearning in diffusion models, recovering erased copyrighted and harmful concepts

Input Manipulation Attack visionmultimodalgenerative
PDF Code
benchmark arXiv Jan 9, 2026 · 12w ago

The Facade of Truth: Uncovering and Mitigating LLM Susceptibility to Deceptive Evidence

Herun Wan, Jiaying Wu, Minnan Luo et al. · Xi’an Jiaotong University · National University of Singapore +1 more

Benchmarks LLM vulnerability to sophisticated fabricated evidence and proposes DIS defense to shield beliefs against indirect context manipulation

Prompt Injection nlp
PDF Code