ML Security Papers

Latest papers

4 papers

attack arXiv Apr 8, 2026

Renyang Liu, Jiale Li, Jie Zhang et al. · National University of Singapore · A*STAR +3 more

Physical adversarial patch attack on palmprint recognition using cross-shaped patches that survive real-world capture distortions

Input Manipulation Attack vision

defense arXiv Mar 19, 2026

Sheng Pan, Niansheng Tang · Yunnan University

Active auditing framework using stochastic probes to detect adaptive backdoors in decentralized federated learning networks

Model Poisoning federated-learning

attack arXiv Mar 17, 2026

Yong Zou, Haoran Li, Fanxiao Li et al. · Yunnan University · Northeastern University +1 more

Black-box adversarial image prompt attack that bypasses concept unlearning in diffusion models, recovering erased copyrighted and harmful concepts

Input Manipulation Attack vision multimodal generative

benchmark arXiv Jan 9, 2026

Herun Wan, Jiaying Wu, Minnan Luo et al. · Xi’an Jiaotong University · National University of Singapore +1 more

Benchmarks LLM vulnerability to sophisticated fabricated evidence and proposes DIS defense to shield beliefs against indirect context manipulation

Prompt Injection nlp