ML Security Papers

Latest papers

4 papers

attack arXiv Mar 11, 2026 · 26d ago

Sunpill Kim, Chanwoo Hwang, Minsu Kim et al. · Hanyang University

Attacks deepfake detectors using commercial AI chatbots to refine synthetic faces via inadvertently externalized LLM authenticity criteria

Output Integrity Attack visiongenerativenlp

attack arXiv Mar 3, 2026 · 4w ago

Chanwoo Hwang, Sunpill Kim, Yong Kiam Tan et al. · Hanyang University · A*STAR +2 more

Feature-aligned latent inversion achieves 91% speaker impersonation with 10x fewer black-box score queries

Input Manipulation Attack audio

benchmark arXiv Feb 7, 2026 · 8w ago

Sai Puppala, Ismail Hossain, Md Jahangir Alam et al. · Southern Illinois University · University of Texas +2 more

Benchmarks LLM agent architectures across 14 attack classes, exposing authorization confusion and tool hijacking as dominant structural risks

Excessive Agency Insecure Plugin Design Prompt Injection nlp

defense arXiv Nov 24, 2025 · Nov 2025

Hyeong-Gun Joo, Songnam Hong, Seunghwan Lee et al. · Hanyang University

Cryptographic secure aggregation for sign-based FL that hides gradient signs via majority-vote polynomials, blocking inference attacks

Model Inversion Attack federated-learning