Latest papers

4 papers
attack arXiv Mar 11, 2026

Naïve Exposure of Generative AI Capabilities Undermines Deepfake Detection

Sunpill Kim, Chanwoo Hwang, Minsu Kim et al. · Hanyang University

Attacks deepfake detectors by using commercial AI chatbots to refine synthetic faces, exploiting authenticity criteria that the LLMs inadvertently externalize

Output Integrity Attack vision generative nlp
attack arXiv Mar 3, 2026

Scores Know Bob's Voice: Speaker Impersonation Attack

Chanwoo Hwang, Sunpill Kim, Yong Kiam Tan et al. · Hanyang University · A*STAR +2 more

Feature-aligned latent inversion achieves 91% speaker impersonation with 10x fewer black-box score queries

Input Manipulation Attack audio
benchmark arXiv Feb 7, 2026

Agent-Fence: Mapping Security Vulnerabilities Across Deep Research Agents

Sai Puppala, Ismail Hossain, Md Jahangir Alam et al. · Southern Illinois University · University of Texas +2 more

Benchmarks LLM agent architectures across 14 attack classes, exposing authorization confusion and tool hijacking as dominant structural risks

Excessive Agency · Insecure Plugin Design · Prompt Injection · nlp
defense arXiv Nov 24, 2025

Hi-SAFE: Hierarchical Secure Aggregation for Lightweight Federated Learning

Hyeong-Gun Joo, Songnam Hong, Seunghwan Lee et al. · Hanyang University

Cryptographic secure aggregation for sign-based FL that hides gradient signs via majority-vote polynomials, blocking inference attacks

Model Inversion Attack federated-learning