ML Security Papers

Latest papers

3 papers

attack arXiv Mar 12, 2026 · 25d ago

Cascade: Composing Software-Hardware Attack Gadgets for Adversarial Threat Amplification in Compound AI Systems

Sarbartha Banerjee, Prateek Sahu, Anjo Vahldiek-Oberwagner et al. · Georgia Tech · The University of Texas at Austin +3 more

Compounds Rowhammer hardware faults and RAG database injection with LLM attacks to jailbreak guardrails and exfiltrate user data

Prompt Injection Sensitive Information Disclosure nlp

PDF

attack arXiv Feb 3, 2026 · 8w ago

Controlling Output Rankings in Generative Engines for LLM-based Search

Haibo Jin, Ruoxi Chen, Peiyan Zhang et al. · University of Illinois at Urbana-Champaign · Starc Institute +2 more

Injects crafted content into product pages to manipulate LLM-based search rankings with 91% promotion success rate

Input Manipulation Attack Prompt Injection nlp

PDF

benchmark ICCVW Sep 22, 2025 · Sep 2025

Is It Certainly a Deepfake? Reliability Analysis in Detection & Generation Ecosystem

Neslihan Kose, Anthony Rhodes, Umur Aybars Ciftci et al. · Intel Labs · Binghamton University +1 more

Benchmarks deepfake detector reliability via Bayesian uncertainty quantification, revealing generator-specific artifacts through pixel-level uncertainty maps

Output Integrity Attack vision

1 citations PDF

Latest papers

Cascade: Composing Software-Hardware Attack Gadgets for Adversarial Threat Amplification in Compound AI Systems

Controlling Output Rankings in Generative Engines for LLM-based Search

Is It Certainly a Deepfake? Reliability Analysis in Detection & Generation Ecosystem

Filters

Time Period

Paper Type

OWASP ML Top 10

OWASP LLM Top 10

Institution

Venue