ML Security Papers

Latest papers

2 papers

survey arXiv Feb 11, 2026 · 7w ago

The Landscape of Prompt Injection Threats in LLM Agents: From Taxonomy to Analysis

Peiran Wang, Xinfeng Li, Chong Xiang et al. · UCLA · NTU +1 more

Systematizes prompt injection attacks and defenses for LLM agents, introducing AgentPI benchmark that exposes context-dependent gaps in existing evaluations

Prompt Injection Excessive Agency nlp

PDF

benchmark arXiv Jan 2, 2026 · Jan 2026

A Comprehensive Dataset for Human vs. AI Generated Image Detection

Rajarshi Roy, Nasrin Imanpour, Ashhar Aziz et al. · Kalyani Government Engineering College · AI Institute USC +12 more

Releases MS COCOAI, a 96K-image benchmark for detecting AI-generated images and attributing them to specific generative models

Output Integrity Attack visiongenerative

1 citations PDF Code

Latest papers

The Landscape of Prompt Injection Threats in LLM Agents: From Taxonomy to Analysis

A Comprehensive Dataset for Human vs. AI Generated Image Detection

Filters

Time Period

Paper Type

OWASP ML Top 10

OWASP LLM Top 10

Institution

Venue