Latest papers

2 papers
survey · arXiv · Feb 11, 2026

The Landscape of Prompt Injection Threats in LLM Agents: From Taxonomy to Analysis

Peiran Wang, Xinfeng Li, Chong Xiang et al. · UCLA · NTU +1 more

Systematizes prompt injection attacks and defenses for LLM agents and introduces the AgentPI benchmark, which exposes context-dependent gaps in existing evaluations

Prompt Injection · Excessive Agency · nlp
PDF
benchmark · arXiv · Jan 2, 2026

A Comprehensive Dataset for Human vs. AI Generated Image Detection

Rajarshi Roy, Nasrin Imanpour, Ashhar Aziz et al. · Kalyani Government Engineering College · AI Institute USC +12 more

Releases MS COCOAI, a 96K-image benchmark for detecting AI-generated images and attributing them to specific generative models

Output Integrity Attack · vision · generative
1 citation · PDF · Code