Latest papers

2 papers
attack arXiv Apr 22, 2026 · 29d ago

Breaking MCP with Function Hijacking Attacks: Novel Threats for Function Calling and Agentic Models

Yannis Belkhiter, Giulio Zizzo, Sergio Maffeis et al. · IBM Research Europe · Trinity College Dublin +1 more

Gradient-based adversarial attack that hijacks LLM function calling by inserting optimized tokens into function descriptions to force invocation of attacker-chosen tools

Input Manipulation Attack Insecure Plugin Design Excessive Agency nlp
PDF
defense arXiv Nov 23, 2025 · Nov 2025

Beyond Binary Classification: A Semi-supervised Approach to Generalized AI-generated Image Detection

Hong-Hanh Nguyen-Le, Van-Tuan Tran, Dinh-Thuc Nguyen et al. · University College Dublin · Trinity College Dublin +1 more

Semi-supervised detector discovers latent GAN vs. diffusion model patterns to generalize AI-generated image detection across unseen generator architectures

Output Integrity Attack visiongenerative
PDF