Latest papers

3 papers
defense arXiv Apr 20, 2026 · 4w ago

AgenTEE: Confidential LLM Agent Execution on Edge Devices

Sina Abdollahi, Mohammad M Maheri, Javad Forough et al. · Imperial College London · Dartmouth College

Secure LLM agent deployment system using Arm confidential VMs to isolate runtime, inference, and plugins on edge devices

AI Supply Chain Attacks Insecure Plugin Design Excessive Agency nlp
PDF
defense arXiv Nov 29, 2025 · Nov 2025

Teleportation-Based Defenses for Privacy in Approximate Machine Unlearning

Mohammad M Maheri, Xavier Cadet, Peter Chin et al. · Imperial College London · Dartmouth College

Proposes WARP teleportation defense that obfuscates unlearning signals, resisting membership inference and data reconstruction attacks

Membership Inference Attack Model Inversion Attack vision
PDF
attack EMNLP Nov 5, 2025 · Nov 2025

From Insight to Exploit: Leveraging LLM Collaboration for Adaptive Adversarial Text Generation

Najrin Sultana, Md Rafi Ur Rashid, Kang Gu et al. · The Pennsylvania State University · Dartmouth College

LLM-driven adversarial text generation that fools LLM classifiers via semantic-preserving perturbations without gradient access

Prompt Injection nlp
PDF Code