Latest papers

2 papers
defense arXiv Nov 29, 2025 · Nov 2025

Teleportation-Based Defenses for Privacy in Approximate Machine Unlearning

Mohammad M Maheri, Xavier Cadet, Peter Chin et al. · Imperial College London · Dartmouth College

Proposes WARP teleportation defense that obfuscates unlearning signals, resisting membership inference and data reconstruction attacks

Membership Inference Attack Model Inversion Attack vision
PDF
attack EMNLP Nov 5, 2025 · Nov 2025

From Insight to Exploit: Leveraging LLM Collaboration for Adaptive Adversarial Text Generation

Najrin Sultana, Md Rafi Ur Rashid, Kang Gu et al. · The Pennsylvania State University · Dartmouth College

LLM-driven adversarial text generation that fools LLM classifiers via semantic-preserving perturbations without gradient access

Prompt Injection nlp
PDF Code