Rashmi Gangadharaiah

Papers in Database (1)

attack · arXiv · Sep 5, 2025

Neural Breadcrumbs: Membership Inference Attacks on LLMs Through Hidden State and Attention Pattern Analysis

Disha Makhija, Manoj Ghuhan Arivazhagan, Vinayshekhar Bannihatti Kumar et al. · AWS AI Labs

White-box membership inference attack on LLMs using hidden states and attention patterns achieves AUC 0.85, surpassing output-based methods

Membership Inference Attack · NLP