Latest papers

2 papers
survey arXiv Dec 10, 2025 · Dec 2025

Chasing Shadows: Pitfalls in LLM Security Research

Jonathan Evertz, Niklas Risse, Nicolai Neuer et al. · CISPA Helmholtz Center for Information Security · Max Planck Institute for Security and Privacy +4 more

Surveys nine methodological pitfalls in LLM security research found in all 72 surveyed papers, with case studies showing how each misleads results

Data Poisoning Attack Prompt Injection nlp
2 citations PDF
defense arXiv Sep 26, 2025 · Sep 2025

Erase or Hide? Suppressing Spurious Unlearning Neurons for Robust Unlearning

Nakyeong Yang, Dong-Kyum Kim, Jea Kwon et al. · Seoul National University · Max Planck Institute for Security and Privacy

Defends LLM unlearning against adversarial relearning attacks by suppressing spurious neurons that hide rather than erase private knowledge

Sensitive Information Disclosure nlp
1 citations PDF