Frauke Kreuter

h-index: 10 305 citations 26 papers (total)

Papers in Database (1)

benchmark arXiv Oct 24, 2025 · Oct 2025

Toward Understanding the Transferability of Adversarial Suffixes in Large Language Models

Sarah Ball, Niki Hasrati, Alexander Robey et al. · Ludwig-Maximilians-Universität München · Carnegie Mellon University +1 more

Analyzes why gradient-optimized adversarial suffixes transfer across LLMs using refusal-direction geometry in activation space

Input Manipulation Attack Prompt Injection nlp
PDF Code