Francis Kulumba

h-index: 1 13 citations 2 papers (total)

Papers in Database (1)

benchmark arXiv Feb 11, 2026 · 7w ago

Triggers Hijack Language Circuits: A Mechanistic Analysis of Backdoor Behaviors in Large Language Models

Théo Lasnier, Wissam Antoun, Francis Kulumba et al. · Inria Paris

Mechanistic analysis reveals LLM backdoor triggers hijack existing language-encoding circuits rather than forming isolated hidden pathways

Model Poisoning nlp
PDF