Alexandre Le Mercier

Papers in Database (1)

defense arXiv Mar 12, 2026 · 25d ago

CLASP: Defending Hybrid Large Language Models Against Hidden State Poisoning Attacks

Alexandre Le Mercier, Thomas Demeester, Chris Develder · Ghent University

Defends SSM-based hybrid LLMs against hidden state poisoning and prompt injection using Mamba block output embeddings and XGBoost detection

Prompt Injection nlp
PDF Code