Davide Gabrielli

h-index: 1 · 8 citations · 5 papers (total)

Papers in Database (1)

defense arXiv Oct 2, 2025 · Oct 2025

Inverse Language Modeling towards Robust and Grounded LLMs

Davide Gabrielli, Simone Sestito, Iacopo Masi · Sapienza University of Rome

Defends LLMs against adversarial perturbations and unsafe triggers by inverting model outputs to expose attack inputs

Input Manipulation Attack · Prompt Injection · nlp