Maximilian Dreyer

Papers in Database (1)

defense arXiv Aug 28, 2025 · Aug 2025

Towards Mechanistic Defenses Against Typographic Attacks in CLIP

Lorenz Hufe, Constantin Venhoff, Erblina Purelku et al. · Fraunhofer Heinrich Hertz Institute · University of Oxford +2 more

Defends CLIP against typographic image-text attacks via gradient-free attention head ablation, improving robustness 22% with <1% accuracy loss

Input Manipulation Attack Prompt Injection visionmultimodal
PDF Code