Latest papers

4 papers
attack · arXiv · Mar 4, 2026

Tuning Just Enough: Lightweight Backdoor Attacks on Multi-Encoder Diffusion Models

Ziyuan Chen, Yujin Jeong, Tobias Braun et al. · TU Darmstadt · Hessian Center for Artificial Intelligence

Proposes MELT, a LoRA-based backdoor attack on Stable Diffusion 3 that requires tuning fewer than 0.2% of encoder parameters

Model Poisoning · Transfer Learning Attack · vision · generative · multimodal
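The "fewer than 0.2% of encoder parameters" figure is the kind of ratio LoRA makes easy to reach, since a rank-r adapter trains r·(d_in + d_out) parameters in place of a full d_in·d_out weight update. A minimal sketch of that arithmetic, using hypothetical layer dimensions rather than the paper's actual configuration:

```python
# Back-of-the-envelope check of a "<0.2% of parameters" LoRA claim.
# LoRA factors a weight update into B (d_out x r) and A (r x d_in),
# so the trainable fraction per dense layer is r*(d_in + d_out) / (d_in * d_out).

def lora_trainable_fraction(d_in: int, d_out: int, rank: int) -> float:
    """Fraction of a dense layer's parameters that a rank-`rank` adapter trains."""
    full_params = d_in * d_out
    lora_params = rank * (d_in + d_out)
    return lora_params / full_params

# Hypothetical square 4096-dim projection with rank-4 adapters
# (illustrative numbers only, not MELT's reported setting):
print(lora_trainable_fraction(4096, 4096, 4))  # 0.001953125, i.e. under 0.2%
```

At rank 4 on a 4096×4096 layer the trainable share is about 0.195%, which shows how such sub-0.2% budgets arise without any reference to the attack's specifics.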
benchmark · arXiv · Feb 2, 2026

AICD Bench: A Challenging Benchmark for AI-Generated Code Detection

Daniil Orel, Dilshod Azizov, Indraneil Paul et al. · Mohamed bin Zayed University of Artificial Intelligence · TU Darmstadt +1 more

Large-scale benchmark revealing that AI-generated code detectors fail severely under distribution shift and adversarial conditions

Output Integrity Attack · nlp
benchmark · arXiv · Jan 21, 2026

Privacy Collapse: Benign Fine-Tuning Can Break Contextual Privacy in Language Models

Anmol Goel, Cornelius Emde, Sangdoo Yun et al. · Parameter Lab · TU Darmstadt +3 more

Benign fine-tuning silently breaks contextual privacy in LLMs, causing inappropriate data disclosure that standard safety benchmarks fail to detect

Transfer Learning Attack · Sensitive Information Disclosure · nlp
attack · arXiv · Jan 19, 2026

ChartAttack: Testing the Vulnerability of LLMs to Malicious Prompting in Chart Generation

Jesus-German Ortiz-Barajas, Jonathan Tonglet, Vivek Gupta et al. · INSAIT · Sofia University +3 more

Jailbreaks MLLMs via adversarial prompting into auto-generating misleading charts, reducing both human and MLLM QA accuracy by roughly 20 points

Prompt Injection · multimodal · vision · nlp