ML Security Papers

Latest papers

4 papers

defense arXiv Jan 26, 2026 · 10w ago

RealStats: A Rigorous Real-Only Statistical Framework for Fake Image Detection

Haim Zisman, Uri Shaham · Bar-Ilan University

Training-free fake image detector using p-value aggregation over real-image statistics for interpretable, adaptable AI-image detection

Output Integrity Attack vision

PDF Code

defense arXiv Jan 19, 2026 · 11w ago

Context and Transcripts Improve Detection of Deepfake Audios of Public Figures

Chongyang Gao, Marco Postiglione, Julian Baldwin et al. · Northwestern University · Bar-Ilan University

Novel context-aware audio deepfake detector boosts F1-score up to 37% and resists 5 adversarial evasion strategies for public-figure impersonation

Output Integrity Attack audiomultimodalnlp

PDF Code

defense arXiv Jan 18, 2026 · 11w ago

LR-DWM: Efficient Watermarking for Diffusion Language Models

Ofek Raban, Ethan Fetaya, Gal Chechik · Bar-Ilan University · NVIDIA

Proposes LR-DWM, an efficient watermarking scheme for Diffusion Language Models using bidirectional neighbor context with negligible overhead

Output Integrity Attack nlpgenerative

PDF

benchmark arXiv Sep 25, 2025 · Sep 2025

No Prior, No Leakage: Revisiting Reconstruction Attacks in Trained Neural Networks

Yehonatan Refael, Guy Smorodinsky, Ofir Lindenbaum et al. · Tel Aviv University · Ben-Gurion University of the Negev +1 more

Theoretically proves reconstruction attacks on neural networks are fundamentally unreliable without prior data knowledge, and that better-trained models leak less

Model Inversion Attack vision

PDF

Latest papers

RealStats: A Rigorous Real-Only Statistical Framework for Fake Image Detection

Context and Transcripts Improve Detection of Deepfake Audios of Public Figures

LR-DWM: Efficient Watermarking for Diffusion Language Models

No Prior, No Leakage: Revisiting Reconstruction Attacks in Trained Neural Networks

Filters

Time Period

Paper Type

OWASP ML Top 10

OWASP LLM Top 10

Institution

Venue