ML Security Papers

Latest papers

4 papers

defense · arXiv · Feb 25, 2026

Leveraging large multimodal models for audio-video deepfake detection: a pilot study

Songjun Cao, Yuqi Li, Yunpeng Luo et al. · Tencent Youtu Lab · Fudan University

Fine-tunes Qwen 2.5 Omni as a unified audio-visual deepfake detector via two-stage LoRA and encoder fine-tuning

Output Integrity Attack · multimodal · audio · vision

defense · arXiv · Feb 25, 2026

TranX-Adapter: Bridging Artifacts and Semantics within MLLMs for Robust AI-generated Image Detection

Wenbin Wang, Yuge Huang, Jianqing Xu et al. · Wuhan University · Tencent Youtu Lab +1 more

Fixes attention dilution in MLLM-based AI-generated image detectors via optimal transport and cross-attention fusion

Output Integrity Attack · vision · multimodal

Code

defense · arXiv · Feb 2, 2026

MIRROR: Manifold Ideal Reference ReconstructOR for Generalizable AI-Generated Image Detection

Ruiqi Liu, Manni Cui, Ziheng Qin et al. · Institute of Automation · School of Advanced Interdisciplinary Sciences +7 more

Detects AI-generated images by projecting inputs to a real-image manifold and using reconstruction residuals as forgery signals, surpassing human experts

Output Integrity Attack · vision · generative

Code

defense · arXiv · Sep 29, 2025

Seeing Before Reasoning: A Unified Framework for Generalizable and Explainable Fake Image Detection

Kaiqing Lin, Zhiyuan Yan, Ruoxin Chen et al. · Shenzhen University · Tencent Youtu Lab +2 more

Proposes Forensic-Chat, a two-stage MLLM training paradigm enabling artifact-aware visual perception before reasoning for explainable AI-generated image detection

Output Integrity Attack · vision · multimodal

9 citations
