ML Security Papers

Latest papers

8 papers

defense arXiv Apr 29, 2026 · 22d ago

Which Face and Whose Identity? Solving the Dual Challenge of Deepfake Proactive Forensics in Multi-Face Scenarios

Lei Zhang, Zhiqing Guo, Dan Ma et al. · Xinjiang University · Hunan University

Embeds identity watermarks in multi-face images to localize deepfake-manipulated regions and trace forged identities in group photos

Output Integrity Attack visionmultimodal

PDF

defense arXiv Apr 29, 2026 · 22d ago

GIFGuard: Proactive Forensics against Deepfakes in Facial GIFs via Spatiotemporal Watermarking

Shupeng Che, Zhiqing Guo, Changtao Miao et al. · Xinjiang University · Ant Group +1 more

Spatiotemporal watermarking framework embedding robust signals in facial GIFs to verify authenticity and detect deepfake tampering

Output Integrity Attack visionmultimodal

PDF

benchmark arXiv Mar 30, 2026 · 7w ago

Evaluating Privilege Usage of Agents on Real-World Tools

Quan Zhang, Lianhang Fu, Lvsi Lian et al. · East China Normal University · Xinjiang University +1 more

Benchmark evaluating LLM agents' privilege control under prompt injection attacks using real-world tools, finding 84.80% attack success

Prompt Injection Insecure Plugin Design Excessive Agency nlp

PDF

defense arXiv Feb 26, 2026 · 12w ago

All in One: Unifying Deepfake Detection, Tampering Localization, and Source Tracing with a Robust Landmark-Identity Watermark

Junjiang Wu, Liejun Wang, Zhiqing Guo · Xinjiang University · Xinjiang Multimodal Intelligent Processing and Information Security Engineering Technology Research Center

Proactive deepfake defense embedding landmark-identity watermarks into faces for unified detection, localization, and source tracing

Output Integrity Attack visiongenerative

PDF Code

attack arXiv Nov 3, 2025 · Nov 2025

Black-Box Membership Inference Attack for LVLMs via Prior Knowledge-Calibrated Memory Probing

Jinhua Yin, Peiru Yang, Chen Yang et al. · Tsinghua University · Beijing University of Posts and Telecommunications +1 more

First black-box membership inference attack on LVLMs using prior knowledge-calibrated probing to detect private training data.

Membership Inference Attack visionnlpmultimodal

1 citations PDF Code

defense arXiv Aug 24, 2025 · Aug 2025

Uncovering and Mitigating Destructive Multi-Embedding Attacks in Deepfake Proactive Forensics

Lixin Jia, Haiyang Sun, Zhiqing Guo et al. · Xinjiang University · Hefei University of Technology +1 more

Defines multi-embedding attacks that destroy deepfake forensic watermarks and defends with adversarial interference simulation training

Output Integrity Attack visiongenerative

PDF Code

defense arXiv Aug 14, 2025 · Aug 2025

Forgery Guided Learning Strategy with Dual Perception Network for Deepfake Cross-domain Detection

Lixin Jia, Zhiqing Guo, Gaobo Yang et al. · Xinjiang University · Xinjiang Multimodal Intelligent Processing and Information Security Engineering Technology Research Center +2 more

Proposes FGL strategy and DPNet architecture for cross-domain deepfake detection generalizing to unknown forgery techniques

Output Integrity Attack vision

PDF Code

The emergence of deepfake technology has introduced a range of societal problems, garnering considerable attention. Current deepfake detection methods perform well on specific datasets, but exhibit poor performance when applied to datasets with unknown forgery techniques. Moreover, as the gap between emerging and traditional forgery techniques continues to widen, cross-domain detection methods that rely on common forgery traces are becoming increasingly ineffective. This situation highlights the urgency of developing deepfake detection technology with strong generalization to cope with fast iterative forgery techniques. To address these challenges, we propose a Forgery Guided Learning (FGL) strategy designed to enable detection networks to continuously adapt to unknown forgery techniques. Specifically, the FGL strategy captures the differential information between known and unknown forgery techniques, allowing the model to dynamically adjust its learning process in real time. To further improve the ability to perceive forgery traces, we design a Dual Perception Network (DPNet) that captures both differences and relationships among forgery traces. In the frequency stream, the network dynamically perceives and extracts discriminative features across various forgery techniques, establishing essential detection cues. These features are then integrated with spatial features and projected into the embedding space. In addition, graph convolution is employed to perceive relationships across the entire feature space, facilitating a more comprehensive understanding of forgery trace correlations. Extensive experiments show that our approach generalizes well across different scenarios and effectively handles unknown forgery challenges, providing robust support for deepfake detection. Our code is available on https://github.com/vpsg-research/FGL.

cnn gnn transformer Xinjiang University · Xinjiang Multimodal Intelligent Processing and Information Security Engineering Technology Research Center · Hunan University +1 more

PDF arXiv Code

defense arXiv Aug 11, 2025 · Aug 2025

Boosting Active Defense Persistence: A Two-Stage Defense Framework Combining Interruption and Poisoning Against Deepfake

Hongrui Zheng, Yuezun Li, Liejun Wang et al. · Xinjiang University · Ocean University of China +1 more

Defends against deepfake retraining attacks by combining adversarial interruption perturbations with data poisoning to ensure long-term persistence

Output Integrity Attack Data Poisoning Attack visiongenerative

PDF Code

Latest papers

Which Face and Whose Identity? Solving the Dual Challenge of Deepfake Proactive Forensics in Multi-Face Scenarios

GIFGuard: Proactive Forensics against Deepfakes in Facial GIFs via Spatiotemporal Watermarking

Evaluating Privilege Usage of Agents on Real-World Tools

All in One: Unifying Deepfake Detection, Tampering Localization, and Source Tracing with a Robust Landmark-Identity Watermark

Black-Box Membership Inference Attack for LVLMs via Prior Knowledge-Calibrated Memory Probing

Uncovering and Mitigating Destructive Multi-Embedding Attacks in Deepfake Proactive Forensics

Forgery Guided Learning Strategy with Dual Perception Network for Deepfake Cross-domain Detection

Boosting Active Defense Persistence: A Two-Stage Defense Framework Combining Interruption and Poisoning Against Deepfake

Filters

Time Period

Paper Type

OWASP ML Top 10

OWASP LLM Top 10

Institution

Venue