ML Security Papers

Stats

Latest papers

4 papers

benchmark arXiv Feb 25, 2026 · 5w ago

Beyond Static Artifacts: A Forensic Benchmark for Video Deepfake Reasoning in Vision Language Models

Zheyuan Gu, Qingsong Zhao, Yusong Wang et al. · China Telecom · Peking University +1 more

Proposes FAQ benchmark to evaluate VLMs on temporal deepfake detection via three-level forensic reasoning hierarchy

Output Integrity Attack visionmultimodal

PDF

benchmark arXiv Jan 10, 2026 · 12w ago

Are LLMs Vulnerable to Preference-Undermining Attacks (PUA)? A Factorial Analysis Methodology for Diagnosing the Trade-off between Preference Alignment and Real-World Validity

Hongjun An, Yiliang Song, Jiangan Chen et al. · Northwestern Polytechnical University · China Telecom +1 more

Factorial framework diagnoses how manipulative natural-language prompts exploit RLHF alignment to make LLMs prioritize sycophancy over factual accuracy

Prompt Injection nlp

PDF

attack arXiv Nov 12, 2025 · Nov 2025

Boosting Adversarial Transferability via Ensemble Non-Attention

Yipeng Zou, Qin Liu, Jie Wu et al. · Hunan University · China Telecom +2 more

Ensemble adversarial attack leveraging non-attention regions and meta-learning to boost black-box transferability across CNNs and ViTs

Input Manipulation Attack vision

PDF

attack arXiv Jan 6, 2025 · Jan 2025

Rethinking Byzantine Robustness in Federated Recommendation from Sparse Aggregation Perspective

Zhongjian Zhang, Mengmei Zhang, Xiao Wang et al. · Beijing University of Posts and Telecommunications · China Telecom +2 more

Proposes Spattack, Byzantine attacks exploiting sparse aggregation in federated recommendation to prevent convergence and break defenses

Data Poisoning Attack federated-learning

PDF

Latest papers

Beyond Static Artifacts: A Forensic Benchmark for Video Deepfake Reasoning in Vision Language Models

Are LLMs Vulnerable to Preference-Undermining Attacks (PUA)? A Factorial Analysis Methodology for Diagnosing the Trade-off between Preference Alignment and Real-World Validity

Boosting Adversarial Transferability via Ensemble Non-Attention

Rethinking Byzantine Robustness in Federated Recommendation from Sparse Aggregation Perspective

Filters

Time Period

Paper Type

OWASP ML Top 10

OWASP LLM Top 10

Institution

Venue