Latest papers

6 papers
defense arXiv Mar 6, 2026 · 4w ago

Word-Anchored Temporal Forgery Localization

Tianyi Wang, Xi Shao, Harry Cheng et al. · National University of Singapore · Nanjing University of Posts and Telecommunications +1 more

Detects audio-visual deepfake segments via word-token binary classification, outperforming regression-based TFL baselines

Output Integrity Attack audio · vision · multimodal
PDF
attack arXiv Feb 11, 2026 · 7w ago

Transferable Backdoor Attacks for Code Models via Sharpness-Aware Adversarial Perturbation

Shuyu Chang, Haiping Huang, Yanjun Zhang et al. · Nanjing University of Posts and Telecommunications · State Key Laboratory of Tibetan Intelligence +5 more

Backdoor attack on code models using sharpness-aware training and Gumbel-Softmax triggers for cross-dataset transferability and stealthiness

Model Poisoning nlp
PDF
attack arXiv Jan 29, 2026 · 9w ago

Noise as a Probe: Membership Inference Attacks on Diffusion Models Leveraging Initial Noise

Puwei Lian, Yujun Cai, Songze Li et al. · Southeast University · The University of Queensland +1 more

Exploits residual semantics in diffusion model noise schedules to perform black-box membership inference without auxiliary data

Membership Inference Attack vision · generative
PDF
attack arXiv Jan 28, 2026 · 9w ago

ICON: Intent-Context Coupling for Efficient Multi-Turn Jailbreak Attack

Xingwei Lin, Wenhao Lin, Sicong Cao et al. · Zhejiang University · Nanjing University of Posts and Telecommunications +2 more

Exploits intent-context coupling in multi-turn jailbreaks to bypass LLM safety with 97.1% attack success rate

Prompt Injection nlp
PDF Code
attack arXiv Nov 10, 2025

Differentiated Directional Intervention: A Framework for Evading LLM Safety Alignment

Peng Zhang, Peijie Sun · Nanjing University of Posts and Telecommunications

White-box activation attack decomposes LLM safety alignment into two directions and neutralizes both, achieving 97.88% jailbreak success on Llama-2

Prompt Injection nlp
1 citation PDF
benchmark arXiv Oct 26, 2025

DeepfakeBench-MM: A Comprehensive Benchmark for Multimodal Deepfake Detection

Kangran Zhao, Yupeng Chen, Xiaoyu Zhang et al. · The Chinese University of Hong Kong · State University of New York +1 more

Proposes the largest multimodal deepfake benchmark (1.1M forged samples, 21 pipelines) and a unified evaluation framework for audiovisual deepfake detection

Output Integrity Attack vision · audio · multimodal
1 citation PDF