Latest papers

2 papers
defense arXiv Mar 25, 2026 · 12d ago

Enhancing and Reporting Robustness Boundary of Neural Code Models for Intelligent Code Understanding

Tingxu Han, Wei Song, Weisong Sun et al. · Nanjing University · University of New South Wales +2 more

Black-box certified defense for code models using randomized smoothing to reduce adversarial attack success from 42% to 9.74%

Input Manipulation Attack nlp
PDF
attack arXiv Sep 30, 2025 · Sep 2025

STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models

Shaoxiong Guo, Tianyi Du, Lijun Li et al. · Shanghai Artificial Intelligence Laboratory · East China Normal University +2 more

Multi-turn narrative jailbreak exploiting UMM generation-understanding coupling to bypass safety alignment via story framing

Prompt Injection multimodalnlpvision
PDF