Latest papers

2 papers
attack · arXiv · Mar 13, 2026

Purify Once, Edit Freely: Breaking Image Protections under Model Mismatch

Qichen Zhao, Shengfang Zhai, Xinjian Bai et al. · Peking University · National University of Singapore +1 more

Defeats image protection schemes via purification attacks, removing adversarial perturbations to restore full editability under model mismatch

Output Integrity Attack · vision · generative
defense · arXiv · Sep 2, 2025

Enhancing Reliability in LLM-Integrated Robotic Systems: A Unified Approach to Security and Safety

Wenxiao Zhang, Xiangrui Kong, Conan Dewitt et al. · The University of Western Australia

Defends LLM-driven mobile robots against prompt injection via structured prompt assembly, state management, and safety validation

Prompt Injection · Excessive Agency · nlp · multimodal