Wei Wang

defense arXiv Aug 25, 2025 · Aug 2025

Quanwei Wu, Jun Guo, Wei Wang et al. · Dongguan University of Technology · The Hong Kong University of Science and Technology

Proposes feature-space adapter for adversarial training that eliminates robust overfitting with negligible computational overhead

Input Manipulation Attack vision

defense arXiv Feb 9, 2026 · 8w ago

Liwen Wang, Zongjie Li, Yuchong Xie et al. · The Hong Kong University of Science and Technology · HSBC

Watermarks agentic LLM systems by biasing tool execution paths, so stolen imitation models inherit detectable signatures

Model Theft Model Theft nlp

Papers in Database (2)