Haojin Zhu

Papers in Database (4)

attack · arXiv · Jan 10, 2025

Model Inversion in Split Learning for Personalized LLMs: New Insights from Information Bottleneck Theory

Yunmeng Shu, Shaofeng Li, Tian Dong et al. · Shanghai Jiao Tong University · Southeast University

Two-stage model inversion attack recovers private input text from LLM intermediate representations in split learning edge-cloud deployments

Model Inversion Attack · Sensitive Information Disclosure · nlp

attack · arXiv · Mar 9, 2026

SlowBA: An efficiency backdoor attack towards VLM-based GUI agents

Junxian Li, Tu Lan, Haozhen Tan et al. · Shanghai Jiao Tong University

Backdoor attack on VLM GUI agents that induces excessive latency via RL-injected trigger-aware long reasoning chains

Model Poisoning · multimodal · vision · nlp

defense · arXiv · Mar 12, 2026

EmbTracker: Traceable Black-box Watermarking for Federated Language Models

Haodong Zhao, Jinming Hu, Yijie Bai et al. · Shanghai Jiao Tong University · Ant Group +2 more

Embeds per-client backdoor watermarks in federated LMs to trace model leaks to individual culprits via black-box queries

Model Theft · Model Poisoning · nlp · federated-learning · multimodal

attack · arXiv · Mar 20, 2026

Trojan's Whisper: Stealthy Manipulation of OpenClaw through Injected Bootstrapped Guidance

Fazhong Liu, Zhuoyan Chen, Tu Lan et al. · Shanghai Jiao Tong University

Supply chain attack embedding malicious operational narratives in autonomous coding agent bootstrap guidance, achieving up to 64% success rate

AI Supply Chain Attacks · Prompt Injection · Insecure Plugin Design · nlp