Zeming Wei

h-index: 12 · 910 citations · 28 papers (total)

Papers in Database (3)

defense · arXiv · Oct 22, 2025

Monitoring LLM-based Multi-Agent Systems Against Corruptions via Node Evaluation

Chengcan Wu, Zhixin Zhang, Mingqian Xu et al. · Peking University

Dynamic graph-monitoring defense disrupts malicious inter-agent communications in LLM multi-agent systems via continuous node evaluation

Prompt Injection · Excessive Agency · nlp · graph
2 citations
benchmark · arXiv · Feb 2, 2026

RACA: Representation-Aware Coverage Criteria for LLM Safety Testing

Zeming Wei, Zhixin Zhang, Chengcan Wu et al. · Peking University

Coverage criteria framework using LLM internal representations to evaluate jailbreak test suite adequacy and guide attack prompt sampling

Prompt Injection · nlp
defense · arXiv · Nov 15, 2025

Calibrated Adversarial Sampling: Multi-Armed Bandit-Guided Generalization Against Unforeseen Attacks

Rui Wang, Zeming Wei, Xiyue Zhang et al. · Peking University · University of Bristol

Defends DNNs against unseen adversarial attacks by dynamically sampling attack types via multi-armed bandit adversarial training

Input Manipulation Attack · vision