Zhixin Zhang

h-index: 2 16 citations 10 papers (total)

Papers in Database (2)

defense arXiv Oct 22, 2025 · Oct 2025

Monitoring LLM-based Multi-Agent Systems Against Corruptions via Node Evaluation

Chengcan Wu, Zhixin Zhang, Mingqian Xu et al. · Peking University

Dynamic graph-monitoring defense disrupts malicious inter-agent communications in LLM multi-agent systems via continuous node evaluation

Prompt Injection Excessive Agency nlpgraph
2 citations PDF Code
benchmark arXiv Feb 2, 2026 · 9w ago

RACA: Representation-Aware Coverage Criteria for LLM Safety Testing

Zeming Wei, Zhixin Zhang, Chengcan Wu et al. · Peking University

Coverage criteria framework using LLM internal representations to evaluate jailbreak test suite adequacy and guide attack prompt sampling

Prompt Injection nlp
PDF