Yihao Zhang

h-index: 6 · 118 citations · 15 papers (total)

Papers in Database (1)

benchmark arXiv Feb 2, 2026 · 9w ago

RACA: Representation-Aware Coverage Criteria for LLM Safety Testing

Zeming Wei, Zhixin Zhang, Chengcan Wu et al. · Peking University

A coverage-criteria framework that uses an LLM's internal representations to evaluate the adequacy of jailbreak test suites and to guide the sampling of attack prompts.

Prompt Injection nlp