Tianyu Zhang

h-index: 1 · 2 citations · 2 papers (total)

Papers in Database (1)

attack arXiv Nov 27, 2025 · Nov 2025

Distillability of LLM Security Logic: Predicting Attack Success Rate of Outline Filling Attack via Ranking Regression

Tianyu Zhang, Zihang Xi, Jingyu Hua et al. · Nanjing University

Builds a lightweight proxy model that predicts jailbreak attack success rates, turning black-box attack optimization against LLMs into a quasi-white-box problem.

Prompt Injection · nlp