Hongwei Li

attack arXiv Apr 23, 2026 · 28d ago

Zihan Wang, Rui Zhang, Yu Liu et al. · University of Electronic Science and Technology of China

Black-box attacks extract proprietary LLM agent skills in 3 interactions; defenses tested but low-cost repeated attacks remain effective

Sensitive Information Disclosure Prompt Injection nlp

Papers in Database (1)