Shuo Shao

h-index: 9 · 311 citations · 23 papers (total)

Papers in Database (3)

attack · arXiv · Oct 3, 2025

External Data Extraction Attacks against Retrieval-Augmented Large Language Models

Yu He, Yifei Chen, Yiming Li et al. · Zhejiang University · Hangzhou High-Tech Zone (Binjiang) Institute of Blockchain and Data Security +1 more

Proposes SECRET, an adaptive jailbreak-plus-retrieval-trigger attack that extracts RAG knowledge base contents verbatim from leading commercial LLMs

Sensitive Information Disclosure · Prompt Injection · nlp
1 citation

attack · arXiv · Nov 6, 2025

Black-Box Guardrail Reverse-engineering Attack

Hongwei Yao, Yun Xia, Shuo Shao et al. · City University of Hong Kong · Hangzhou Dianzi University +1 more

Clones black-box LLM guardrail policies via RL and genetic algorithms, achieving 0.92 fidelity for under $85 in API queries

Model Theft · Model Theft · nlp

attack · arXiv · Dec 9, 2025

MIRAGE: Misleading Retrieval-Augmented Generation via Black-box and Query-agnostic Poisoning Attacks

Tailun Chen, Yu He, Yan Wang et al. · Zhejiang University · Alibaba Group +1 more

Black-box RAG corpus poisoning attack using persona-driven query synthesis, semantic anchoring, and adversarial preference optimization to mislead LLMs

Data Poisoning Attack · Prompt Injection · nlp