Guoli Wang

h-index: 1 1 citations 2 papers (total)

Papers in Database (1)

defense arXiv Nov 9, 2025 · Nov 2025

EASE: Practical and Efficient Safety Alignment for Small Language Models

Haonan Shi, Guoli Wang, Tu Ouyang et al. · Case Western Reserve University

Defends small LLMs against jailbreaks via selective safety reasoning that activates only for dangerous queries, cutting overhead 90%

Prompt Injection nlp
PDF Code