Jehyeok Yeon

h-index: 2 6 citations 3 papers (total)

Papers in Database (2)

benchmark arXiv Oct 5, 2025 · Oct 2025

Quantifying Distributional Robustness of Agentic Tool-Selection

Jehyeok Yeon, Isha Chaudhary, Gagandeep Singh · University of Illinois Urbana-Champaign

Statistical framework certifying LLM agent tool-selection robustness against adaptive adversarial tool injection, revealing near-zero certified accuracy under attack

Insecure Plugin Design Prompt Injection nlp
3 citations PDF
defense arXiv Dec 7, 2025 · Dec 2025

GSAE: Graph-Regularized Sparse Autoencoders for Robust LLM Safety Steering

Jehyeok Yeon, Federico Cinus, Yifan Wu et al. · University of Illinois Urbana-Champaign · University of Southern California +1 more

Proposes graph-regularized sparse autoencoders to capture distributed LLM safety representations for adaptive jailbreak defense with 82% refusal rate

Prompt Injection nlp
1 citations PDF