Tri Nguyen

Papers in Database (1)

tool arXiv Feb 10, 2026

Detecting Jailbreak Attempts in Clinical Training LLMs Through Automated Linguistic Feature Extraction

Tri Nguyen, Huy Hoang Bao Le, Lohith Srikanth Pentapalli et al. · University of Cincinnati

Two-layer jailbreak detector using BERT-extracted linguistic features to catch unsafe prompt manipulation in clinical training LLMs

Prompt Injection nlp