Ivan Zhang

Papers in Database (1)

defense arXiv Aug 10, 2025 · Aug 2025

A Real-Time, Self-Tuning Moderator Framework for Adversarial Prompt Detection

Ivan Zhang · Non-Trivial Research Fellowship · Carnegie Mellon University

Proposes a real-time self-tuning two-agent moderator that adapts to novel LLM jailbreaks without expensive retraining

Prompt Injection nlp
PDF