Rishi Jha

defense arXiv Oct 20, 2025 · Oct 2025

Rishi Jha, Harold Triedman, Justin Wagle et al. · Cornell University · Microsoft

Breaks alignment-based defenses for LLM multi-agent control-flow hijacking and proposes ControlValve using control-flow graphs and least privilege

Prompt Injection Excessive Agency nlp

3 citations PDF

Papers in Database (1)