Doron Shavit

h-index: 0 0 citations 1 papers (total)

Papers in Database (1)

defense arXiv Feb 18, 2026 · 6w ago

Recursive language models for jailbreak detection: a procedural defense for tool-augmented agents

Doron Shavit · Silverfort

Recursive LLM orchestration framework that de-obfuscates, chunks, and aggregates multi-segment evidence to detect jailbreaks in tool-augmented agents

Prompt Injection nlp
PDF