Tyler Slater

h-index: 0 · 0 citations · 2 papers (total)

Papers in Database (1)

defense · arXiv · Nov 10, 2025 · Nov 2025

A Self-Improving Architecture for Dynamic Safety in Large Language Models

Tyler Slater · Georgia Institute of Technology

A self-adapting runtime safety framework autonomously synthesizes new jailbreak defenses from breach feedback, cutting the LLM attack success rate (ASR) from 100% to 45.58%.

Prompt Injection nlp