Javad Forough

h-index: 3 · 24 citations · 9 papers (total)

Papers in Database (1)

defense · arXiv · Sep 27, 2025 · Sep 2025

GuardNet: Graph-Attention Filtering for Jailbreak Defense in Large Language Models

Javad Forough, Mohammad Maheri, Hamed Haddadi · Imperial College London

GNN-based hierarchical filter detects and localizes jailbreak prompts in LLMs, achieving 99.8% F1 on LLM-Fuzzer

Prompt Injection · nlp · graph
1 citation · PDF