Basel Alomair

h-index: 20 1,621 citations 79 papers (total)

Papers in Database (3)

defense arXiv Oct 22, 2025 · Oct 2025

Defending Against Prompt Injection with DataFilter

Yizhu Wang, Sizhe Chen, Raghad Alkhudair et al. · UC Berkeley · KACST

Defends LLM agents against indirect prompt injection by filtering malicious instructions from external data before LLM processing

Prompt Injection nlp
9 citations PDF Code
defense arXiv Sep 30, 2025 · Sep 2025

Better Privilege Separation for Agents by Restricting Data Types

Dennis Jacob, Emad Alghamdi, Zhanhao Hu et al. · University of California · HUMAIN +1 more

Defends LLM agents against prompt injection by converting untrusted third-party content into restricted data types, eliminating the injection surface

Prompt Injection nlp
1 citations PDF
attack arXiv Oct 20, 2025 · Oct 2025

BadScientist: Can a Research Agent Write Convincing but Unsound Papers that Fool LLM Reviewers?

Fengqing Jiang, Yichen Feng, Yuetai Li et al. · University of Washington · King Abdulaziz City for Science and Technology

LLM agent generates fabricated, experimentless papers that fool multi-model AI review systems via presentation-manipulation strategies

Prompt Injection Excessive Agency nlp
PDF