defense 2026

Mitigating the OWASP Top 10 For Large Language Models Applications using Intelligent Agents

Mohammad Fasha , Faisal Abul Rub , Nasim Matar , Bilal Sowan , Mohammad Al Khaldy

7 citations · 1 influential · 18 references · International Conference Contr...

α

Published on arXiv

2601.18105

Prompt Injection

OWASP LLM Top 10 — LLM01

Sensitive Information Disclosure

OWASP LLM Top 10 — LLM06

Excessive Agency

OWASP LLM Top 10 — LLM08

Key Finding

Presents a conceptual framework using LLM-enabled intelligent agents as a unified defense layer across all 10 OWASP LLM security vulnerability classes.


Large Language Models (LLMs) have emerged as a transformative and disruptive technology, enabling a wide range of applications in natural language processing, machine translation, and beyond. However, this widespread integration of LLMs also raised several security concerns highlighted by the Open Web Application Security Project (OWASP), which has identified the top 10 security vulnerabilities inherent in LLM applications. Addressing these vulnerabilities is crucial, given the increasing reliance on LLMs and the potential threats to data integrity, confidentiality, and service availability. This paper presents a framework designed to mitigate the security risks outlined in the OWASP Top 10. Our proposed model leverages LLM-enabled intelligent agents, offering a new approach to proactively identify, assess, and counteract security threats in real-time. The proposed framework serves as an initial blueprint for future research and development, aiming to enhance the security measures of LLMs and protect against emerging threats in this rapidly evolving landscape.


Key Contributions

  • Proposes an LLM-agent-based framework for proactively identifying, assessing, and counteracting the full OWASP LLM Top 10 security vulnerabilities in real time.
  • Introduces intelligent agents as a unified mitigation layer across diverse LLM threat categories (prompt injection, data leakage, denial of service, etc.).
  • Provides an initial research blueprint intended to guide future development of comprehensive LLM application security systems.

🛡️ Threat Analysis


Details

Domains
nlp
Model Types
llm
Threat Tags
inference_time
Applications
llm applicationsconversational ainatural language processing systems