Mitigating the OWASP Top 10 For Large Language Models Applications using Intelligent Agents
Mohammad Fasha , Faisal Abul Rub , Nasim Matar , Bilal Sowan , Mohammad Al Khaldy
Published on arXiv
2601.18105
Prompt Injection
OWASP LLM Top 10 — LLM01
Sensitive Information Disclosure
OWASP LLM Top 10 — LLM06
Excessive Agency
OWASP LLM Top 10 — LLM08
Key Finding
Presents a conceptual framework using LLM-enabled intelligent agents as a unified defense layer across all 10 OWASP LLM security vulnerability classes.
Large Language Models (LLMs) have emerged as a transformative and disruptive technology, enabling a wide range of applications in natural language processing, machine translation, and beyond. However, this widespread integration of LLMs also raised several security concerns highlighted by the Open Web Application Security Project (OWASP), which has identified the top 10 security vulnerabilities inherent in LLM applications. Addressing these vulnerabilities is crucial, given the increasing reliance on LLMs and the potential threats to data integrity, confidentiality, and service availability. This paper presents a framework designed to mitigate the security risks outlined in the OWASP Top 10. Our proposed model leverages LLM-enabled intelligent agents, offering a new approach to proactively identify, assess, and counteract security threats in real-time. The proposed framework serves as an initial blueprint for future research and development, aiming to enhance the security measures of LLMs and protect against emerging threats in this rapidly evolving landscape.
Key Contributions
- Proposes an LLM-agent-based framework for proactively identifying, assessing, and counteracting the full OWASP LLM Top 10 security vulnerabilities in real time.
- Introduces intelligent agents as a unified mitigation layer across diverse LLM threat categories (prompt injection, data leakage, denial of service, etc.).
- Provides an initial research blueprint intended to guide future development of comprehensive LLM application security systems.