Monthly publications
Paper types
benchmark 24
attack 13
tool 10
defense 4
survey 2
Domains
nlp 53
multimodal 5
vision 2
audio 1
reinforcement-learning 1
Co-occurring categories
Other OWASP categories that appear on the same papers
Top cited papers
142232415161718191101
OpenRT: An Open-Source Red Teaming Framework for Multimodal LLMs
2026 tool
Anecdoctoring: Automated Red-Teaming Across Language and Place
2025 attack
Guarding the Guardrails: A Taxonomy-Driven Approach to Jailbreak Detection
2025 benchmark
Red-Teaming Claude Opus and ChatGPT-based Security Advisors for Trusted Execution Environments
2026 benchmark
Learning-Based Automated Adversarial Red-Teaming for Robustness Evaluation of Large Language Models
2025 tool
Async Control: Stress-testing Asynchronous Control Measures for LLM Agents
2025 defense
StealthGraph: Exposing Domain-Specific Risks in LLMs through Knowledge-Graph-Guided Harmful Prompt Generation
2026 attack
Comparison requires valid measurement: Rethinking attack success rate comparisons in AI red teaming
2026 benchmark
SoK: Understanding (New) Security Issues Across AI4Code Use Cases
2025 survey
RedTWIZ: Diverse LLM Red Teaming via Adaptive Attack Planning
2025 tool