Monthly publications
Paper types
attack 24
defense 13
survey 3
benchmark 2
tool 1
Domains
nlp 43
vision 4
reinforcement-learning 4
federated-learning 4
multimodal 1
graph 1
generative 1
Co-occurring categories
Other OWASP categories that appear on the same papers
Top cited papers
1. Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples
2025 attack, 32 citations
2. Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs
2025 attack, 10 citations
3. Adaptive Defense against Harmful Fine-Tuning for Large Language Models via Bayesian Data Scheduler
2025 defense, 5 citations
4. Virus Infection Attack on LLMs: Your Poisoning Can Spread "VIA" Synthetic Data
2025 attack, 2 citations
5. Subliminal Corruption: Mechanisms, Thresholds, and Interpretability
2025 benchmark, 2 citations
6. AutoBackdoor: Automating Backdoor Attacks via LLM Agents
2025 attack, 2 citations
7. Data Poisoning Vulnerabilities Across Healthcare AI Architectures: A Security Threat Analysis
2025 survey, 1 citation
8. Cost-Minimized Label-Flipping Poisoning Attack to LLM Alignment
2025 attack, 1 citation
9. Incentive-Aware AI Safety via Strategic Resource Allocation: A Stackelberg Security Games Perspective
2026 defense, 0 citations
10. Understanding and Mitigating Dataset Corruption in LLM Steering
2026 defense, 0 citations