Monthly publications
Paper types
attack 28
defense 16
survey 3
benchmark 3
tool 1
Domains
nlp 51
reinforcement-learning 4
federated-learning 4
vision 4
generative 2
graph 1
multimodal 1
Co-occurring categories
Other OWASP categories that appear on the same papers
Top cited papers
13221035425262718190100
Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples
2025 attack
Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs
2025 attack
Adaptive Defense against Harmful Fine-Tuning for Large Language Models via Bayesian Data Scheduler
2025 defense
AutoBackdoor: Automating Backdoor Attacks via LLM Agents
2025 attack
Subliminal Corruption: Mechanisms, Thresholds, and Interpretability
2025 benchmark
Virus Infection Attack on LLMs: Your Poisoning Can Spread "VIA" Synthetic Data
2025 attack
Cost-Minimized Label-Flipping Poisoning Attack to LLM Alignment
2025 attack
Data Poisoning Vulnerabilities Across Healthcare AI Architectures: A Security Threat Analysis
2025 survey
RAG-targeted Adversarial Attack on LLM-based Threat Detection and Mitigation Framework
2025 attack
Thought-Transfer: Indirect Targeted Poisoning Attacks on Chain-of-Thought Reasoning Models
2026 attack