Latest papers

2 papers
tool arXiv Mar 18, 2026 · 19d ago

LAAF: Logic-layer Automated Attack Framework A Systematic Red-Teaming Methodology for LPCI Vulnerabilities in Agentic Large Language Model Systems

Hammad Atta, Ken Huang, Kyriakos Rock Lambros et al. · Qorvex Consulting · Distributedapps.ai +8 more

Automated red-teaming framework for multi-stage prompt injection attacks on agentic LLMs with persistent memory and RAG

Prompt Injection Excessive Agency nlp
PDF
survey arXiv Feb 6, 2026 · 8w ago

Trojans in Artificial Intelligence (TrojAI) Final Report

Kristopher W. Reese, Taylor Kulp-McDowall, Michael Majurski et al. · IARPA · NIST +13 more

Surveys IARPA TrojAI program findings on AI backdoor detection via weight analysis and trigger inversion across multi-year research

Model Poisoning visionnlp
PDF