Latest papers

3 papers
survey arXiv Feb 6, 2026 · 8w ago

Trojans in Artificial Intelligence (TrojAI) Final Report

Kristopher W. Reese, Taylor Kulp-McDowall, Michael Majurski et al. · IARPA · NIST +13 more

Surveys IARPA TrojAI program findings on AI backdoor detection via weight analysis and trigger inversion across multi-year research

Model Poisoning visionnlp
PDF
attack arXiv Nov 27, 2025 · Nov 2025

PROMPTMINER: Black-Box Prompt Stealing against Text-to-Image Generative Models via Reinforcement Learning and Fuzz Optimization

Mingzhe Li, Renhao Zhang, Zhiyang Wen et al. · University of Massachusetts · Dolby Laboratories

Black-box RL+fuzzing attack that recovers valuable text prompts from T2I model outputs, enabling unauthorized prompt IP theft

Model Theft visionnlpgenerative
PDF Code
defense BigData Congress Nov 3, 2025 · Nov 2025

RobustFSM: Submodular Maximization in Federated Setting with Malicious Clients

Duc A. Tran, Dung Truong, Duy Le · University of Massachusetts

Defends federated submodular maximization against malicious clients sharing fake local information, improving solution quality by up to 200%

Data Poisoning Attack federated-learning
PDF