ML Security Papers

Latest papers

4 papers

defense arXiv Mar 18, 2026 · 19d ago

Suryasis Jana, Abhik Ghosh · Indian Statistical Institute

Unified robust neural training framework defending against both label noise and adversarial attacks via minimum-divergence estimation

Input Manipulation Attack Data Poisoning Attack vision

attack arXiv Jan 28, 2026 · 9w ago

Tanmay Karmakar, Sourav Saha, Debapriyo Majumdar et al. · Indian Statistical Institute

Single-word adversarial insertions achieve 91% success promoting target documents in BERT/monoT5 neural re-rankers

Input Manipulation Attack nlp

defense arXiv Jan 7, 2026 · 12w ago

Sharanya Dasgupta, Arkaprabha Basu, Sujoy Nath et al. · Indian Statistical Institute · University of Surrey +1 more

Defends LLMs against jailbreaks and hallucinations by steering hidden states via GAN-trained intervention without fine-tuning

Prompt Injection nlp

attack arXiv Sep 25, 2025 · Sep 2025

Anh Tu Ngo, Anupam Chattopadhyay, Subhamoy Maitra · Nanyang Technological University · Indian Statistical Institute

Cryptographic backdoors enable undetectable NN attacks and, repurposed defensively, provably robust watermarking and IP tracking

Model Poisoning Model Theft vision