Latest papers

4 papers
defense · arXiv · Mar 18, 2026

rSDNet: Unified Robust Neural Learning against Label Noise and Adversarial Attacks

Suryasis Jana, Abhik Ghosh · Indian Statistical Institute

Unified robust neural training framework defending against both label noise and adversarial attacks via minimum-divergence estimation

Input Manipulation Attack · Data Poisoning Attack · vision
attack · arXiv · Jan 28, 2026

One Word is Enough: Minimal Adversarial Perturbations for Neural Text Ranking

Tanmay Karmakar, Sourav Saha, Debapriyo Majumdar et al. · Indian Statistical Institute

Single-word adversarial insertions achieve 91% success promoting target documents in BERT/monoT5 neural re-rankers

Input Manipulation Attack · nlp
defense · arXiv · Jan 7, 2026

ARREST: Adversarial Resilient Regulation Enhancing Safety and Truth in Large Language Models

Sharanya Dasgupta, Arkaprabha Basu, Sujoy Nath et al. · Indian Statistical Institute · University of Surrey +1 more

Defends LLMs against jailbreaks and hallucinations by steering hidden states via GAN-trained intervention without fine-tuning

Prompt Injection · nlp
attack · arXiv · Sep 25, 2025

Cryptographic Backdoor for Neural Networks: Boon and Bane

Anh Tu Ngo, Anupam Chattopadhyay, Subhamoy Maitra · Nanyang Technological University · Indian Statistical Institute

Cryptographic backdoors enable undetectable NN attacks and, repurposed defensively, provably robust watermarking and IP tracking

Model Poisoning · Model Theft · vision