Latest papers

8 papers
defense arXiv Mar 13, 2026 · 24d ago

STRAP-ViT: Segregated Tokens with Randomized -- Transformations for Defense against Adversarial Patches in ViTs

Nandish Chattopadhyay, Anadi Goyal, Chandan Karfa et al. · Indian Institute of Technology · Nanyang Technological University

Detects and neutralizes adversarial patches on ViTs by identifying anomalous tokens and applying randomized transformations

Input Manipulation Attack vision
PDF
defense arXiv Dec 19, 2025 · Dec 2025

Verifiability-First Agents: Provable Observability and Lightweight Audit Agents for Controlling Autonomous LLM Systems

Abhivansh Gupta · Indian Institute of Technology

Proposes cryptographic attestation architecture and benchmark to detect and remediate misaligned autonomous LLM agents

Excessive Agency Prompt Injection nlp
PDF
defense arXiv Dec 11, 2025 · Dec 2025

D2M: A Decentralized, Privacy-Preserving, Incentive-Compatible Data Marketplace for Collaborative Learning

Yash Srivastava, Shalin Jain, Sneha Awathare et al. · Indian Institute of Technology · Eastern University Pennsylvania

Defends federated learning against Byzantine participants via Corrected OSMD aggregation in a blockchain-arbitrated decentralized data marketplace

Data Poisoning Attack federated-learning
PDF
tool arXiv Nov 27, 2025 · Nov 2025

INSIGHT: An Interpretable Neural Vision-Language Framework for Reasoning of Generative Artifacts

Anshul Bagaria · Indian Institute of Technology

Proposes interpretable multimodal deepfake detector combining super-resolution and VLM reasoning, robust to extreme image degradation

Output Integrity Attack visionmultimodal
PDF
defense arXiv Oct 31, 2025 · Oct 2025

C-LEAD: Contrastive Learning for Enhanced Adversarial Defense

Suklav Ghosh, Sonal Kumar, Arijit Sur · Indian Institute of Technology

Defends DNNs against adversarial examples by incorporating contrastive loss into adversarial training to learn robust representations

Input Manipulation Attack vision
1 citations PDF
defense arXiv Sep 25, 2025 · Sep 2025

DAC-LoRA: Dynamic Adversarial Curriculum for Efficient and Robust Few-Shot Adaptation

Ved Umrajkar · Indian Institute of Technology

Defends CLIP-based VLMs against adversarial attacks by embedding a dynamic curriculum adversarial training into LoRA fine-tuning

Input Manipulation Attack visionmultimodal
PDF
attack arXiv Aug 3, 2025 · Aug 2025

"Energon": Unveiling Transformers from GPU Power and Thermal Side-Channels

Arunava Chaudhuri, Shubhi Shukla, Sarani Bhattacharya et al. · Indian Institute of Technology

GPU power and thermal side-channels leak transformer architecture details, enabling model theft and 93%+ black-box adversarial transfer attacks

Model Theft Input Manipulation Attack nlpvision
PDF
survey arXiv Jan 2, 2025 · Jan 2025

State-of-the-art AI-based Learning Approaches for Deepfake Generation and Detection, Analyzing Opportunities, Threading through Pros, Cons, and Future Prospects

Harshika Goyal, Mohammad Saif Wajid, Mohd Anas Wajid et al. · Indian Institute of Technology · Tecnológico de Monterrey +6 more

Surveys ~400 papers on deepfake generation (GANs, VAEs, Transformers) and detection, benchmarking datasets and future challenges

Output Integrity Attack visiongenerative
5 citations PDF