ML Security Papers

Latest papers

8 papers

defense arXiv Mar 13, 2026

STRAP-ViT: Segregated Tokens with Randomized Transformations for Defense against Adversarial Patches in ViTs

Nandish Chattopadhyay, Anadi Goyal, Chandan Karfa et al. · Indian Institute of Technology · Nanyang Technological University

Detects and neutralizes adversarial patches on ViTs by identifying anomalous tokens and applying randomized transformations

Input Manipulation Attack · vision


defense arXiv Dec 19, 2025

Verifiability-First Agents: Provable Observability and Lightweight Audit Agents for Controlling Autonomous LLM Systems

Abhivansh Gupta · Indian Institute of Technology

Proposes cryptographic attestation architecture and benchmark to detect and remediate misaligned autonomous LLM agents

Excessive Agency · Prompt Injection · nlp


defense arXiv Dec 11, 2025

D2M: A Decentralized, Privacy-Preserving, Incentive-Compatible Data Marketplace for Collaborative Learning

Yash Srivastava, Shalin Jain, Sneha Awathare et al. · Indian Institute of Technology · Eastern University Pennsylvania

Defends federated learning against Byzantine participants via Corrected OSMD aggregation in a blockchain-arbitrated decentralized data marketplace

Data Poisoning Attack · federated-learning


tool arXiv Nov 27, 2025

INSIGHT: An Interpretable Neural Vision-Language Framework for Reasoning of Generative Artifacts

Anshul Bagaria · Indian Institute of Technology

Proposes interpretable multimodal deepfake detector combining super-resolution and VLM reasoning, robust to extreme image degradation

Output Integrity Attack · vision · multimodal


defense arXiv Oct 31, 2025

C-LEAD: Contrastive Learning for Enhanced Adversarial Defense

Suklav Ghosh, Sonal Kumar, Arijit Sur · Indian Institute of Technology

Defends DNNs against adversarial examples by incorporating contrastive loss into adversarial training to learn robust representations

Input Manipulation Attack · vision

1 citation

defense arXiv Sep 25, 2025

DAC-LoRA: Dynamic Adversarial Curriculum for Efficient and Robust Few-Shot Adaptation

Ved Umrajkar · Indian Institute of Technology

Defends CLIP-based VLMs against adversarial attacks by embedding dynamic curriculum adversarial training into LoRA fine-tuning

Input Manipulation Attack · vision · multimodal


attack arXiv Aug 3, 2025

"Energon": Unveiling Transformers from GPU Power and Thermal Side-Channels

Arunava Chaudhuri, Shubhi Shukla, Sarani Bhattacharya et al. · Indian Institute of Technology

GPU power and thermal side-channels leak transformer architecture details, enabling model theft and 93%+ black-box adversarial transfer attacks

Model Theft · Input Manipulation Attack · nlp · vision


survey arXiv Jan 2, 2025

State-of-the-art AI-based Learning Approaches for Deepfake Generation and Detection, Analyzing Opportunities, Threading through Pros, Cons, and Future Prospects

Harshika Goyal, Mohammad Saif Wajid, Mohd Anas Wajid et al. · Indian Institute of Technology · Tecnológico de Monterrey +6 more

Surveys ~400 papers on deepfake generation (GANs, VAEs, Transformers) and detection, benchmarks leading approaches on prominent datasets, and discusses future challenges

Output Integrity Attack · vision · generative

5 citations

The rapid advancement of deepfake technologies, designed to create highly lifelike facial imagery and video content, has drawn considerable interest across many fields, including forensic analysis, cybersecurity, and the creation of digital characters. Recent deep learning methods, such as Generative Adversarial Networks, Variational Autoencoders, Few-Shot Learning strategies, and Transformers, have produced striking results in deepfake generation. In parallel, detection technologies are being developed to counteract the misuse of deepfakes, addressing concerns that range from political manipulation to the dissemination of fake news and cyberbullying. This review investigates the most recent developments in deepfake generation and detection, covering around 400 publications and analyzing the innovations shaping this rapidly evolving field. Starting with an examination of systematic literature review methodologies, we detail the techniques used for deepfake generation, addressing the challenges faced, the potential solutions available, and the details of manipulation formulations. The paper then benchmarks leading approaches against prominent datasets and assesses the contributions that have significantly impacted these domains. Finally, we discuss the existing challenges, pointing the way toward further advances in this area.

gan · diffusion · transformer · rnn

