Shagufta Mehnaz

attack EMNLP Nov 5, 2025 · Nov 2025

From Insight to Exploit: Leveraging LLM Collaboration for Adaptive Adversarial Text Generation

Najrin Sultana, Md Rafi Ur Rashid, Kang Gu et al. · The Pennsylvania State University · Dartmouth College

LLM-driven adversarial text generation that fools LLM classifiers via semantic-preserving perturbations without gradient access

Prompt Injection nlp

PDF Code

attack arXiv Feb 8, 2026 · 8w ago

Robustness of Vision Language Models Against Split-Image Harmful Input Attacks

Md Rafi Ur Rashid, MD Sadik Hossain Shanto, Vishnu Asutosh Dasu et al. · Pennsylvania State University · Bangladesh University of Engineering and Technology

Exploits VLM safety alignment gaps using split-image inputs to jailbreak modern VLMs with 60% better transfer than baselines

Input Manipulation Attack Prompt Injection visionmultimodalnlp

PDF

Papers in Database (2)

From Insight to Exploit: Leveraging LLM Collaboration for Adaptive Adversarial Text Generation

Robustness of Vision Language Models Against Split-Image Harmful Input Attacks