Shagufta Mehnaz

h-index: 14 · 722 citations · 34 papers (total)

Papers in Database (2)

attack EMNLP Nov 5, 2025

From Insight to Exploit: Leveraging LLM Collaboration for Adaptive Adversarial Text Generation

Najrin Sultana, Md Rafi Ur Rashid, Kang Gu et al. · The Pennsylvania State University · Dartmouth College

LLM-driven adversarial text generation that fools LLM classifiers via semantic-preserving perturbations without gradient access

Prompt Injection nlp
PDF Code
attack arXiv Feb 8, 2026

Robustness of Vision Language Models Against Split-Image Harmful Input Attacks

Md Rafi Ur Rashid, MD Sadik Hossain Shanto, Vishnu Asutosh Dasu et al. · Pennsylvania State University · Bangladesh University of Engineering and Technology

Exploits VLM safety alignment gaps using split-image inputs to jailbreak modern VLMs with 60% better transfer than baselines

Input Manipulation Attack Prompt Injection vision multimodal nlp
PDF