Jonathan Rusert

h-index: 4 47 citations 14 papers (total)

Papers in Database (2)

attack EMNLP Sep 25, 2025 · Sep 2025

RedHerring Attack: Testing the Reliability of Attack Detection

Jonathan Rusert · Purdue University

Adversarial text attack that fools attack detectors into false positives while keeping the underlying NLP classifier correct, undermining human trust in detectors

Input Manipulation Attack nlp
PDF
attack EMNLP Sep 25, 2025 · Sep 2025

Overcoming Black-box Attack Inefficiency with Hybrid and Dynamic Select Algorithms

Abhinay Shankar Belde, Rohit Ramkumar, Jonathan Rusert · Purdue University

Proposes Hybrid/Dynamic Select strategies reducing black-box adversarial text attack queries by 25% across NLP classifiers and LLMs

Input Manipulation Attack Prompt Injection nlp
PDF