Jinyuan Jia

Papers in Database (3)

defense arXiv Jan 7, 2025 · Jan 2025

TrojanDec: Data-free Detection of Trojan Inputs in Self-supervised Learning

Yupei Liu, Yanting Wang, Jinyuan Jia · The Pennsylvania State University

Data-free defense that detects and removes trojan triggers from test inputs in self-supervised learning encoders

Model Poisoning vision
PDF
defense arXiv Apr 1, 2026 · 5d ago

AgentWatcher: A Rule-based Prompt Injection Monitor

Yanting Wang, Wei Zou, Runpeng Geng et al. · The Pennsylvania State University

Rule-based prompt injection detector using causal attribution to identify malicious context segments in long-context LLM agents

Prompt Injection Excessive Agency nlp
PDF Code
attack arXiv Mar 13, 2026 · 24d ago

PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection Defenses

Chenlong Yin, Runpeng Geng, Yanting Wang et al. · The Pennsylvania State University

RL-based adaptive prompt injection attack that systematically breaks state-of-the-art LLM defenses using entropy regularization and advantage weighting

Prompt Injection nlp
PDF Code