Jiawei Chen

h-index: 4 77 citations 17 papers (total)

Papers in Database (3)

defense arXiv Nov 9, 2025 · Nov 2025

KG-DF: A Black-box Defense Framework against Jailbreak Attacks Based on Knowledge Graphs

Shuyuan Liu, Jiawei Chen, Xiao Yang et al. · East China Normal University · Zhongguancun Academy +1 more

Knowledge graph-based black-box defense that detects jailbreak intent via semantic parsing without accessing LLM internals

Prompt Injection nlp
PDF
defense arXiv Oct 13, 2025 · Oct 2025

Large Language Models Are Effective Code Watermarkers

Rui Xu, Jiawei Chen, Zhaoxia Yin et al. · East China Normal University · Fudan University

Embeds robust provenance watermarks in source code using LLM-driven semantic-preserving transformations, resisting obfuscation attacks

Output Integrity Attack nlp
PDF
defense arXiv Dec 20, 2025 · Dec 2025

Who Can See Through You? Adversarial Shielding Against VLM-Based Attribute Inference Attacks

Yucheng Fan, Jiawei Chen, Yu Tian et al. · East China Normal University · Zhongguancun Academy +1 more

Adversarial image perturbations shield social-media photos from VLM-based private attribute inference while preserving visual quality

Input Manipulation Attack visionmultimodal
PDF