Kangjie Chen

h-index: 9 848 citations 25 papers (total)

Papers in Database (3)

attack arXiv Oct 9, 2025 · Oct 2025

When Search Goes Wrong: Red-Teaming Web-Augmented Large Language Models

Haoran Ou, Kangjie Chen, Xingshuo Han et al. · Nanyang Technological University · Nanjing University of Aeronautics and Astronautics +2 more

Red-teams web-augmented LLMs with benign-looking search queries that bypass safety filters and force harmful content citations

Prompt Injection nlp
1 citations PDF
defense arXiv Jan 13, 2026 · 11w ago

SafeRedir: Prompt Embedding Redirection for Robust Unlearning in Image Generation Models

Renyang Liu, Kangjie Chen, Han Qiu et al. · National University of Singapore · Nanyang Technological University +2 more

Inference-time prompt-embedding redirector blocks NSFW and copyright generation in diffusion models while resisting adversarial bypass attacks

Input Manipulation Attack visiongenerative
1 citations PDF Code
attack arXiv Jan 31, 2026 · 9w ago

DECEIVE-AFC: Adversarial Claim Attacks against Search-Enabled LLM-based Fact-Checking Systems

Haoran Ou, Kangjie Chen, Gelei Deng et al. · Nanyang Technological University · A*STAR

Agent-based adversarial claim attacks on search-augmented LLM fact-checkers disrupt retrieval and reasoning, dropping accuracy from 78.7% to 53.7%

Prompt Injection nlp
PDF