Ee-Chien Chang

defense arXiv Mar 18, 2026 · 9w ago

Proof-of-Authorship for Diffusion-based AI Generated Content

De Zhang Lee, Han Fang, Ee-Chien Chang · National University of Singapore

Cryptographic proof-of-authorship for diffusion-generated images by binding generation seeds to author identity using pseudorandom functions

Output Integrity Attack visiongenerative

PDF

defense arXiv Sep 15, 2025 · Sep 2025

Removal Attack and Defense on AI-generated Content Latent-based Watermarking

De Zhang Lee, Han Fang, Hanyi Wang et al. · National University of Singapore · Shanghai Jiao Tong University

Attacks latent-based AIGC watermarks via boundary leakage, cutting removal distortion 15×; defends with secret boundary transformation provably equal to white-noise.

Output Integrity Attack visiongenerative

PDF

attack arXiv Mar 7, 2026 · 10w ago

Targeted Bit-Flip Attacks on LLM-Based Agents

Jialai Wang, Ya Wen, Zhongmou Liu et al. · National University of Singapore · Tsinghua University +1 more

Flip-Agent exploits hardware bit-flips to corrupt LLM agent weights, hijacking tool calls and final outputs in multi-stage pipelines

Model Poisoning Excessive Agency nlp

PDF

defense arXiv Apr 28, 2026 · 23d ago

SnapGuard: Lightweight Prompt Injection Detection for Screenshot-Based Web Agents

Mengyao Du, Han Fang, Haokai Ma et al. · National University of Defense Technology · University of Science and Technology of China +2 more

Lightweight detector that identifies prompt injection attacks in web agent screenshots using visual gradient analysis and text recovery

Prompt Injection Excessive Agency multimodalnlp

PDF

Papers in Database (4)

Proof-of-Authorship for Diffusion-based AI Generated Content

Removal Attack and Defense on AI-generated Content Latent-based Watermarking

Targeted Bit-Flip Attacks on LLM-Based Agents

SnapGuard: Lightweight Prompt Injection Detection for Screenshot-Based Web Agents