Renyang Liu

Papers in Database (1)

attack arXiv Mar 17, 2026 · 22d ago

REFORGE: Multi-modal Attacks Reveal Vulnerable Concept Unlearning in Image Generation Models

Yong Zou, Haoran Li, Fanxiao Li et al. · Yunnan University · Northeastern University +1 more

Black-box adversarial image prompt attack that bypasses concept unlearning in diffusion models, recovering erased copyrighted and harmful concepts

Input Manipulation Attack visionmultimodalgenerative
PDF Code