Hyesoo Hong

h-index: 2 · 5 citations · 3 papers (total)

Papers in Database (1)

defense · arXiv · Sep 27, 2025 · Sep 2025

A2D: Any-Order, Any-Step Safety Alignment for Diffusion Language Models

Wonje Jeung, Sangyeon Yoon, Yoonjun Cho et al. · Yonsei University

Token-level safety alignment for diffusion LLMs that blocks any-order jailbreaks and prefilling attacks, cutting the DIJA attack success rate from 80% to near zero.

Prompt Injection · nlp · generative
2 citations