Fan Yang

attack · arXiv · Dec 5, 2025

Fan Yang · Jinan University

Jailbreaks LLMs by rewriting harmful prompts into safe-isomorphic ones, generating responses to the rewritten prompts, then reverse-mapping those responses into harmful outputs

Prompt Injection · nlp

Papers in Database (1)