Xiaoran Xu

Papers in Database (1)

survey arXiv Aug 7, 2025 · Aug 2025

Guardians and Offenders: A Survey on Harmful Content Generation and Safety Mitigation of LLM

Chi Zhang, Changjia Zhu, Junjie Xiong et al. · University of South Florida · Missouri University of Science and Technology

Surveys LLM jailbreaking attacks, unintentional toxicity, multimodal exploits, and safety mitigations including RLHF and alignment

Prompt Injection nlpmultimodal
PDF