Kongcheng Zhang

Papers in Database (1)

attack arXiv Sep 18, 2025 · Sep 2025

MUSE: MCTS-Driven Red Teaming Framework for Enhanced Multi-Turn Dialogue Safety in Large Language Models

Siyu Yan, Long Zeng, Xuecheng Wu et al. · East China Normal University · Xi’an Jiaotong University +2 more

Attacks multi-turn LLM safety via MCTS-guided frame semantic trajectories; defends with early-intervention dialogue alignment

Prompt Injection nlp
PDF Code