Yuyou Gan

Papers in Database (1)

tool arXiv Sep 4, 2025 · Sep 2025

NeuroBreak: Unveil Internal Jailbreak Mechanisms in Large Language Models

Chuhan Zhang, Ye Zhang, Bowen Shi et al. · Zhejiang University

Builds neuron-level analysis tool to dissect LLM jailbreak mechanisms via layer-wise probing and critical neuron identification

Prompt Injection nlp
PDF