Ningning Ding

Papers in Database (1)

defense arXiv Apr 17, 2026 · 4w ago

CiPO: Counterfactual Unlearning for Large Reasoning Models through Iterative Preference Optimization

Junyi Li, Yongqiang Chen, Ningning Ding · The Hong Kong University of Science and Technology · The Chinese University of Hong Kong

Unlearns knowledge from reasoning model CoT traces via iterative preference optimization, evaluated against membership inference attacks

Membership Inference Attack nlp
PDF Code