Dacheng Tao

Papers in Database (5)

defense arXiv Apr 21, 2026 · 4w ago

Distillation Traps and Guards: A Calibration Knob for LLM Distillability

Weixiao Zhan, Yongcheng Jing, Leszek Rutkowski et al. · Nanyang Technological University · Polish Academy of Sciences +2 more

Calibrates teacher LLMs to be either distillable for better knowledge transfer or undistillable for model IP protection

Model Theft Model Theft nlp
PDF
benchmark arXiv Aug 27, 2025 · Aug 2025

SoK: Large Language Model Copyright Auditing via Fingerprinting

Shuo Shao, Yiming Li, Yu He et al. · Zhejiang University · Nanyang Technological University +3 more

Surveys LLM fingerprinting for copyright auditing and benchmarks 13 post-development robustness techniques across 149 model instances

Model Theft Model Theft nlp
PDF Code
attack arXiv Mar 13, 2026 · 10w ago

CtrlAttack: A Unified Attack on World-Model Control in Diffusion Models

Shuhan Xu, Siyuan Liang, Hongling Zheng et al. · Wuhan University · Nanyang Technological University +1 more

Adversarial attack on diffusion I2V models that disrupts temporal consistency via low-dimensional velocity field perturbations

Input Manipulation Attack visiongenerative
PDF
defense arXiv Aug 4, 2025 · Aug 2025

Coward: Collision-based Watermark for Proactive Federated Backdoor Detection

Wenjie Li, Siying Gu, Yiming Li et al. · Tsinghua University · East China Normal University +1 more

Defends federated learning against backdoor attacks using multi-backdoor collision effects to create a server-injected detection watermark

Model Poisoning federated-learningvision
PDF Code
defense arXiv Sep 3, 2025 · Sep 2025

PromptCOS: Towards Content-only System Prompt Copyright Auditing for LLMs

Yuchen Yang, Yiming Li, Hongwei Yao et al. · Zhejiang University · Nanyang Technological University +2 more

Watermarks LLM system prompts with content-only verification to detect prompt theft without requiring access to model logits

Model Theft Sensitive Information Disclosure nlp
PDF Code