Latest papers

2 papers
defense arXiv Mar 12, 2026 · 25d ago

OrthoEraser: Coupled-Neuron Orthogonal Projection for Concept Erasure

Chuancheng Shi, Wenhua Wu, Fei Shen et al. · University of Sydney · National University of Singapore +2 more

Defends T2I diffusion models from adversarial induction of harmful content via orthogonal projection that preserves benign semantic subspaces during concept erasure

Prompt Injection visiongenerative
PDF
defense arXiv Mar 9, 2026 · 28d ago

Client-Cooperative Split Learning

Haiyu Deng, Yanna Jiang, Guangsheng Yu et al. · University of Technology Sydney · CSIRO Data61 +1 more

Defends split learning against activation inversion, label clustering, and model extraction via DP and chained watermarking

Model Inversion Attack Model Theft federated-learningvision
PDF