defense 2026

SIDeR: Semantic Identity Decoupling for Unrestricted Face Privacy

0 citations · 63 references · arXiv (Cornell University)

Published on arXiv

2602.04994

Input Manipulation Attack

OWASP ML Top 10 — ML01

Key Finding

Achieves 99% black-box attack success rate against face recognition systems and surpasses baseline restoration quality by 41.28% in PSNR on CelebA-HQ and FFHQ.

SIDeR

Novel technique introduced

With the deep integration of facial recognition into online banking, identity verification, and other networked services, achieving effective decoupling of identity information from visual representations during image storage and transmission has become a critical challenge for privacy protection. To address this issue, we propose SIDeR, a Semantic decoupling-driven framework for unrestricted face privacy protection. SIDeR decomposes a facial image into a machine-recognizable identity feature vector and a visually perceptible semantic appearance component. By leveraging semantic-guided recomposition in the latent space of a diffusion model, it generates visually anonymous adversarial faces while maintaining machine-level identity consistency. The framework incorporates momentum-driven unrestricted perturbation optimization and a semantic-visual balancing factor to synthesize multiple visually diverse, highly natural adversarial samples. Furthermore, for authorized access, the protected image can be restored to its original form when the correct password is provided. Extensive experiments on the CelebA-HQ and FFHQ datasets demonstrate that SIDeR achieves a 99% attack success rate in black-box scenarios and outperforms baseline methods by 41.28% in PSNR-based restoration quality.

Key Contributions

SIDeR framework that decomposes facial images into identity feature vectors and semantic appearance components, then recomposes them in diffusion model latent space to generate visually anonymous adversarial faces
Momentum-driven unrestricted perturbation optimization with a semantic-visual balancing factor for diverse, natural-looking adversarial samples
Password-controlled reversible restoration that reconstructs the original face with high fidelity (41.28% PSNR improvement over baselines) for authorized access

🛡️ Threat Analysis

Input Manipulation Attack

SIDeR crafts adversarial facial images that cause face recognition models to fail identification at inference time — this is a classic evasion/input manipulation attack. The momentum-driven unrestricted perturbation optimization and diffusion-based semantic recomposition are adversarial example generation techniques specifically designed to defeat ML-based face recognition systems in black-box settings.

Details

Domains

visiongenerative

Model Types

diffusioncnn

Threat Tags

black_boxinference_timeuntargeteddigital

Datasets

CelebA-HQFFHQ

Applications

facial recognitionidentity verificationface privacy protection

Read PDF arXiv DOI

SIDeR: Semantic Identity Decoupling for Unrestricted Face Privacy

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

Machine Pareidolia: Protecting Facial Image with Emotional Editing

TAIGen: Training-Free Adversarial Image Generation via Diffusion Models

MANI-Pure: Magnitude-Adaptive Noise Injection for Adversarial Purification

Off-The-Shelf Image-to-Image Models Are All You Need To Defeat Image Protection Schemes

Architecture-Agnostic Feature Synergy for Universal Defense Against Heterogeneous Generative Threats

Deep learning models are vulnerable, but adversarial examples are even more vulnerable

AdLift: Lifting Adversarial Perturbations to Safeguard 3D Gaussian Splatting Assets Against Instruction-Driven Editing

SC-Pro: Training-Free Framework for Defending Unsafe Image Synthesis Attack