Anti-Tamper Protection for Unauthorized Individual Image Generation
Zelin Li, Ruohan Zong, Yifan Liu, Ruichen Yao, Yaokun Liu, Yang Zhang, Dong Wang
Published on arXiv: 2508.06325
Output Integrity Attack
OWASP ML Top 10 — ML09
Key Finding
By detecting tampering via an embedded authorization signal, ATP maintains effective forgery protection even when attackers apply purification techniques to remove standard protective perturbations.
Anti-Tamper Perturbation (ATP)
Novel technique introduced
With the advancement of personalized image generation technologies, concerns about forgery attacks that infringe on portrait rights and privacy are growing. To address these concerns, protection perturbation algorithms have been developed to disrupt forgery generation. However, these protection algorithms become ineffective when forgery attackers apply purification techniques to bypass them. To address this issue, we present a novel approach, Anti-Tamper Perturbation (ATP). ATP introduces a tamper-proof mechanism within the perturbation. It consists of protection and authorization perturbations, where the protection perturbation defends against forgery attacks, while the authorization perturbation detects purification-based tampering. Both perturbations are applied in the frequency domain under the guidance of a mask, ensuring that the protection perturbation does not disrupt the authorization perturbation. This design also enables the authorization perturbation to be distributed across all image pixels, preserving its sensitivity to purification-based tampering. Extensive experiments demonstrate ATP's effectiveness in defending against forgery attacks across various attack settings, providing a robust solution for protecting individuals' portrait rights and privacy. Our code is available at: https://github.com/Seeyn/Anti-Tamper-Perturbation.
Key Contributions
- Anti-Tamper Perturbation (ATP) framework combining a protection perturbation (disrupts personalized model training) with an authorization perturbation (detects purification-based tampering)
- Frequency-domain application guided by a mask that keeps the protection perturbation from interfering with the authorization signal, while spreading the authorization perturbation across all image pixels so it remains sensitive to tampering
- Demonstrated robustness against purification-based bypass attacks across multiple attack settings for portrait-rights protection
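The masked frequency-domain split described above can be illustrated with a toy NumPy sketch. Everything here is an assumption for illustration, not the paper's algorithm: the names (`embed_atp`, `extract_bits`), the hand-built DCT, the QIM-style bit quantization standing in for the authorization perturbation, the Gaussian noise standing in for the adversarial protection perturbation, and a mean blur standing in for purification. The point it demonstrates is structural: because the two perturbations occupy disjoint masked slots in the frequency domain, the protection noise cannot corrupt the authorization bits, yet any pixel-domain purification scrambles them.

```python
import numpy as np

def dct_matrix(n):
    # Orthonormal DCT-II basis built explicitly, so this sketch
    # needs only NumPy (no SciPy dependency).
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    C = np.sqrt(2.0 / n) * np.cos(np.pi * (i + 0.5) * k / n)
    C[0] /= np.sqrt(2.0)
    return C

def embed_atp(img, mask, auth_bits, prot_strength=2.0, q=12.0, seed=0):
    # mask=True  -> authorization slots (quantized to carry bits)
    # mask=False -> protection slots (random noise as a crude proxy
    #               for an adversarial protection perturbation)
    n = img.shape[0]
    C = dct_matrix(n)
    Y = C @ img @ C.T                       # 2D DCT of the image
    bits = np.resize(auth_bits, int(mask.sum()))
    # QIM-style embedding: park each masked coefficient at the 1/4
    # (bit 0) or 3/4 (bit 1) point of its quantization bin.
    y = Y[mask]
    Y[mask] = q * (np.floor(y / q) + 0.25 + 0.5 * bits)
    # Protection perturbation only on complementary coefficients,
    # so it cannot disturb the authorization signal.
    rng = np.random.default_rng(seed)
    Y[~mask] += rng.normal(0.0, prot_strength, int((~mask).sum()))
    return C.T @ Y @ C                      # back to pixel domain
                                            # (clipping to [0,255] omitted
                                            # to keep extraction exact)

def extract_bits(img, mask, q=12.0):
    n = img.shape[0]
    C = dct_matrix(n)
    Y = C @ img @ C.T
    frac = Y[mask] / q - np.floor(Y[mask] / q)
    return (frac > 0.5).astype(int)

# --- demo ---
rng = np.random.default_rng(1)
img = rng.uniform(0, 255, (32, 32))
mask = np.random.default_rng(42).random((32, 32)) < 0.25  # auth slots
bits = rng.integers(0, 2, 64)

protected = embed_atp(img, mask, bits)
got = extract_bits(protected, mask)
print("clean match rate:", (got == np.resize(bits, int(mask.sum()))).mean())

# "Purification" proxy: a 3x3 mean blur destroys the embedded signal,
# flagging the image as tampered.
k = np.ones((3, 3)) / 9.0
pad = np.pad(protected, 1, mode="edge")
blurred = sum(pad[i:i+32, j:j+32] * k[i, j]
              for i in range(3) for j in range(3))
got_b = extract_bits(blurred, mask)
print("after blur match rate:", (got_b == np.resize(bits, int(mask.sum()))).mean())
```

On a protected image the authorization bits extract perfectly despite the protection noise; after the blur the match rate collapses toward chance, which is the tamper-detection signal.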
🛡️ Threat Analysis
The paper defends against unauthorized personalized image generation (AI-generated forgery/deepfakes) by adding protective perturbations to images, and it specifically addresses purification attacks that strip those perturbations. Under the classification rules, removing or defeating anti-deepfake perturbations is an ML09 attack on content integrity, and this paper proposes a tamper-evident defense against such removal.