Self Voice Conversion as an Attack against Neural Audio Watermarking

Audio watermarking embeds auxiliary information into speech while maintaining speaker identity, linguistic content, and perceptual quality. Although recent advances in neural and digital signal processing-based watermarking methods have improved imperceptibility and embedding capacity, robustness is still primarily assessed against conventional distortions such as compression, additive noise, and resampling. However, the rise of deep learning-based attacks introduces novel and significant threats to watermark security. In this work, we investigate self voice conversion as a universal, content-preserving attack against audio watermarking systems. Self voice conversion remaps a speaker's voice to the same identity while altering acoustic characteristics through a voice conversion model. We demonstrate that this attack severely degrades the reliability of state-of-the-art watermarking approaches and highlight its implications for the security of modern audio watermarking techniques.

Key Contributions

Introduces self voice conversion (self VC) as a novel, universal, content-preserving attack against audio watermarking systems
Demonstrates that self VC severely degrades watermark detectability across state-of-the-art neural watermarking approaches (AudioSeal, TimbreWatermarking, WMCodec, WavMark, etc.)
Exposes a systematic overestimation of watermark robustness in current evaluations, which overlook deep learning-based adversarial transformations

🛡️ Threat Analysis

Output Integrity Attack

Self voice conversion is used as a watermark removal attack — it defeats content watermarks embedded in audio outputs to undermine provenance verification and content authentication. This is a direct attack on output integrity/content watermarking schemes, matching ML09's 'watermark removal attacks' criterion.

Details

Domains

audio

Model Types

transformer

Threat Tags

black_boxinference_timedigital

Applications

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

SegReConcat: A Data Augmentation Method for Voice Anonymization Attack

NE-PADD: Leveraging Named Entity Knowledge for Robust Partial Audio Deepfake Detection via Attention Aggregation

Why Speech Deepfake Detectors Won't Generalize: The Limits of Detection in an Open World

3-Tracer: A Tri-level Temporal-Aware Framework for Audio Forgery Detection and Localization

HierCon: Hierarchical Contrastive Attention for Audio Deepfake Detection

Probabilistic Verification of Voice Anti-Spoofing Models

Bona fide Cross Testing Reveals Weak Spot in Audio Deepfake Detection Systems

Speech DF Arena: A Leaderboard for Speech DeepFake Detection Models