benchmark 2026

LJ-Spoof: A Generatively Varied Corpus for Audio Anti-Spoofing and Synthesis Source Tracing

Surya Subramani , Hashim Ali , Hafiz Malik

0 citations · 20 references · arXiv

α

Published on arXiv

2601.07958

Output Integrity Attack

OWASP ML Top 10 — ML09

Key Finding

LJ-Spoof surpasses all prior single- and multi-speaker corpora in generative diversity, providing 500 variant subsets across 30 TTS families and 2.8M+ utterances to enable robust speaker-conditioned anti-spoofing and fine-grained source tracing benchmarking.

LJ-Spoof

Novel technique introduced


Speaker-specific anti-spoofing and synthesis-source tracing are central challenges in audio anti-spoofing. Progress has been hampered by the lack of datasets that systematically vary model architectures, synthesis pipelines, and generative parameters. To address this gap, we introduce LJ-Spoof, a speaker-specific, generatively diverse corpus that systematically varies prosody, vocoders, generative hyperparameters, bona fide prompt sources, training regimes, and neural post-processing. The corpus spans one speakers-including studio-quality recordings-30 TTS families, 500 generatively variant subsets, 10 bona fide neural-processing variants, and more than 3 million utterances. This variation-dense design enables robust speaker-conditioned anti-spoofing and fine-grained synthesis-source tracing. We further position this dataset as both a practical reference training resource and a benchmark evaluation suite for anti-spoofing and source tracing.


Key Contributions

  • LJ-Spoof corpus: 3M+ utterances from one speaker across 30 TTS families, 500 generatively variant subsets, systematically varying vocoders, prosody, hyperparameters, codec resynthesis, and neural post-processing
  • First single-speaker anti-spoofing corpus providing comprehensive generative variation (VocVar, GenVar, ReVoc, ReCod, FWVar) simultaneously, enabling controlled speaker-conditioned anti-spoofing research
  • Dual-purpose resource: reference training corpus for building robust anti-spoofing models AND a benchmark evaluation suite for fine-grained synthesis source tracing

🛡️ Threat Analysis

Output Integrity Attack

The dataset is purpose-built for audio anti-spoofing (detecting AI-generated/deepfake speech) and synthesis source tracing (identifying which generative model produced an audio sample) — both are output integrity and AI-generated content detection tasks directly within ML09.


Details

Domains
audiogenerative
Model Types
rnntransformerdiffusion
Threat Tags
inference_time
Datasets
LJSpeechASVspoof2019ASVspoof2021ASVspoof5WaveFakeCodecFake-OmniDFADDDiffSSDMLAAD
Applications
audio anti-spoofingspeaker verificationsynthesis source tracingdeepfake speech detection