Manuela Veloso

h-index: 1 8 citations 3 papers (total)

Papers in Database (1)

defense arXiv Dec 18, 2025 · Dec 2025

Perturb Your Data: Paraphrase-Guided Training Data Watermarking

Pranav Shetty, Mirazul Haque, Petr Babkin et al. · JPMorgan Chase & Co.

Paraphrase-based training data watermarking detects LLM training on copyrighted text even at 0.001% corpus presence

Output Integrity Attack nlp
PDF