Dara Bahri

Papers in Database (1)

defense · arXiv · Aug 18, 2025

Improving Detection of Watermarked Language Models

Dara Bahri, John Wieting · Google DeepMind

Hybrid LLM text detection combining watermark and classifier signals boosts accuracy from 75% to 95% on low-entropy prompts

Output Integrity Attack · nlp