Niv Cohen

defense arXiv Oct 11, 2025 · Oct 2025

SimKey: A Semantically Aware Key Module for Watermarking Language Models

Shingo Kodama, Haya Diwan, Lucas Rosenblatt et al. · Middlebury College · New York University +1 more

Semantic, LSH-based key module that makes LLM text watermarks robust to paraphrasing while preventing false attribution of harmful content

Output Integrity Attack nlp

1 citation PDF Code

attack arXiv Feb 22, 2026 · 6w ago

Understanding Empirical Unlearning with Combinatorial Interpretability

Shingo Kodama, Niv Cohen, Micah Adler et al. · Middlebury College · New York University +2 more

Attacks machine unlearning methods using combinatorial interpretability, showing erased knowledge persists in weights and recovers rapidly via fine-tuning

Model Inversion Attack nlp vision

PDF

Papers in Database (2)

SimKey: A Semantically Aware Key Module for Watermarking Language Models

Understanding Empirical Unlearning with Combinatorial Interpretability