Defense · 2025

EverTracer: Hunting Stolen Large Language Models via Stealthy and Robust Probabilistic Fingerprint

Zhenhua Xu 1, Meng Han 1,2, Wenpeng Xing 1,2


Published on arXiv: 2509.03058

Model Theft

OWASP ML Top 10 — ML05 · OWASP LLM Top 10 — LLM10

Key Finding

EverTracer achieves state-of-the-art stealthiness and robustness across multiple LLM architectures in gray-box settings, outperforming existing backdoor-based and optimization-based fingerprinting methods against adaptive adversaries.

EverTracer

Novel technique introduced


The proliferation of large language models (LLMs) has intensified concerns over model theft and license violations, necessitating robust and stealthy ownership verification. Existing fingerprinting methods either require impractical white-box access or introduce detectable statistical anomalies. We propose EverTracer, a novel gray-box fingerprinting framework that ensures stealthy and robust model provenance tracing. EverTracer is the first to repurpose Membership Inference Attacks (MIAs) for defensive use, embedding ownership signals via memorization instead of artificial trigger-output overfitting. It consists of Fingerprint Injection, which fine-tunes the model on any natural language data without detectable artifacts, and Verification, which leverages a calibrated probability-variation signal to distinguish fingerprinted models. This approach remains robust against adaptive adversaries, including both input-level and model-level modifications. Extensive experiments across architectures demonstrate EverTracer's state-of-the-art effectiveness, stealthiness, and resilience, establishing it as a practical solution for securing LLM intellectual property. Our code and data are publicly available at https://github.com/Xuzhenhua55/EverTracer.


Key Contributions

  • First framework to repurpose Membership Inference Attack mechanics as a defensive fingerprinting signal, embedding ownership via natural-language memorization instead of artificial trigger-output pairs
  • Gray-box fingerprinting that avoids perplexity-detectable statistical anomalies while remaining robust against input-level and model-level adversarial modifications (fine-tuning, merging, pruning)
  • Calibrated probability-variation verification signal that isolates memorization patterns from general data-frequency biases to confirm model provenance
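The verification idea above can be sketched in a few lines. The snippet below is a minimal, hypothetical illustration of a calibrated probability-variation check, not EverTracer's actual implementation: it assumes you have already extracted per-token log-probabilities of each fingerprint sample from the suspect model and from an independent reference model (the calibration step that factors out general data-frequency bias), and the function names, the score definition, and the thresholds are all illustrative assumptions.

```python
import statistics

def mean_token_logprob(token_logprobs):
    """Average per-token log-probability of one sample under one model."""
    return statistics.fmean(token_logprobs)

def calibrated_variation(suspect_lps, reference_lps):
    """Calibrated probability-variation score for one fingerprint sample:
    how much more likely the suspect model finds the sample than the
    reference model does. A large positive score suggests the suspect
    model memorized this sample during fingerprint injection.
    (Illustrative score definition, not the paper's exact statistic.)"""
    return mean_token_logprob(suspect_lps) - mean_token_logprob(reference_lps)

def verify_ownership(samples, score_threshold=0.5, vote_threshold=0.5):
    """samples: list of (suspect_token_logprobs, reference_token_logprobs)
    pairs, one pair per fingerprint sample. Declares a provenance match
    when a majority of samples show elevated calibrated likelihood.
    Both thresholds are hypothetical placeholders."""
    scores = [calibrated_variation(s, r) for s, r in samples]
    hits = sum(score > score_threshold for score in scores)
    return hits / len(samples) >= vote_threshold, scores
```

For example, a fingerprinted suspect model would assign its memorized samples noticeably higher token log-probabilities than the reference model, so their calibrated scores clear the threshold, while an unrelated model tracks the reference closely and fails the vote.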

🛡️ Threat Analysis

Model Theft

EverTracer embeds ownership fingerprints in the model itself via fine-tuning, then verifies ownership against suspect (stolen) model copies. This is direct model-IP protection and anti-theft fingerprinting: the watermark lives in the model weights, not in the content outputs.


Details

Domains
nlp
Model Types
llm, transformer
Threat Tags
grey_box, inference_time
Applications
llm intellectual property protection, model ownership verification, model provenance tracing