
Overview of PAN 2026: Voight-Kampff Generative AI Detection, Text Watermarking, Multi-Author Writing Style Analysis, Generative Plagiarism Detection, and Reasoning Trajectory Detection

Janek Bevendorff 1, Maik Fröbe 2, André Greiner-Petter 3, Andreas Jakoby 1, Maximilian Mayerl 4, Preslav Nakov 5, Henry Plutz 6, Martin Potthast 6,7,8, Benno Stein 1, Minh Ngoc Ta 5, Yuxia Wang 9, Eva Zangerle 10

0 citations · 24 references · arXiv (Cornell University)


Published on arXiv · 2602.09147

Output Integrity Attack

OWASP ML Top 10 — ML09

Prompt Injection

OWASP LLM Top 10 — LLM01

Key Finding

PAN 2026 organizes five shared evaluation tasks spanning AI-text detection, watermarking robustness, and LLM reasoning safety, continuing a tradition of 1,100+ reproducible submissions since 2012.


The goal of the PAN workshop is to advance computational stylometry and text forensics via objective and reproducible evaluation. In 2026, we run the following five tasks: (1) Voight-Kampff Generative AI Detection, particularly in mixed and obfuscated authorship scenarios; (2) Text Watermarking, a new task that aims to develop new text watermarking schemes and to benchmark the robustness of existing ones; (3) Multi-Author Writing Style Analysis, a continued task that aims to identify the positions at which authorship changes; (4) Generative Plagiarism Detection, a continued task that targets source retrieval and text alignment between generated text and source documents; and (5) Reasoning Trajectory Detection, a new task that deals with source detection and safety assessment of LLM-generated or human-written reasoning trajectories. As in previous years, PAN invites software submissions as easy-to-reproduce Docker containers for most of the tasks. Since PAN 2012, more than 1,100 submissions have been made this way via the TIRA experimentation platform.
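To make the watermarking task concrete, the following is a minimal sketch of a "green-list" watermark detector in the style of Kirchenbauer et al. (2023), one family of schemes whose robustness such a benchmark could probe. The toy vocabulary, the hash-based partition, and the function names (`green_list`, `green_fraction`) are illustrative assumptions, not part of any PAN baseline.

```python
import hashlib

def green_list(prev_token: str, vocab: list[str], gamma: float = 0.5) -> set[str]:
    """Deterministically partition the vocabulary into a 'green' subset
    seeded by the previous token; gamma is the green fraction."""
    def score(tok: str) -> int:
        h = hashlib.sha256(f"{prev_token}|{tok}".encode()).hexdigest()
        return int(h, 16)
    ranked = sorted(vocab, key=score)
    return set(ranked[: int(gamma * len(vocab))])

def green_fraction(tokens: list[str], vocab: list[str], gamma: float = 0.5) -> float:
    """Fraction of tokens that land in the green list of their predecessor.
    Watermarked text should score well above gamma; unwatermarked text
    should hover near gamma."""
    pairs = list(zip(tokens, tokens[1:]))
    if not pairs:
        return 0.0
    hits = sum(1 for prev, tok in pairs if tok in green_list(prev, vocab, gamma))
    return hits / len(pairs)
```

A robustness benchmark would then apply obfuscations (paraphrasing, token substitution) and measure how far the green fraction of watermarked text degrades toward the unwatermarked baseline.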


Key Contributions

  • Introduces a new text watermarking shared task that benchmarks robustness of existing watermarking schemes against obfuscation
  • Introduces Reasoning Trajectory Detection, a new task for source and safety detection of LLM-generated or human-written reasoning chains
  • Continues established AI-generated content detection and multi-author stylometry tasks under reproducible Docker-based evaluation via the TIRA platform
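As an illustration of the style-change task named above, the sketch below flags candidate authorship-change positions by comparing crude stylometric features of consecutive paragraphs. The features (average word length, type-token ratio, comma rate), the threshold, and the names `style_vector` and `change_points` are all hypothetical stand-ins for a real stylometric model.

```python
import re
from math import sqrt

def style_vector(paragraph: str) -> dict[str, float]:
    """Map a paragraph to a few simple stylometric features."""
    words = re.findall(r"[A-Za-z']+", paragraph.lower())
    if not words:
        return {}
    return {
        "avg_word_len": sum(len(w) for w in words) / len(words),
        "ttr": len(set(words)) / len(words),        # type-token ratio
        "comma_rate": paragraph.count(",") / len(words),
    }

def distance(a: dict[str, float], b: dict[str, float]) -> float:
    """Euclidean distance over the union of feature keys."""
    keys = set(a) | set(b)
    return sqrt(sum((a.get(k, 0.0) - b.get(k, 0.0)) ** 2 for k in keys))

def change_points(paragraphs: list[str], threshold: float = 0.35) -> list[int]:
    """Indices i where paragraph i reads stylistically unlike paragraph i-1."""
    vecs = [style_vector(p) for p in paragraphs]
    return [i for i in range(1, len(vecs))
            if distance(vecs[i - 1], vecs[i]) > threshold]
```

Real submissions replace these hand-picked features with learned representations, but the interface (paragraphs in, change positions out) matches the task's framing.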

🛡️ Threat Analysis

Output Integrity Attack

Three of the five tasks directly target output integrity: Voight-Kampff AI-generated text detection, a new text watermarking robustness benchmarking task, and generative plagiarism detection — all concern authenticating, watermarking, or detecting AI-generated content provenance.


Details

Domains
nlp, generative
Model Types
llm, transformer
Threat Tags
inference_time
Datasets
TIRA
Applications
ai-generated text detection, text watermarking, authorship attribution, plagiarism detection, llm reasoning safety