DSIPA: Detecting LLM-Generated Texts via Sentiment-Invariant Patterns Divergence Analysis

The rapid advancement of large language models (LLMs) presents new security challenges, particularly in detecting machine-generated text used for misinformation, impersonation, and content forgery. Most existing detection approaches struggle with robustness against adversarial perturbation, paraphrasing attacks, and domain shifts, often requiring restrictive access to model parameters or large labeled datasets. To address this, we propose DSIPA, a novel training-free framework that detects LLM-generated content by quantifying sentiment distributional stability under controlled stylistic variation. It is based on the observation that LLMs typically exhibit more emotionally consistent outputs, while human-written texts display greater affective variation. Our framework operates in a zero-shot, black-box manner, leveraging two unsupervised metrics, sentiment distribution consistency and sentiment distribution preservation, to capture these intrinsic behavioral asymmetries without the need for parameter updates or probability access. Extensive experiments are conducted on state-of-the-art proprietary and open-source models, including GPT-5.2, Gemini-1.5-pro, Claude-3, and LLaMa-3.3. Evaluations on five domains, such as news articles, programming code, student essays, academic papers, and community comments, demonstrate that DSIPA improves F1 detection scores by up to 49.89% over baseline methods. The framework exhibits superior generalizability across domains and strong resilience to adversarial conditions, providing a robust and interpretable behavioral signal for secure content identification in the evolving LLM landscape.

Key Contributions

Training-free, zero-shot detection framework using sentiment distribution consistency and preservation metrics
Works in black-box setting without requiring model parameters or probability access
Demonstrates superior generalization across 5 domains and resilience to adversarial perturbations

🛡️ Threat Analysis

Output Integrity Attack

Primary contribution is detecting AI-generated text (news, essays, code, papers, comments) via sentiment analysis — this is output integrity and content provenance detection.

Details

Domains

nlp

Model Types

llmtransformer

Threat Tags

black_boxinference_time

Datasets

news articlesprogramming codestudent essaysacademic paperscommunity comments

Applications

2026 0 cit.

Output Integrity Attack

90%

DSIPA: Detecting LLM-Generated Texts via Sentiment-Invariant Patterns Divergence Analysis

Key Contributions

🛡️ Threat Analysis

Details

Similar Papers

Diversity Boosts AI-Generated Text Detection

GPTZero: Robust Detection of LLM-Generated Texts

Zero-Shot Statistical Tests for LLM-Generated Text Detection using Finite Sample Concentration Inequalities

SOGPTSpotter: Detecting ChatGPT-Generated Answers on Stack Overflow

Reasoning-Aware AIGC Detection via Alignment and Reinforcement

Multi-Hierarchical Feature Detection for Large Language Model Generated Text

SpecDetect: Simple, Fast, and Training-Free Detection of LLM-Generated Text via Spectral Analysis

On the Effectiveness of LLM-Specific Fine-Tuning for Detecting AI-Generated Text