Incentive-Aligned Multi-Source LLM Summaries
Yanchen Jiang 1,2, Zhe Feng 2, Aranyak Mehta 2
Published on arXiv
arXiv:2509.25184
Prompt Injection
OWASP LLM Top 10 — LLM01
Key Finding
TTS improves factual accuracy and robustness against adversarial and strategic content injection while preserving fluency. Formal guarantees show that, under its peer-prediction scoring mechanism, truthful reporting is the utility-maximizing strategy for each source.
Truthful Text Summarization (TTS)
Novel technique introduced
Large language models (LLMs) are increasingly used in modern search and answer systems to synthesize multiple, sometimes conflicting, texts into a single response, yet current pipelines offer weak incentives for sources to be accurate and are vulnerable to adversarial content. We introduce Truthful Text Summarization (TTS), an incentive-aligned framework that improves factual robustness without ground-truth labels. TTS (i) decomposes a draft synthesis into atomic claims, (ii) elicits each source's stance on every claim, (iii) scores sources with an adapted multi-task peer-prediction mechanism that rewards informative agreement, and (iv) filters unreliable sources before re-summarizing. We establish formal guarantees that align a source's incentives with informative honesty, making truthful reporting the utility-maximizing strategy. Experiments show that TTS improves factual accuracy and robustness while preserving fluency, aligning exposure with informative corroboration and disincentivizing manipulation.
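Steps (ii)–(iii) can be sketched with a simplified correlated-agreement style peer-prediction score: a source is rewarded for agreeing with peers on the same claim and penalized for its rate of agreement on unrelated claims, which nets out blind or chance agreement. The function name, the ±1/0 stance encoding, and the all-pairs averaging are illustrative assumptions, not the paper's exact mechanism.

```python
def ca_scores(stances):
    """Simplified multi-task peer-prediction (correlated-agreement) scores.

    `stances` maps each source to its stance on every atomic claim
    (+1 support, -1 contradict, 0 abstain).  Bonus: agreement with peers
    on the same claim.  Penalty: agreement rate with peers on *different*
    claims, an estimate of chance agreement.  (Illustrative sketch only.)
    """
    sources = list(stances)
    m = len(next(iter(stances.values())))  # number of atomic claims
    scores = {}
    for i in sources:
        peers = [j for j in sources if j != i]
        bonus = penalty = 0.0
        for j in peers:
            for c in range(m):
                bonus += float(stances[i][c] == stances[j][c])
                penalty += sum(stances[i][c] == stances[j][c2]
                               for c2 in range(m) if c2 != c)
        scores[i] = (bonus / (len(peers) * m)
                     - penalty / (len(peers) * m * (m - 1)))
    return scores

# Three honest sources versus one source that flips every stance:
honest = [1, 1, -1, 1, -1, -1]
reports = {"A": honest, "B": honest, "C": honest,
           "D": [-s for s in honest]}
scores = ca_scores(reports)  # honest sources score 0.2, the flipper -0.6
```

With identical honest reports the flipper's bonus is zero while its chance-agreement penalty stays positive, so its score goes negative and it would be filtered before re-summarization.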
Key Contributions
- Truthful Text Summarization (TTS) pipeline that decomposes a draft synthesis into atomic claims and scores sources via a leave-one-out multi-task peer-prediction mechanism, preventing adversarial sources from influencing their own evaluation set
- Formal incentive-alignment guarantees (informed and strong truthfulness) showing truthful reporting is the utility-maximizing strategy for sources, with finite-sample convergence bounds
- Empirical demonstration that TTS improves factual accuracy and robustness against hallucinations and adversarial/strategic prompt injection compared to majority-style and LLM-centric baselines
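The leave-one-out evaluation set from the first contribution can be sketched as follows: each source is scored only on claims extracted from other sources' documents, and low scorers are dropped before the summary is regenerated. The `origin` annotation, field names, and zero threshold are illustrative assumptions.

```python
def leave_one_out_eval_sets(claims):
    """Map each source to the claim indices it is evaluated on: every
    atomic claim except those extracted from that source's own document,
    so a source cannot seed the draft with claims only it will confirm.
    (Sketch; the `origin` field is an assumed per-claim annotation.)"""
    sources = {c["origin"] for c in claims}
    return {s: [i for i, c in enumerate(claims) if c["origin"] != s]
            for s in sources}

def filter_sources(scores, threshold=0.0):
    """Keep sources whose peer-prediction score clears the threshold;
    the final summary is regenerated from the survivors only."""
    return {s for s, v in scores.items() if v >= threshold}

claims = [{"origin": "A", "text": "X was founded in 2001."},
          {"origin": "B", "text": "X is headquartered in Oslo."},
          {"origin": "D", "text": "X recalled all products."}]
eval_sets = leave_one_out_eval_sets(claims)  # "A" is scored on claims 1, 2
kept = filter_sources({"A": 0.2, "B": 0.2, "D": -0.6})  # {"A", "B"}
```

Excluding a source's own claims from its evaluation set is what blocks the self-corroboration attack: an injected claim can raise its author's exposure only if independent sources also endorse it.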