Zerui Cheng

h-index: 4 · 371 citations · 15 papers (total)

Papers in Database (2)

benchmark · arXiv · Sep 30, 2025

When Hallucination Costs Millions: Benchmarking AI Agents in High-Stakes Adversarial Financial Markets

Zeshi Dai, Zimo Peng, Zerui Cheng et al. · Surf AI · Princeton University

Benchmarks 17 LLM agents against adversarial financial misinformation, revealing systematic tool-selection failures and indirect prompt injection via SEO-poisoned web search

Prompt Injection · Excessive Agency · nlp
3 citations

defense · arXiv · Oct 15, 2025

Nondeterminism-Aware Optimistic Verification for Floating-Point Neural Networks

Jianzhu Yao, Hongxu Su, Taobo Liao et al. · Princeton University · HKUST (GZ) +1 more

Verifiable inference protocol for cloud ML that detects model swaps and computation tampering with 0.3% overhead using IEEE-754 bounds and Merkle-anchored dispute games

Output Integrity Attack · vision · nlp · generative
2 citations