Qin Lin

Papers in Database (1)

defense arXiv Mar 6, 2026 · 4w ago

Proof-of-Guardrail in AI Agents and What (Not) to Trust from It

Xisen Jin, Michael Duan, Qin Lin et al. · Sahara AI · University of Southern California

Proposes TEE-based cryptographic proof that AI agent responses passed a specific safety guardrail, preventing false safety claims

Output Integrity Attack Excessive Agency nlp
PDF Code