Nishith Shankar

benchmark arXiv Sep 10, 2025 · Sep 2025

Maheep Chaudhary, Ian Su, Nikhil Hooda et al. · Independent · University of California +6 more

Discovers power-law scaling of LLM evaluation awareness across 15 models, forecasting deceptive capability concealment in larger models

Prompt Injection nlp

Papers in Database (1)