Nithin Nagaraj

Papers in Database (1)

defense arXiv Feb 6, 2026 · 9w ago

Dependable Artificial Intelligence with Reliability and Security (DAIReS): A Unified Syndrome Decoding Approach for Hallucination and Backdoor Trigger Detection

Hema Karnam Surendrababu, Nithin Nagaraj · National Institute of Advanced Studies

Syndrome decoding unified framework detects both backdoor-poisoned training samples and LLM hallucinations in NLP embedding space

Model Poisoning Output Integrity Attack nlp
PDF