Subramanyam Sahoo

h-index: 1 1 citations 3 papers (total)

Papers in Database (2)

defense arXiv Dec 15, 2025 · Dec 2025

The Double Life of Code World Models: Provably Unmasking Malicious Behavior Through Execution Traces

Subramanyam Sahoo · UC Berkeley

Defends against backdoored code-generating LLMs by checking execution trace consistency across semantically equivalent program variants

Model Poisoning nlp
PDF
benchmark arXiv Dec 25, 2025 · Dec 2025

The Deepfake Detective: Interpreting Neural Forensics Through Sparse Features and Manifolds

Subramanyam Sahoo, Jared Junkin · University of California · Johns Hopkins University

Interprets deepfake detector internals using sparse autoencoders and forensic manifold analysis on a 2B-parameter VLM

Output Integrity Attack visionmultimodal
PDF Code