Ishwar Balappanawar

Papers in Database (1)

benchmark arXiv Aug 9, 2025 · Aug 2025

Who's the Evil Twin? Differential Auditing for Undesired Behavior

Ishwar Balappanawar, Venkata Hasith Vattikuti, Greta Kintzley et al. · IIIT Hyderabad · University of Texas at Austin +1 more

Adversarial auditing game framework detects backdoored CNNs and misaligned LLMs using model diffing, gradients, and adversarial probing

Model Poisoning Prompt Injection visionnlp
PDF