Muhammad Haris Khan

h-index: 2 15 citations 17 papers (total)

Papers in Database (3)

defense arXiv Dec 19, 2025 · Dec 2025

Key-Conditioned Orthonormal Transform Gating (K-OTG): Multi-Key Access Control with Hidden-State Scrambling for LoRA-Tuned Models

Muhammad Haris Khan · University of Copenhagen

Defends fine-tuned LLMs against unauthorized use via secret-key-conditioned orthonormal hidden-state scrambling in LoRA adapters

Model Theft Model Theft nlp
PDF
tool arXiv Feb 23, 2026 · 6w ago

Pixels Don't Lie (But Your Detector Might): Bootstrapping MLLM-as-a-Judge for Trustworthy Deepfake Detection and Reasoning Supervision

Kartik Kuckreja, Parul Gupta, Muhammad Haris Khan et al. · MBZUAI · Monash University

Builds an MLLM judge that evaluates reasoning fidelity of deepfake detectors, outperforming 30x larger baselines at 96.2% accuracy

Output Integrity Attack visionmultimodal
PDF Code
defense arXiv Dec 4, 2025 · Dec 2025

DAMASHA: Detecting AI in Mixed Adversarial Texts via Segmentation with Human-interpretable Attribution

L. D. M. S. Sai Teja, N. Siva Gopala Krishna, Ufaq Khan et al. · National Institute of Technology Silchar · BML Munjal University +1 more

Defends mixed-authorship AI text detectors against adversarial evasion using Info-Mask segmentation and interpretable stylometric attribution overlays

Output Integrity Attack Input Manipulation Attack nlp
PDF Code