Farinaz Koushanfar

defense arXiv Jan 5, 2026 · Jan 2026

SWaRL: Safeguard Code Watermarking via Reinforcement Learning

Neusha Javidnia, Ruisi Zhang, Ashish Kundu et al. · University of California · Cisco Research

RL-trained LoRA adapters embed detectable watermarks in code LLM outputs, resisting refactoring and adversarial removal attacks

Output Integrity Attack nlp

PDF

defense arXiv Feb 9, 2026 · 8w ago

CryptoGen: Secure Transformer Generation with Encrypted KV-Cache Reuse

Hedong Zhang, Neusha Javidnia, Shweta Pardeshi et al. · University of Central Florida · University of California

Cryptographic HE+MPC system enabling privacy-preserving autoregressive LLM inference that protects both user prompts and model weights from semi-honest adversaries

Model Theft nlp

PDF

survey arXiv Feb 6, 2026 · 8w ago

Trojans in Artificial Intelligence (TrojAI) Final Report

Kristopher W. Reese, Taylor Kulp-McDowall, Michael Majurski et al. · IARPA · NIST +13 more

Surveys IARPA TrojAI program findings on AI backdoor detection via weight analysis and trigger inversion across multi-year research

Model Poisoning visionnlp

PDF

Papers in Database (3)

SWaRL: Safeguard Code Watermarking via Reinforcement Learning

CryptoGen: Secure Transformer Generation with Encrypted KV-Cache Reuse

Trojans in Artificial Intelligence (TrojAI) Final Report