Latest papers

3 papers
defense arXiv Feb 13, 2026 · 7w ago

Neighborhood Blending: A Lightweight Inference-Time Defense Against Membership Inference Attacks

Osama Zafar, Shaojie Zhan, Tianxi Ji et al. · Case Western Reserve University · Texas Tech University

Inference-time defense smooths model confidence outputs via DP neighborhood averaging to defeat membership inference attacks without retraining

Membership Inference Attack tabular
PDF
defense arXiv Feb 10, 2026 · 7w ago

A Behavioral Fingerprint for Large Language Models: Provenance Tracking via Refusal Vectors

Zhenyu Xu, Victor S. Sheng · Texas Tech University

Fingerprints LLMs for provenance tracking using internal refusal vectors, achieving 100% accuracy across 76 derivative models

Model Theft Model Theft nlp
PDF
benchmark arXiv Aug 12, 2025 · Aug 2025

Securing Agentic AI: Threat Modeling and Risk Analysis for Network Monitoring Agentic AI System

Pallavi Zambare, Venkata Nikhil Thanikella, Ying Liu · Texas Tech University

Applies MAESTRO threat framework to LLM network-monitoring agents, demonstrating memory poisoning and resource DoS with measurable degradation

Excessive Agency Prompt Injection nlp
PDF