ML Security Papers

defense 2026

Adversarial Robustness of NTK Neural Networks

Yuxuan Hou ^1,2

¹ Qiuzhen College

² Tsinghua University

0 citations

α

Published on arXiv

2604.25965

Input Manipulation Attack

OWASP ML Top 10 — ML01

Key Finding

NTK networks with early stopping achieve minimax optimal adversarial robustness rates in Sobolev spaces, while overfitted minimum norm interpolants are provably vulnerable

NTK with Early Stopping

Novel technique introduced

Deep learning models are widely deployed in safety-critical domains, but remain vulnerable to adversarial attacks. In this paper, we study the adversarial robustness of NTK neural networks in the context of nonparametric regression. We establish minimax optimal rates for adversarial regression in Sobolev spaces and then show that NTK neural networks, trained via gradient flow with early stopping, can achieve this optimal rate. However, in the overfitting regime, we prove that the minimum norm interpolant is vulnerable to adversarial perturbations.

Key Contributions

Establishes minimax optimal rates for adversarial regression in Sobolev spaces
Proves NTK neural networks with gradient flow and early stopping achieve optimal adversarial robustness
Demonstrates that minimum norm interpolants (overfitting regime) are provably vulnerable to adversarial perturbations

🛡️ Threat Analysis

Input Manipulation Attack

Paper analyzes adversarial robustness of neural networks against input perturbations in the adversarial regression setting, establishing both defense guarantees (early stopping achieves optimal robustness) and vulnerability results (overfitting regime is vulnerable).

Details

Domains

tabular

Model Types

traditional_ml

Threat Tags

inference_timeuntargeted

Applications

nonparametric regression

Similar Papers

Countering adversarial evasion in regression analysis

Input Manipulation Attack

ROAST: Risk-aware Outlier-exposure for Adversarial Selective Training of Anomaly Detectors Against Evasion Attacks

Input Manipulation Attack

Adversarial Robustness in One-Stage Learning-to-Defer

Input Manipulation Attack

DeepTrust: Multi-Step Classification through Dissimilar Adversarial Representations for Robust Android Malware Detection

Input Manipulation Attack

Demystifying the Role of Rule-based Detection in AI Systems for Windows Malware Detection

Input Manipulation Attack

Robustness, Cost, and Attack-Surface Concentration in Phishing Detection

Input Manipulation Attack

Bilevel Models for Adversarial Learning and A Case Study

Input Manipulation Attack

Mitigating Evasion Attacks in Fog Computing Resource Provisioning Through Proactive Hardening

Input Manipulation Attack