benchmark 2026

Robustness Quantification and Uncertainty Quantification: Comparing Two Methods for Assessing the Reliability of Classifier Predictions

Adrián Detavernier, Jasper De Bock

0 citations


Published on arXiv

2603.22988

Input Manipulation Attack

OWASP ML Top 10 — ML01

Key Finding

Combined RQ+UQ approach achieves better reliability assessment than either method alone on benchmark datasets


We consider two approaches for assessing the reliability of the individual predictions of a classifier: Robustness Quantification (RQ) and Uncertainty Quantification (UQ). We explain the conceptual differences between the two approaches, compare them on a number of benchmark datasets, and show that RQ is capable of outperforming UQ, both in a standard setting and in the presence of distribution shift. Besides showing that RQ can be competitive with UQ, we also demonstrate the complementarity of RQ and UQ by showing that a combination of both approaches can lead to even better reliability assessments.
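
As a rough illustration of the idea only (not the paper's actual RQ or UQ machinery, which this summary does not detail), the hypothetical sketch below scores each test prediction with a simple uncertainty proxy (predictive entropy) and a simple robustness proxy (the smallest random-perturbation radius that flips the predicted label), combines the two into a single reliability score, and checks whether the most "reliable" predictions are indeed more often correct. All names and scoring rules are illustrative assumptions.

```python
# Hypothetical sketch: combining an uncertainty score (UQ) with a
# robustness score (RQ) to rank predictions by reliability.
# The proxies below are illustrative only, not the methods compared in the paper.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
proba = clf.predict_proba(X_te)

# UQ proxy: predictive entropy of the probability output (higher = less certain).
uq_score = -np.sum(proba * np.log(proba + 1e-12), axis=1)

# RQ proxy: smallest random-perturbation radius (from a fixed grid) at which
# the predicted label flips; a larger radius means a more robust prediction.
def robustness_radius(x, model, radii=(0.05, 0.1, 0.2, 0.4, 0.8), trials=50, seed=0):
    rng = np.random.default_rng(seed)
    base = model.predict(x[None, :])[0]
    for r in radii:
        noise = rng.normal(scale=r, size=(trials, x.size))
        if np.any(model.predict(x[None, :] + noise) != base):
            return r          # prediction flipped at this radius
    return radii[-1] * 2      # never flipped within the grid

rq_score = np.array([robustness_radius(x, clf) for x in X_te])

# Combined reliability: robust (large flip radius) and confident (low entropy).
reliability = rq_score / (1.0 + uq_score)

# Sanity check: accuracy on the most- vs. least-reliable halves of the test set.
order = np.argsort(-reliability)
half = len(order) // 2
correct = (clf.predict(X_te) == y_te)
print("accuracy, most reliable half :", correct[order[:half]].mean())
print("accuracy, least reliable half:", correct[order[half:]].mean())
```

If the combined score is informative, the most-reliable half should show noticeably higher accuracy than the least-reliable half; the paper's claim is that combining RQ and UQ signals improves this kind of separation over using either signal alone.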


Key Contributions

  • Comprehensive comparison of robustness quantification and uncertainty quantification on real benchmark datasets
  • Demonstrates that combining RQ and UQ leads to better reliability assessments than either alone
  • Shows RQ can outperform UQ in standard settings and under distribution shift

🛡️ Threat Analysis

Input Manipulation Attack

Robustness quantification measures how much perturbation an input can tolerate before the model's prediction changes. In effect, this evaluates adversarial robustness at inference time, even though the paper frames it as a reliability-assessment tool rather than proposing attacks or defenses.
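
As a toy illustration of "how much perturbation before the prediction changes" (not the paper's RQ measure), the sketch below computes, for a linear binary classifier, the exact minimal L2 perturbation that flips its output: the distance from the input to the decision hyperplane. The model and input here are made up for the example.

```python
# Hypothetical sketch: for a linear binary classifier sign(w @ x + b), the
# smallest L2 perturbation that changes the prediction is the distance from
# x to the decision hyperplane, |w @ x + b| / ||w||.
import numpy as np

rng = np.random.default_rng(0)
w, b = rng.normal(size=5), 0.3             # toy linear model (assumed)
x = rng.normal(size=5)                      # input whose prediction we assess

margin = w @ x + b
radius = abs(margin) / np.linalg.norm(w)    # minimal flipping perturbation (L2)

# Verify: step just past the boundary along the direction that reduces |margin|.
delta = -np.sign(margin) * (radius + 1e-6) * w / np.linalg.norm(w)
print("original prediction :", int(margin > 0))
print("flip radius (L2)    :", radius)
print("perturbed prediction:", int(w @ (x + delta) + b > 0))
```

For nonlinear models this distance is not available in closed form, which is why robustness is typically estimated empirically (e.g., by searching over perturbation radii, as in the earlier sketch).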


Details

Domains
vision, tabular
Model Types
traditional_ml
Threat Tags
inference_time
Datasets
MNIST, CIFAR-10
Applications
image classification, tabular classification