benchmark arXiv Mar 24, 2026 · 13d ago
Adrián Detavernier, Jasper De Bock · Ghent University
Compares robustness quantification and uncertainty quantification for assessing classifier reliability, showing combined approaches outperform either alone
Input Manipulation Attack visiontabular
We consider two approaches for assessing the reliability of the individual predictions of a classifier: Robustness Quantification (RQ) and Uncertainty Quantification (UQ). We explain the conceptual differences between the two approaches, compare both approaches on a number of benchmark datasets and show that RQ is capable of outperforming UQ, both in a standard setting and in the presence of distribution shift. Beside showing that RQ can be competitive with UQ, we also demonstrate the complementarity of RQ and UQ by showing that a combination of both approaches can lead to even better reliability assessments.
traditional_ml Ghent University