On Measures of Uncertainty in Classification

Uncertainty is unavoidable in classification tasks and might originate from data (e.g., due to noise or wrong labeling), or the model (e.g., due to erroneous assumptions, etc). Providing an assessment of uncertainty associated with each outcome is of paramount importance in assessing the reliability...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on signal processing Vol. 71; pp. 3710 - 3725
Main Authors Chlaily, Saloua, Ratha, Debanshu, Lozou, Pigi, Marinoni, Andrea
Format Journal Article
LanguageEnglish
Published New York IEEE 2023
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text
ISSN1053-587X
1941-0476
1941-0476
DOI10.1109/TSP.2023.3322843

Cover

More Information
Summary:Uncertainty is unavoidable in classification tasks and might originate from data (e.g., due to noise or wrong labeling), or the model (e.g., due to erroneous assumptions, etc). Providing an assessment of uncertainty associated with each outcome is of paramount importance in assessing the reliability of classification algorithms, especially on unseen data. In this work, we propose two measures of uncertainty in classification. One of the measures is developed from a geometrical perspective and quantifies a classifier's distance from a random guess. In contrast, the second proposed uncertainty measure is homophily-based since it takes into account the similarity between the classes. Accordingly, it reflects the type of mistaken classes. The proposed measures are not aggregated, i.e., they provide an uncertainty assessment to each data point. Moreover, they do not require label information. Using several datasets, we demonstrate the proposed measures' differences and merit in assessing uncertainty in classification. The source code is available at github.com/pioui/uncertainty .
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
IEEE Transactions on Signal Processing
ISSN:1053-587X
1941-0476
1941-0476
DOI:10.1109/TSP.2023.3322843