Classification under uncertainty: data analysis for diagnostic antibody testing

Abstract Formulating accurate and robust classification strategies is a key challenge of developing diagnostic and antibody tests. Methods that do not explicitly account for disease prevalence and uncertainty therein can lead to significant classification errors. We present a novel method that lever...

Full description

Saved in:
Bibliographic Details
Published inMathematical medicine and biology Vol. 38; no. 3; pp. 396 - 416
Main Authors Patrone, Paul N, Kearsley, Anthony J
Format Journal Article
LanguageEnglish
Published Oxford University Press 01.09.2021
Subjects
Online AccessGet full text
ISSN1477-8599
1477-8602
1477-8602
DOI10.1093/imammb/dqab007

Cover

More Information
Summary:Abstract Formulating accurate and robust classification strategies is a key challenge of developing diagnostic and antibody tests. Methods that do not explicitly account for disease prevalence and uncertainty therein can lead to significant classification errors. We present a novel method that leverages optimal decision theory to address this problem. As a preliminary step, we develop an analysis that uses an assumed prevalence and conditional probability models of diagnostic measurement outcomes to define optimal (in the sense of minimizing rates of false positives and false negatives) classification domains. Critically, we demonstrate how this strategy can be generalized to a setting in which the prevalence is unknown by either (i) defining a third class of hold-out samples that require further testing or (ii) using an adaptive algorithm to estimate prevalence prior to defining classification domains. We also provide examples for a recently published SARS-CoV-2 serology test and discuss how measurement uncertainty (e.g. associated with instrumentation) can be incorporated into the analysis. We find that our new strategy decreases classification error by up to a decade relative to more traditional methods based on confidence intervals. Moreover, it establishes a theoretical foundation for generalizing techniques such as receiver operating characteristics by connecting them to the broader field of optimization.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1477-8599
1477-8602
1477-8602
DOI:10.1093/imammb/dqab007