Computer-aided diagnosis with potential application to rapid detection of disease outbreaks

Our objectives are to quickly interpret symptoms of emergency patients to identify likely syndromes and to improve population‐wide disease outbreak detection. We constructed a database of 248 syndromes, each syndrome having an estimated probability of producing any of 85 symptoms, with some two‐way,...

Full description

Saved in:

Bibliographic Details
Published in	Statistics in medicine Vol. 26; no. 8; pp. 1857 - 1874
Main Authors	Burr, Tom, Koster, Frederick, Picard, Rick, Forslund, Dave, Wokoun, Doug, Joyce, Ed, Brillman, Judith, Froman, Phil, Lee, Jack
Format	Journal Article
Language	English
Published	Chichester, UK John Wiley & Sons, Ltd 15.04.2007 Wiley Subscription Services, Inc
Subjects	Algorithms Anthrax - diagnosis Bayes classifier Bayesian analysis biosurveillance Bioterrorism Computer Simulation Diagnosis, Computer-Assisted - methods Disease Outbreaks Epidemics Hantavirus Infections - diagnosis Humans iterative proportional fitting Medical diagnosis misdiagnosis rates Sensitivity and Specificity Simulation
Online Access	Get full text
ISSN	0277-6715 1097-0258
DOI	10.1002/sim.2798

Cover

More Information
Summary:	Our objectives are to quickly interpret symptoms of emergency patients to identify likely syndromes and to improve population‐wide disease outbreak detection. We constructed a database of 248 syndromes, each syndrome having an estimated probability of producing any of 85 symptoms, with some two‐way, three‐way, and five‐way probabilities reflecting correlations among symptoms. Using these multi‐way probabilities in conjunction with an iterative proportional fitting algorithm allows estimation of full conditional probabilities. Combining these conditional probabilities with misdiagnosis error rates and incidence rates via Bayes theorem, the probability of each syndrome is estimated. We tested a prototype of computer‐aided differential diagnosis (CADDY) on simulated data and on more than 100 real cases, including West Nile Virus, Q fever, SARS, anthrax, plague, tularaemia and toxic shock cases. We conclude that: (1) it is important to determine whether the unrecorded positive status of a symptom means that the status is negative or that the status is unknown; (2) inclusion of misdiagnosis error rates produces more realistic results; (3) the naive Bayes classifier, which assumes all symptoms behave independently, is slightly outperformed by CADDY, which includes available multi‐symptom information on correlations; as more information regarding symptom correlations becomes available, the advantage of CADDY over the naive Bayes classifier should increase; (4) overlooking low‐probability, high‐consequence events is less likely if the standard output summary is augmented with a list of rare syndromes that are consistent with observed symptoms, and (5) accumulating patient‐level probabilities across a larger population can aid in biosurveillance for disease outbreaks. Copyright © 2007 John Wiley & Sons, Ltd.
Bibliography:	ark:/67375/WNG-RG6MNJTZ-0 Los Alamos National Laboratory's Directed Research Program istex:0098B991DC9362FC6A87D19BC8369DF223E9B8D6 ArticleID:SIM2798 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-1 ObjectType-Feature-2 content type line 23
ISSN:	0277-6715 1097-0258
DOI:	10.1002/sim.2798