A GIS-Based Artificial Neural Network Model for Spatial Distribution of Tuberculosis across the Continental United States

Despite the usefulness of artificial neural networks (ANNs) in the study of various complex problems, ANNs have not been applied for modeling the geographic distribution of tuberculosis (TB) in the US. Likewise, ecological level researches on TB incidence rate at the national level are inadequate fo...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of environmental research and public health Vol. 16; no. 1; p. 157
Main Authors Mollalo, Abolfazl, Mao, Liang, Rashidi, Parisa, Glass, Gregory E.
Format Journal Article
LanguageEnglish
Published Switzerland MDPI AG 01.01.2019
MDPI
Subjects
Online AccessGet full text
ISSN1660-4601
1661-7827
1660-4601
DOI10.3390/ijerph16010157

Cover

More Information
Summary:Despite the usefulness of artificial neural networks (ANNs) in the study of various complex problems, ANNs have not been applied for modeling the geographic distribution of tuberculosis (TB) in the US. Likewise, ecological level researches on TB incidence rate at the national level are inadequate for epidemiologic inferences. We collected 278 exploratory variables including environmental and a broad range of socio-economic features for modeling the disease across the continental US. The spatial pattern of the disease distribution was statistically evaluated using the global Moran’s I, Getis–Ord General G, and local Gi* statistics. Next, we investigated the applicability of multilayer perceptron (MLP) ANN for predicting the disease incidence. To avoid overfitting, L1 regularization was used before developing the models. Predictive performance of the MLP was compared with linear regression for test dataset using root mean square error, mean absolute error, and correlations between model output and ground truth. Results of clustering analysis showed that there is a significant spatial clustering of smoothed TB incidence rate (p < 0.05) and the hotspots were mainly located in the southern and southeastern parts of the country. Among the developed models, single hidden layer MLP had the best test accuracy. Sensitivity analysis of the MLP model showed that immigrant population (proportion), underserved segments of the population, and minimum temperature were among the factors with the strongest contributions. The findings of this study can provide useful insight to health authorities on prioritizing resource allocation to risk-prone areas.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:1660-4601
1661-7827
1660-4601
DOI:10.3390/ijerph16010157