Mitigating site effects in covariance for machine learning in neuroimaging data

To acquire larger samples for answering complex questions in neuroscience, researchers have increasingly turned to multi‐site neuroimaging studies. However, these studies are hindered by differences in images acquired across multiple sites. These effects have been shown to bias comparison between si...

Full description

Saved in:
Bibliographic Details
Published inHuman brain mapping Vol. 43; no. 4; pp. 1179 - 1195
Main Authors Chen, Andrew A., Beer, Joanne C., Tustison, Nicholas J., Cook, Philip A., Shinohara, Russell T., Shou, Haochang
Format Journal Article
LanguageEnglish
Published Hoboken, USA John Wiley & Sons, Inc 01.03.2022
Subjects
Online AccessGet full text
ISSN1065-9471
1097-0193
1097-0193
DOI10.1002/hbm.25688

Cover

More Information
Summary:To acquire larger samples for answering complex questions in neuroscience, researchers have increasingly turned to multi‐site neuroimaging studies. However, these studies are hindered by differences in images acquired across multiple sites. These effects have been shown to bias comparison between sites, mask biologically meaningful associations, and even introduce spurious associations. To address this, the field has focused on harmonizing data by removing site‐related effects in the mean and variance of measurements. Contemporaneously with the increase in popularity of multi‐center imaging, the use of machine learning (ML) in neuroimaging has also become commonplace. These approaches have been shown to provide improved sensitivity, specificity, and power due to their modeling the joint relationship across measurements in the brain. In this work, we demonstrate that methods for removing site effects in mean and variance may not be sufficient for ML. This stems from the fact that such methods fail to address how correlations between measurements can vary across sites. Data from the Alzheimer's Disease Neuroimaging Initiative is used to show that considerable differences in covariance exist across sites and that popular harmonization techniques do not address this issue. We then propose a novel harmonization method called Correcting Covariance Batch Effects (CovBat) that removes site effects in mean, variance, and covariance. We apply CovBat and show that within‐site correlation matrices are successfully harmonized. Furthermore, we find that ML methods are unable to distinguish scanner manufacturer after our proposed harmonization is applied, and that the CovBat‐harmonized data retain accurate prediction of disease group. Multi‐site neuroimaging studies are hindered by differences in images acquired across multiple sites, often referred to as site effects. In this work, we demonstrate that methods for removing site effects in mean and variance may not be sufficient for machine learning. After applying our proposed harmonization method CovBat, we find that machine learning methods are unable to distinguish scanner manufacturer after our proposed harmonization is applied, and that the CovBat‐harmonized data retain accurate prediction of disease group.
Bibliography:Funding information
National Institute of Neurological Disorders and Stroke, Grant/Award Numbers: R01 NS060910, R01 NS085211; University of Pennsylvania Center for Biomedical Image Computing and Analytics; Transition Therapeutics; Takeda Pharmaceutical Company; Servier; Piramal Imaging; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Neurotrack Technologies; NeuroRx Research; Meso Scale Diagnostics, LLC.; Merck & Co., Inc.; Lundbeck; Lumosity; Johnson & Johnson Pharmaceutical Research & Development LLC.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; IXICO Ltd; GE Healthcare; Fujirebio; Genentech, Inc.; F. Hoffmann‐La Roche Ltd; EuroImmun; Eli Lilly and Company; Elan Pharmaceuticals, Inc.; Cogstate; Eisai Inc; CereSpir, Inc.; Biogen; Bristol‐Myers Squibb Company; BioClinica, Inc.; Araclon Biotech; Alzheimer's Association; Alzheimer's Drug Discovery Foundation; AbbVie; National Institute of Biomedical Imaging and Bioengineering; National Institute on Aging; Department of Defense, Grant/Award Number: W81XWH‐12‐2‐0012; National Institutes of Health, Grant/Award Number: U01 AG024904; Alzheimer's Disease Neuroimaging Initiative (ADNI); National Multiple Sclerosis Society
Data used in preparation of this article were obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at
http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf
.
Russell T. Shinohara and Haochang Shou contributed equally to this work.
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
Data used in preparation of this article were obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at: http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf.
Funding information National Institute of Neurological Disorders and Stroke, Grant/Award Numbers: R01 NS060910, R01 NS085211; University of Pennsylvania Center for Biomedical Image Computing and Analytics; Transition Therapeutics; Takeda Pharmaceutical Company; Servier; Piramal Imaging; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Neurotrack Technologies; NeuroRx Research; Meso Scale Diagnostics, LLC.; Merck & Co., Inc.; Lundbeck; Lumosity; Johnson & Johnson Pharmaceutical Research & Development LLC.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; IXICO Ltd; GE Healthcare; Fujirebio; Genentech, Inc.; F. Hoffmann‐La Roche Ltd; EuroImmun; Eli Lilly and Company; Elan Pharmaceuticals, Inc.; Cogstate; Eisai Inc; CereSpir, Inc.; Biogen; Bristol‐Myers Squibb Company; BioClinica, Inc.; Araclon Biotech; Alzheimer's Association; Alzheimer's Drug Discovery Foundation; AbbVie; National Institute of Biomedical Imaging and Bioengineering; National Institute on Aging; Department of Defense, Grant/Award Number: W81XWH‐12‐2‐0012; National Institutes of Health, Grant/Award Number: U01 AG024904; Alzheimer's Disease Neuroimaging Initiative (ADNI); National Multiple Sclerosis Society
ISSN:1065-9471
1097-0193
1097-0193
DOI:10.1002/hbm.25688