Statistical Methods for Handling Unwanted Variation in Metabolomics Data

Metabolomics experiments are inevitably subject to a component of unwanted variation, due to factors such as batch effects, long runs of samples, and confounding biological variation. Although the removal of this unwanted variation is a vital step in the analysis of metabolomics data, it is consider...

Full description

Saved in:
Bibliographic Details
Published inAnalytical chemistry (Washington) Vol. 87; no. 7; pp. 3606 - 3615
Main Authors Livera, Alysha M. De, Sysi-Aho, Marko, Jacob, Laurent, Gagnon-Bartsch, Johann A, Castillo, Sandra, Simpson, Julie A, Speed, Terence P
Format Journal Article
LanguageEnglish
Published United States American Chemical Society 07.04.2015
Subjects
Online AccessGet full text
ISSN0003-2700
1520-6882
1520-6882
DOI10.1021/ac502439y

Cover

More Information
Summary:Metabolomics experiments are inevitably subject to a component of unwanted variation, due to factors such as batch effects, long runs of samples, and confounding biological variation. Although the removal of this unwanted variation is a vital step in the analysis of metabolomics data, it is considered a gray area in which there is a recognized need to develop a better understanding of the procedures and statistical methods required to achieve statistically relevant optimal biological outcomes. In this paper, we discuss the causes of unwanted variation in metabolomics experiments, review commonly used metabolomics approaches for handling this unwanted variation, and present a statistical approach for the removal of unwanted variation to obtain normalized metabolomics data. The advantages and performance of the approach relative to several widely used metabolomics normalization approaches are illustrated through two metabolomics studies, and recommendations are provided for choosing and assessing the most suitable normalization method for a given metabolomics experiment. Software for the approach is made freely available.
Bibliography:SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-1
ObjectType-Feature-2
content type line 23
PMCID: PMC4544854
Department of Statistics, University of California, Berkeley, USA
Biostatistics Unit, Centre for Epidemiology and Biostatistics, University of Melbourne, VIC 3800, Australia
Zora Biosciences Oy, FIN-02150 Espoo, Finland
VTT Technical Research Centre of Finland, Finland
Bioinformatics Division, Walter and Eliza Hall Institute
Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, INRA, UMR5558, Villeurbanne, France
Department of Mathematics and Statistics, University of Melbourne, VIC 3800, Australia
ISSN:0003-2700
1520-6882
1520-6882
DOI:10.1021/ac502439y