Multi-Block Sparse Functional Principal Components Analysis for Longitudinal Microbiome Multi-Omics Data
Microbiome researchers often need to model the temporal dynamics of multiple complex, nonlinear outcome trajectories simultaneously. This motivates our development of multivariate Sparse Functional Principal Components Analysis (mSFPCA), extending existing SFPCA methods to simultaneously characteriz...
        Saved in:
      
    
          | Main Authors | , , , , , | 
|---|---|
| Format | Journal Article | 
| Language | English | 
| Published | 
          
        29.01.2021
     | 
| Subjects | |
| Online Access | Get full text | 
| DOI | 10.48550/arxiv.2102.00067 | 
Cover
| Summary: | Microbiome researchers often need to model the temporal dynamics of multiple
complex, nonlinear outcome trajectories simultaneously. This motivates our
development of multivariate Sparse Functional Principal Components Analysis
(mSFPCA), extending existing SFPCA methods to simultaneously characterize
multiple temporal trajectories and their inter-relationships. As with existing
SFPCA methods, the mSFPCA algorithm characterizes each trajectory as a smooth
mean plus a weighted combination of the smooth major modes of variation about
the mean, where the weights are given by the component scores for each subject.
Unlike existing SFPCA methods, the mSFPCA algorithm allows estimation of
multiple trajectories simultaneously, such that the component scores, which are
constrained to be independent within a particular outcome for identifiability,
may be arbitrarily correlated with component scores for other outcomes. A
Cholesky decomposition is used to estimate the component score covariance
matrix efficiently and guarantee positive semi-definiteness given these
constraints. Mutual information is used to assess the strength of marginal and
conditional temporal associations across outcome trajectories. Importantly, we
implement mSFPCA as a Bayesian algorithm using R and stan, enabling easy use of
packages such as PSIS-LOO for model selection and graphical posterior
predictive checks to assess the validity of mSFPCA models. Although we focus on
application of mSFPCA to microbiome data in this paper, the mSFPCA model is of
general utility and can be used in a wide range of real-world applications. | 
|---|---|
| DOI: | 10.48550/arxiv.2102.00067 |