Bayesian Models for Integrative Genomics

Bibliographic Details
Published in: Advances in Statistical Bioinformatics, pp. 272-291
Main Authors: Stingo, Francesco C.; Vannucci, Marina
Format: Book Chapter
Language: English
Published: Cambridge University Press, 10.06.2013
ISBN: 1107027527, 9781107027527
DOI: 10.1017/CBO9781139226448.014

Summary: Introduction. The practical utility of variable selection is well recognized, and this topic has been the focus of much research. Bayesian methods for variable selection have several appealing features. They address the selection and prediction problems in a unified manner, allow rich modeling via the implementation of Markov chain Monte Carlo (MCMC) stochastic search strategies, and incorporate optimal model averaging prediction strategies; they extend quite naturally to multivariate responses and many linear and nonlinear settings; they can handle the “small n–large p” setting (i.e., situations in which the number of measured covariates is much larger than the sample size); and they allow past and collateral information to be easily accommodated into the model through the priors. The flexibility of the variable selection approach, in particular the fact that it can handle the “small n–large p” paradigm, has made Bayesian methods particularly relevant for the analysis of genomic studies, in which high-throughput technologies allow thousands of variables to be measured on individual samples. In this chapter we discuss recent contributions from our group on methods for integrative genomics. First, we focus on methods that integrate external biological information into the analysis of gene expression data. We consider a linear model that predicts a phenotype based on predictors synthesizing the activity of genes belonging to the same pathways, and we encode into the prior model information on gene-gene networks, as retrieved from available databases.