An empirical Bayes approach to inferring large-scale gene association networks

Motivation: Genetic networks are often described statistically using graphical models (e.g. Bayesian networks). However, inferring the network structure offers a serious challenge in microarray analysis where the sample size is small compared to the number of considered genes. This renders many stan...

Full description

Saved in:

Bibliographic Details
Published in	Bioinformatics Vol. 21; no. 6; pp. 754 - 764
Main Authors	Schäfer, Juliane, Strimmer, Korbinian
Format	Journal Article
Language	English
Published	Oxford Oxford University Press 15.03.2005 Oxford Publishing Limited (England)
Subjects	Algorithms Artificial Intelligence Bayes Theorem Biological and medical sciences Biomarkers, Tumor - metabolism Breast Neoplasms - genetics Breast Neoplasms - metabolism Computer Simulation Fundamental and applied biological sciences. Psychology Gene Expression Profiling - methods Gene Expression Regulation - physiology General aspects Humans Mathematical models Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) Models, Biological Models, Statistical Neoplasm Proteins - genetics Neoplasm Proteins - metabolism Oligonucleotide Array Sequence Analysis - methods Signal Transduction - physiology Topology Transcription Factors - metabolism Tumor Cells, Cultured Gene Size Bayes network Genetics Models Small sample Large scale Malignant tumor Structure Microarray
Online Access	Get full text
ISSN	1367-4803 1367-4811 1460-2059 1367-4811
DOI	10.1093/bioinformatics/bti062

Cover

More Information
Summary:	Motivation: Genetic networks are often described statistically using graphical models (e.g. Bayesian networks). However, inferring the network structure offers a serious challenge in microarray analysis where the sample size is small compared to the number of considered genes. This renders many standard algorithms for graphical models inapplicable, and inferring genetic networks an ‘ill-posed’ inverse problem. Methods: We introduce a novel framework for small-sample inference of graphical models from gene expression data. Specifically, we focus on the so-called graphical Gaussian models (GGMs) that are now frequently used to describe gene association networks and to detect conditionally dependent genes. Our new approach is based on (1) improved (regularized) small-sample point estimates of partial correlation, (2) an exact test of edge inclusion with adaptive estimation of the degree of freedom and (3) a heuristic network search based on false discovery rate multiple testing. Steps (2) and (3) correspond to an empirical Bayes estimate of the network topology. Results: Using computer simulations, we investigate the sensitivity (power) and specificity (true negative rate) of the proposed framework to estimate GGMs from microarray data. This shows that it is possible to recover the true network topology with high accuracy even for small-sample datasets. Subsequently, we analyze gene expression data from a breast cancer tumor study and illustrate our approach by inferring a corresponding large-scale gene association network for 3883 genes. Availability: The authors have implemented the approach in the R package ‘GeneTS’ that is freely available from http://www.stat.uni-muenchen.de/~strimmer/genets/, from the R archive (CRAN) and from the Bioconductor website. Contact: korbinian.strimmer@lmu.de
Bibliography:	istex:939330138323DFE036EEE7142D53313B7D43A106 local:bti062 To whom correspondence should be addressed. ark:/67375/HXZ-W9LD6GHR-D ObjectType-Article-1 SourceType-Scholarly Journals-1 content type line 14 ObjectType-Article-2 ObjectType-Feature-1 content type line 23 ObjectType-Undefined-1 ObjectType-Feature-3
ISSN:	1367-4803 1367-4811 1460-2059 1367-4811
DOI:	10.1093/bioinformatics/bti062