Estimating Effect Sizes of Differentially Expressed Genes for Power and Sample‐Size Assessments in Microarray Experiments

Bibliographic Details
Published in	Biometrics Vol. 67; no. 4; pp. 1225 - 1235
Main Authors	Matsui, Shigeyuki; Noma, Hisashi
Format	Journal Article
Language	English
Published	Malden, USA: Blackwell Publishing Inc, 01.12.2011; Wiley-Blackwell; Blackwell Publishing Ltd
Subjects	Algorithms; Bayes estimators; Bayes Theorem; Bayesian analysis; BIOMETRIC METHODOLOGY; Biometrics; biometry; clinical trials; data collection; Data Interpretation, Statistical; Datasets; DNA - genetics; Effect size; Empirical Bayes; Estimating techniques; Estimation methods; Gene expression; Gene Expression Profiling - methods; gene expression regulation; Gene screening; Genes; Genetic screening; Hierarchical mixture models; microarray technology; Microarrays; Multilevel models; Noma; Oligonucleotide Array Sequence Analysis - methods; Power; Sample Size; Sampling techniques; screening; Screening tests
ISSN	0006-341X; 1541-0420
DOI	10.1111/j.1541-0420.2011.01618.x

Summary:	In microarray screening for differentially expressed genes using multiple testing, assessment of power or sample size is of particular importance to ensure that few relevant genes are removed from further consideration prematurely. In this assessment, adequate estimation of the effect sizes of differentially expressed genes is crucial because of its substantial impact on power and sample‐size estimates. However, conventional methods using top genes with largest observed effect sizes would be subject to overestimation due to random variation. In this article, we propose a simple estimation method based on hierarchical mixture models with a nonparametric prior distribution to accommodate random variation and possible large diversity of effect sizes across differential genes, separated from nuisance, nondifferential genes. Based on empirical Bayes estimates of effect sizes, the power and false discovery rate (FDR) can be estimated to monitor them simultaneously in gene screening. We also propose a power index that concerns selection of top genes with largest effect sizes, called partial power. This new power index could provide a practical compromise for the difficulty in achieving high levels of usual overall power as confronted in many microarray experiments. Applications to two real datasets from cancer clinical studies are provided.
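
The overestimation the summary attributes to conventional top-gene methods can be illustrated with a small simulation. The sketch below is not the authors' hierarchical mixture model or their empirical Bayes estimator; it only shows the selection effect under stated assumptions. The function name `top_gene_bias`, the uniform distribution of true effect sizes, the 10% proportion of differential genes, and the normal approximation with standard deviation sqrt(4/n) for a standardized mean difference between two equal groups are all illustrative choices, not values from the article.

```python
import random
from math import sqrt


def top_gene_bias(m=5000, prop_diff=0.1, n=40, k=100, seed=0):
    """Return (mean observed, mean true) effect size of the k top-ranked genes.

    All parameter defaults are hypothetical values chosen for illustration.
    """
    rng = random.Random(seed)
    # Approximate SD of a standardized mean difference when the n samples
    # are split into two equal groups (1/(n/2) + 1/(n/2) = 4/n).
    tau = sqrt(4.0 / n)

    genes = []
    for _ in range(m):
        is_diff = rng.random() < prop_diff
        # Assumed prior: differential genes have true effects in [0.2, 1.0];
        # nondifferential genes have a true effect of exactly zero.
        true_effect = rng.uniform(0.2, 1.0) if is_diff else 0.0
        observed = rng.gauss(true_effect, tau)
        genes.append((observed, true_effect))

    # Rank genes by observed effect size and keep the k largest.
    top = sorted(genes, reverse=True)[:k]
    mean_observed = sum(obs for obs, _ in top) / k
    mean_true = sum(true for _, true in top) / k
    return mean_observed, mean_true


if __name__ == "__main__":
    obs, true = top_gene_bias()
    print(f"top genes: mean observed = {obs:.2f}, mean true = {true:.2f}")
```

Because genes are selected on their observed values, the top-ranked set is enriched for genes whose random error happened to be positive, so the mean observed effect exceeds the mean true effect of the same genes. Power or sample-size calculations that plug in the observed values would therefore tend to be optimistic, which is the motivation the summary gives for a shrinkage-type estimate.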
Bibliography:	http://dx.doi.org/10.1111/j.1541-0420.2011.01618.x; istex:B510EA6AC5329F14E9EECB5E0B49064F3DD3FAAF; ArticleID:BIOM1618; ark:/67375/WNG-2KJ170WF-J