Estimating Effect Sizes of Differentially Expressed Genes for Power and Sample‐Size Assessments in Microarray Experiments

In microarray screening for differentially expressed genes using multiple testing, assessment of power or sample size is of particular importance to ensure that few relevant genes are removed from further consideration prematurely. In this assessment, adequate estimation of the effect sizes of diffe...

Full description

Saved in:
Bibliographic Details
Published inBiometrics Vol. 67; no. 4; pp. 1225 - 1235
Main Authors Matsui, Shigeyuki, Noma, Hisashi
Format Journal Article
LanguageEnglish
Published Malden, USA Blackwell Publishing Inc 01.12.2011
Wiley-Blackwell
Blackwell Publishing Ltd
Subjects
Online AccessGet full text
ISSN0006-341X
1541-0420
1541-0420
DOI10.1111/j.1541-0420.2011.01618.x

Cover

More Information
Summary:In microarray screening for differentially expressed genes using multiple testing, assessment of power or sample size is of particular importance to ensure that few relevant genes are removed from further consideration prematurely. In this assessment, adequate estimation of the effect sizes of differentially expressed genes is crucial because of its substantial impact on power and sample‐size estimates. However, conventional methods using top genes with largest observed effect sizes would be subject to overestimation due to random variation. In this article, we propose a simple estimation method based on hierarchical mixture models with a nonparametric prior distribution to accommodate random variation and possible large diversity of effect sizes across differential genes, separated from nuisance, nondifferential genes. Based on empirical Bayes estimates of effect sizes, the power and false discovery rate (FDR) can be estimated to monitor them simultaneously in gene screening. We also propose a power index that concerns selection of top genes with largest effect sizes, called partial power. This new power index could provide a practical compromise for the difficulty in achieving high levels of usual overall power as confronted in many microarray experiments. Applications to two real datasets from cancer clinical studies are provided.
Bibliography:http://dx.doi.org/10.1111/j.1541-0420.2011.01618.x
istex:B510EA6AC5329F14E9EECB5E0B49064F3DD3FAAF
ArticleID:BIOM1618
ark:/67375/WNG-2KJ170WF-J
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ObjectType-Article-2
ObjectType-Feature-1
content type line 23
ISSN:0006-341X
1541-0420
1541-0420
DOI:10.1111/j.1541-0420.2011.01618.x