Integrative analysis of multiple cancer prognosis studies with gene expression measurements

Although in cancer research microarray gene profiling studies have been successful in identifying genetic variants predisposing to the development and progression of cancer, the identified markers from analysis of single datasets often suffer low reproducibility. Among multiple possible causes, the...

Full description

Saved in:
Bibliographic Details
Published inStatistics in medicine Vol. 30; no. 28; pp. 3361 - 3371
Main Authors Ma, Shuangge, Huang, Jian, Wei, Fengrong, Xie, Yang, Fang, Kuangnan
Format Journal Article
LanguageEnglish
Published Chichester, UK John Wiley & Sons, Ltd 10.12.2011
Wiley Subscription Services, Inc
Subjects
Online AccessGet full text
ISSN0277-6715
1097-0258
1097-0258
DOI10.1002/sim.4337

Cover

More Information
Summary:Although in cancer research microarray gene profiling studies have been successful in identifying genetic variants predisposing to the development and progression of cancer, the identified markers from analysis of single datasets often suffer low reproducibility. Among multiple possible causes, the most important one is the small sample size hence the lack of power of single studies. Integrative analysis jointly considers multiple heterogeneous studies, has a significantly larger sample size, and can improve reproducibility. In this article, we focus on cancer prognosis studies, where the response variables are progression‐free, overall, or other types of survival. A group minimax concave penalty (GMCP) penalized integrative analysis approach is proposed for analyzing multiple heterogeneous cancer prognosis studies with microarray gene expression measurements. An efficient group coordinate descent algorithm is developed. The GMCP can automatically accommodate the heterogeneity across multiple datasets, and the identified markers have consistent effects across multiple studies. Simulation studies show that the GMCP provides significantly improved selection results as compared with the existing meta‐analysis approaches, intensity approaches, and group Lasso penalized integrative analysis. We apply the GMCP to four microarray studies and identify genes associated with the prognosis of breast cancer. Copyright © 2011 John Wiley & Sons, Ltd.
Bibliography:istex:1410A976899C2B4B5C1F75A1789E44E6E9908FAF
ArticleID:SIM4337
ark:/67375/WNG-DWC24479-V
NIH - No. LM009828; No. CA120988; No. CA152301; No. CA142774
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-1
ObjectType-Feature-2
content type line 23
shuangge.ma@yale.edu
ISSN:0277-6715
1097-0258
1097-0258
DOI:10.1002/sim.4337