Integrative analysis of multiple cancer prognosis studies with gene expression measurements
Although in cancer research microarray gene profiling studies have been successful in identifying genetic variants predisposing to the development and progression of cancer, the identified markers from analysis of single datasets often suffer low reproducibility. Among multiple possible causes, the...
Saved in:
| Published in | Statistics in medicine Vol. 30; no. 28; pp. 3361 - 3371 |
|---|---|
| Main Authors | , , , , |
| Format | Journal Article |
| Language | English |
| Published |
Chichester, UK
John Wiley & Sons, Ltd
10.12.2011
Wiley Subscription Services, Inc |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0277-6715 1097-0258 1097-0258 |
| DOI | 10.1002/sim.4337 |
Cover
| Summary: | Although in cancer research microarray gene profiling studies have been successful in identifying genetic variants predisposing to the development and progression of cancer, the identified markers from analysis of single datasets often suffer low reproducibility. Among multiple possible causes, the most important one is the small sample size hence the lack of power of single studies. Integrative analysis jointly considers multiple heterogeneous studies, has a significantly larger sample size, and can improve reproducibility. In this article, we focus on cancer prognosis studies, where the response variables are progression‐free, overall, or other types of survival. A group minimax concave penalty (GMCP) penalized integrative analysis approach is proposed for analyzing multiple heterogeneous cancer prognosis studies with microarray gene expression measurements. An efficient group coordinate descent algorithm is developed. The GMCP can automatically accommodate the heterogeneity across multiple datasets, and the identified markers have consistent effects across multiple studies. Simulation studies show that the GMCP provides significantly improved selection results as compared with the existing meta‐analysis approaches, intensity approaches, and group Lasso penalized integrative analysis. We apply the GMCP to four microarray studies and identify genes associated with the prognosis of breast cancer. Copyright © 2011 John Wiley & Sons, Ltd. |
|---|---|
| Bibliography: | istex:1410A976899C2B4B5C1F75A1789E44E6E9908FAF ArticleID:SIM4337 ark:/67375/WNG-DWC24479-V NIH - No. LM009828; No. CA120988; No. CA152301; No. CA142774 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-1 ObjectType-Feature-2 content type line 23 shuangge.ma@yale.edu |
| ISSN: | 0277-6715 1097-0258 1097-0258 |
| DOI: | 10.1002/sim.4337 |