Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm

Integrative analyses of high-throughput 'omic data, such as DNA methylation, DNA copy number alteration, mRNA and protein expression levels, have created unprecedented opportunities to understand the molecular basis of human disease. In particular, integrative analyses have been the cornerstone...

Full description

Saved in:
Bibliographic Details
Published inPloS one Vol. 12; no. 5; p. e0176278
Main Authors Chalise, Prabhakar, Fridley, Brooke L.
Format Journal Article
LanguageEnglish
Published United States Public Library of Science 01.05.2017
Public Library of Science (PLoS)
Subjects
Online AccessGet full text
ISSN1932-6203
1932-6203
DOI10.1371/journal.pone.0176278

Cover

More Information
Summary:Integrative analyses of high-throughput 'omic data, such as DNA methylation, DNA copy number alteration, mRNA and protein expression levels, have created unprecedented opportunities to understand the molecular basis of human disease. In particular, integrative analyses have been the cornerstone in the study of cancer to determine molecular subtypes within a given cancer. As malignant tumors with similar morphological characteristics have been shown to exhibit entirely different molecular profiles, there has been significant interest in using multiple 'omic data for the identification of novel molecular subtypes of disease, which could impact treatment decisions. Therefore, we have developed intNMF, an integrative approach for disease subtype classification based on non-negative matrix factorization. The proposed approach carries out integrative clustering of multiple high dimensional molecular data in a single comprehensive analysis utilizing the information across multiple biological levels assessed on the same individual. As intNMF does not assume any distributional form for the data, it has obvious advantages over other model based clustering methods which require specific distributional assumptions. Application of intNMF is illustrated using both simulated and real data from The Cancer Genome Atlas (TCGA).
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
Competing Interests: The authors have declared that no competing interests exist.
Conceptualization: PC BLF.Formal analysis: PC BLF.Funding acquisition: BLF.Investigation: PC BLF.Methodology: PC BLF.Resources: PC BLF.Software: PC BLF.Supervision: PC BLF.Validation: PC BLF.Writing – original draft: PC BLF.Writing – review & editing: PC BLF.
ISSN:1932-6203
1932-6203
DOI:10.1371/journal.pone.0176278