Improving Cancer Classification Accuracy Using Gene Pairs

Recent studies suggest that the deregulation of pathways, rather than individual genes, may be critical in triggering carcinogenesis. The pathway deregulation is often caused by the simultaneous deregulation of more than one gene in the pathway. This suggests that robust gene pair combinations may e...

Full description

Saved in:
Bibliographic Details
Published inPloS one Vol. 5; no. 12; p. e14305
Main Authors Chopra, Pankaj, Lee, Jinseung, Kang, Jaewoo, Lee, Sunwon
Format Journal Article
LanguageEnglish
Published United States Public Library of Science 21.12.2010
Public Library of Science (PLoS)
Subjects
Online AccessGet full text
ISSN1932-6203
1932-6203
DOI10.1371/journal.pone.0014305

Cover

More Information
Summary:Recent studies suggest that the deregulation of pathways, rather than individual genes, may be critical in triggering carcinogenesis. The pathway deregulation is often caused by the simultaneous deregulation of more than one gene in the pathway. This suggests that robust gene pair combinations may exploit the underlying bio-molecular reactions that are relevant to the pathway deregulation and thus they could provide better biomarkers for cancer, as compared to individual genes. In order to validate this hypothesis, in this paper, we used gene pair combinations, called doublets, as input to the cancer classification algorithms, instead of the original expression values, and we showed that the classification accuracy was consistently improved across different datasets and classification algorithms. We validated the proposed approach using nine cancer datasets and five classification algorithms including Prediction Analysis for Microarrays (PAM), C4.5 Decision Trees (DT), Naive Bayesian (NB), Support Vector Machine (SVM), and k-Nearest Neighbor (k-NN).
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
Conceived and designed the experiments: PC JK. Performed the experiments: PC JL SL. Analyzed the data: PC JL JK SL. Wrote the paper: PC JL JK.
ISSN:1932-6203
1932-6203
DOI:10.1371/journal.pone.0014305