Computational Prediction of Polycomb-Associated Long Non-Coding RNAs

Among thousands of long non-coding RNAs (lncRNAs) only a small subset is functionally characterized and the functional annotation of lncRNAs on the genomic scale remains inadequate. In this study we computationally characterized two functionally different parts of human lncRNAs transcriptome based o...

Full description

Saved in:
Bibliographic Details
Published inPloS one Vol. 7; no. 9; p. e44878
Main Authors Glazko, Galina V., Zybailov, Boris L., Rogozin, Igor B.
Format Journal Article
LanguageEnglish
Published United States Public Library of Science 13.09.2012
Public Library of Science (PLoS)
Subjects
Online AccessGet full text
ISSN1932-6203
1932-6203
DOI10.1371/journal.pone.0044878

Cover

More Information
Summary:Among thousands of long non-coding RNAs (lncRNAs) only a small subset is functionally characterized and the functional annotation of lncRNAs on the genomic scale remains inadequate. In this study we computationally characterized two functionally different parts of human lncRNAs transcriptome based on their ability to bind the polycomb repressive complex, PRC2. This classification is enabled by the fact that while all lncRNAs constitute a diverse set of sequences, the classes of PRC2-binding and PRC2 non-binding lncRNAs possess characteristic combinations of sequence-structure patterns and, therefore, can be separated within the feature space. Based on the specific combination of features, we built several machine-learning classifiers and identified the SVM-based classifier as the best performing. We further showed that the SVM-based classifier is able to generalize on the independent data sets. We observed that this classifier, trained on the human lncRNAs, can predict up to 59.4% of PRC2-binding lncRNAs in mice. This suggests that, despite the low degree of sequence conservation, many lncRNAs play functionally conserved biological roles.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
Competing Interests: The authors have declared that no competing interests exist.
Conceived and designed the experiments: GG BZ IR. Performed the experiments: GG BZ. Analyzed the data: GG BZ IR. Contributed reagents/materials/analysis tools: GG BZ IR. Wrote the paper: GG BZ IR. Designed the software used in analysis: GG.
ISSN:1932-6203
1932-6203
DOI:10.1371/journal.pone.0044878