Expansion of Biological Pathways Based on Evolutionary Inference

The availability of diverse genomes makes it possible to predict gene function based on shared evolutionary history. This approach can be challenging, however, for pathways whose components do not exhibit a shared history but rather consist of distinct “evolutionary modules.” We introduce a computat...

Full description

Saved in:
Bibliographic Details
Published inCell Vol. 158; no. 1; pp. 213 - 225
Main Authors Li, Yang, Calvo, Sarah E., Gutman, Roee, Liu, Jun S., Mootha, Vamsi K.
Format Journal Article
LanguageEnglish
Published United States Elsevier Inc 03.07.2014
Subjects
Online AccessGet full text
ISSN0092-8674
1097-4172
1097-4172
DOI10.1016/j.cell.2014.05.034

Cover

More Information
Summary:The availability of diverse genomes makes it possible to predict gene function based on shared evolutionary history. This approach can be challenging, however, for pathways whose components do not exhibit a shared history but rather consist of distinct “evolutionary modules.” We introduce a computational algorithm, clustering by inferred models of evolution (CLIME), which inputs a eukaryotic species tree, homology matrix, and pathway (gene set) of interest. CLIME partitions the gene set into disjoint evolutionary modules, simultaneously learning the number of modules and a tree-based evolutionary history that defines each module. CLIME then expands each module by scanning the genome for new components that likely arose under the inferred evolutionary model. Application of CLIME to ∼1,000 annotated human pathways and to the proteomes of yeast, red algae, and malaria reveals unanticipated evolutionary modularity and coevolving components. CLIME is freely available and should become increasingly powerful with the growing wealth of eukaryotic genomes. [Display omitted] •A tree-based algorithm (CLIME) clusters gene sets based on evolutionary history•CLIME predicts new members of a pathway based on shared inferred ancestry•Entire genomes or focused pathways can be explored•CLIME software and a web-based interface are freely available Genes that share a phylogenetic history, showing related patterns of loss and gain across a wide range of organisms, are clustered into functional units at the pathway or genome level. The evolutionary patterns then highlight genes with related histories, placing them within known pathways or suggesting candidate functions.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Co-first authors
ISSN:0092-8674
1097-4172
1097-4172
DOI:10.1016/j.cell.2014.05.034