Integrating gene annotation with orthology inference at scale

Annotating coding genes and inferring orthologs are two classical challenges in genomics and evolutionary biology that have traditionally been approached separately, limiting scalability. We present TOGA (Tool to infer Orthologs from Genome Alignments), a method that integrates structural gene annot...

Full description

Saved in:
Bibliographic Details
Published inScience (American Association for the Advancement of Science) Vol. 380; no. 6643; p. eabn3107
Main Authors Kirilenko, Bogdan M, Munegowda, Chetan, Osipova, Ekaterina, Jebb, David, Sharma, Virag, Blumer, Moritz, Morales, Ariadna E, Ahmed, Alexis-Walid, Kontopoulos, Dimitrios-Georgios, Hilgers, Leon, Lindblad-Toh, Kerstin, Karlsson, Elinor K, Hiller, Michael
Format Journal Article
LanguageEnglish
Published United States 28.04.2023
Subjects
Online AccessGet full text
ISSN1095-9203
0036-8075
1095-9203
DOI10.1126/science.abn3107

Cover

More Information
Summary:Annotating coding genes and inferring orthologs are two classical challenges in genomics and evolutionary biology that have traditionally been approached separately, limiting scalability. We present TOGA (Tool to infer Orthologs from Genome Alignments), a method that integrates structural gene annotation and orthology inference. TOGA implements a different paradigm to infer orthologous loci, improves ortholog detection and annotation of conserved genes compared with state-of-the-art methods, and handles even highly fragmented assemblies. TOGA scales to hundreds of genomes, which we demonstrate by applying it to 488 placental mammal and 501 bird assemblies, creating the largest comparative gene resources so far. Additionally, TOGA detects gene losses, enables selection screens, and automatically provides a superior measure of mammalian genome quality. TOGA is a powerful and scalable method to annotate and compare genes in the genomic era.
ISSN:1095-9203
0036-8075
1095-9203
DOI:10.1126/science.abn3107