Comparative Annotation Toolkit (CAT)—simultaneous clade and personal genome annotation

The recent introductions of low-cost, long-read, and read-cloud sequencing technologies coupled with intense efforts to develop efficient algorithms have made affordable, high-quality de novo sequence assembly a realistic proposition. The result is an explosion of new, ultracontiguous genome assembl...

Full description

Saved in:
Bibliographic Details
Published inGenome research Vol. 28; no. 7; pp. 1029 - 1038
Main Authors Fiddes, Ian T., Armstrong, Joel, Diekhans, Mark, Nachtweide, Stefanie, Kronenberg, Zev N., Underwood, Jason G., Gordon, David, Earl, Dent, Keane, Thomas, Eichler, Evan E., Haussler, David, Stanke, Mario, Paten, Benedict
Format Journal Article
LanguageEnglish
Published United States Cold Spring Harbor Laboratory Press 01.07.2018
Subjects
Online AccessGet full text
ISSN1088-9051
1549-5469
1549-5469
DOI10.1101/gr.233460.117

Cover

More Information
Summary:The recent introductions of low-cost, long-read, and read-cloud sequencing technologies coupled with intense efforts to develop efficient algorithms have made affordable, high-quality de novo sequence assembly a realistic proposition. The result is an explosion of new, ultracontiguous genome assemblies. To compare these genomes, we need robust methods for genome annotation. We describe the fully open source Comparative Annotation Toolkit (CAT), which provides a flexible way to simultaneously annotate entire clades and identify orthology relationships. We show that CAT can be used to improve annotations on the rat genome, annotate the great apes, annotate a diverse set of mammals, and annotate personal, diploid human genomes. We demonstrate the resulting discovery of novel genes, isoforms, and structural variants—even in genomes as well studied as rat and the great apes—and how these annotations improve cross-species RNA expression experiments.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
These authors contributed equally to this work.
ISSN:1088-9051
1549-5469
1549-5469
DOI:10.1101/gr.233460.117