When the levee breaks: a practical guide to sketching algorithms for processing the flood of genomic data

Considerable advances in genomics over the past decade have resulted in vast amounts of data being generated and deposited in global archives. The growth of these archives exceeds our ability to process their content, leading to significant analysis bottlenecks. Sketching algorithms produce small, a...

Full description

Saved in:
Bibliographic Details
Published inGenome Biology Vol. 20; no. 1; p. 199
Main Author Rowe, Will P. M.
Format Journal Article
LanguageEnglish
Published London BioMed Central 13.09.2019
Springer Nature B.V
BMC
Subjects
Online AccessGet full text
ISSN1474-760X
1474-7596
1474-760X
DOI10.1186/s13059-019-1809-x

Cover

More Information
Summary:Considerable advances in genomics over the past decade have resulted in vast amounts of data being generated and deposited in global archives. The growth of these archives exceeds our ability to process their content, leading to significant analysis bottlenecks. Sketching algorithms produce small, approximate summaries of data and have shown great utility in tackling this flood of genomic data, while using minimal compute resources. This article reviews the current state of the field, focusing on how the algorithms work and how genomicists can utilize them effectively. References to interactive workbooks for explaining concepts and demonstrating workflows are included at https://github.com/will-rowe/genome-sketching .
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ObjectType-Review-3
content type line 23
ISSN:1474-760X
1474-7596
1474-760X
DOI:10.1186/s13059-019-1809-x