Multi-genome Scaffold Co-assembly Based on the Analysis of Gene Orders and Genomic Repeats
Advances in the DNA sequencing technology over the past decades have increased the volume of raw sequenced genomic data available for further assembly and analysis. While there exist many software tools for assembly of sequenced genomic material, they often experience difficulties with reconstructin...
Saved in:
| Published in | Bioinformatics Research and Applications pp. 237 - 249 |
|---|---|
| Main Authors | , |
| Format | Book Chapter |
| Language | English |
| Published |
Cham
Springer International Publishing
2016
|
| Series | Lecture Notes in Computer Science |
| Subjects | |
| Online Access | Get full text |
| ISBN | 9783319387819 3319387812 |
| ISSN | 0302-9743 1611-3349 |
| DOI | 10.1007/978-3-319-38782-6_20 |
Cover
| Summary: | Advances in the DNA sequencing technology over the past decades have increased the volume of raw sequenced genomic data available for further assembly and analysis. While there exist many software tools for assembly of sequenced genomic material, they often experience difficulties with reconstructing complete chromosomes. Major obstacles include uneven read coverage and long similar subsequences (repeats) in genomes. Assemblers therefore often are able to reliably reconstruct only long subsequences, called scaffolds.
We present a method for simultaneous co-assembly of all fragmented genomes (represented as collections of scaffolds rather than chromosomes) in a given set of annotated genomes. The method is based on the analysis of gene orders and relies on the evolutionary model, which includes genome rearrangements as well as gene insertions and deletions. It can also utilize information about genomic repeats and the phylogenetic tree of the given genomes, further improving their assembly quality. |
|---|---|
| Bibliography: | The work is supported by the National Science Foundation under Grant No. IIS-1462107. |
| ISBN: | 9783319387819 3319387812 |
| ISSN: | 0302-9743 1611-3349 |
| DOI: | 10.1007/978-3-319-38782-6_20 |