Telomere-to-telomere assembly of a complete human X chromosome


Bibliographic Details
Published in Nature (London) Vol. 585; no. 7823; pp. 79-84
Main Authors Miga, Karen H., Koren, Sergey, Rhie, Arang, Vollger, Mitchell R., Gershman, Ariel, Bzikadze, Andrey, Brooks, Shelise, Howe, Edmund, Porubsky, David, Logsdon, Glennis A., Schneider, Valerie A., Potapova, Tamara, Wood, Jonathan, Chow, William, Armstrong, Joel, Fredrickson, Jeanne, Pak, Evgenia, Tigyi, Kristof, Kremitzki, Milinn, Markovic, Christopher, Maduro, Valerie, Dutra, Amalia, Bouffard, Gerard G., Chang, Alexander M., Hansen, Nancy F., Wilfert, Amy B., Thibaud-Nissen, Françoise, Schmitt, Anthony D., Belton, Jon-Matthew, Selvaraj, Siddarth, Dennis, Megan Y., Soto, Daniela C., Sahasrabudhe, Ruta, Kaya, Gulhan, Quick, Josh, Loman, Nicholas J., Holmes, Nadine, Loose, Matthew, Surti, Urvashi, Risques, Rosa Ana, Graves Lindsay, Tina A., Fulton, Robert, Hall, Ira, Paten, Benedict, Howe, Kerstin, Timp, Winston, Young, Alice, Mullikin, James C., Pevzner, Pavel A., Gerton, Jennifer L., Sullivan, Beth A., Eichler, Evan E., Phillippy, Adam M.
Format Journal Article
Language English
Published London Nature Publishing Group UK 03.09.2020
Nature Publishing Group
ISSN 0028-0836
1476-4687
DOI 10.1038/s41586-020-2547-7

Summary: After two decades of improvements, the current human reference genome (GRCh38) is the most accurate and complete vertebrate genome ever produced. However, no single chromosome has been finished end to end, and hundreds of unresolved gaps persist [1, 2]. Here we present a human genome assembly that surpasses the continuity of GRCh38 [2], along with a gapless, telomere-to-telomere assembly of a human chromosome. This was enabled by high-coverage, ultra-long-read nanopore sequencing of the complete hydatidiform mole CHM13 genome, combined with complementary technologies for quality improvement and validation. Focusing our efforts on the human X chromosome [3], we reconstructed the centromeric satellite DNA array (approximately 3.1 Mb) and closed the 29 remaining gaps in the current reference, including new sequences from the human pseudoautosomal regions and from cancer-testis ampliconic gene families (CT-X and GAGE). These sequences will be integrated into future human reference genome releases. In addition, the complete chromosome X, combined with the ultra-long nanopore data, allowed us to map methylation patterns across complex tandem repeats and satellite arrays. Our results demonstrate that finishing the entire human genome is now within reach, and the data presented here will facilitate ongoing efforts to complete the other human chromosomes.
High-coverage, ultra-long-read nanopore sequencing is used to create a new human genome assembly that improves on the coverage and accuracy of the current reference (GRCh38) and includes the gap-free, telomere-to-telomere sequence of the X chromosome.