Comprehensive, integrated, and phased whole-genome analysis of the primary ENCODE cell line K562

K562 is widely used in biomedical research. It is one of three tier-one cell lines of ENCODE and also most commonly used for large-scale CRISPR/Cas9 screens. Although its functional genomic and epigenomic characteristics have been extensively studied, its genome sequence and genomic structural featu...

Full description

Saved in:
Bibliographic Details
Published inGenome research Vol. 29; no. 3; pp. 472 - 484
Main Authors Zhou, Bo, Ho, Steve S., Greer, Stephanie U., Zhu, Xiaowei, Bell, John M., Arthur, Joseph G., Spies, Noah, Zhang, Xianglong, Byeon, Seunggyu, Pattni, Reenal, Ben-Efraim, Noa, Haney, Michael S., Haraksingh, Rajini R., Song, Giltae, Ji, Hanlee P., Perrin, Dimitri, Wong, Wing H., Abyzov, Alexej, Urban, Alexander E.
Format Journal Article
LanguageEnglish
Published United States Cold Spring Harbor Laboratory Press 01.03.2019
Subjects
Online AccessGet full text
ISSN1088-9051
1549-5469
1549-5469
DOI10.1101/gr.234948.118

Cover

More Information
Summary:K562 is widely used in biomedical research. It is one of three tier-one cell lines of ENCODE and also most commonly used for large-scale CRISPR/Cas9 screens. Although its functional genomic and epigenomic characteristics have been extensively studied, its genome sequence and genomic structural features have never been comprehensively analyzed. Such information is essential for the correct interpretation and understanding of the vast troves of existing functional genomics and epigenomics data for K562. We performed and integrated deep-coverage whole-genome (short-insert), mate-pair, and linked-read sequencing as well as karyotyping and array CGH analysis to identify a wide spectrum of genome characteristics in K562: copy numbers (CN) of aneuploid chromosome segments at high-resolution, SNVs and indels (both corrected for CN in aneuploid regions), loss of heterozygosity, megabase-scale phased haplotypes often spanning entire chromosome arms, structural variants (SVs), including small and large-scale complex SVs and nonreference retrotransposon insertions. Many SVs were phased, assembled, and experimentally validated. We identified multiple allele-specific deletions and duplications within the tumor suppressor gene FHIT . Taking aneuploidy into account, we reanalyzed K562 RNA-seq and whole-genome bisulfite sequencing data for allele-specific expression and allele-specific DNA methylation. We also show examples of how deeper insights into regulatory complexity are gained by integrating genomic variant information and structural context with functional genomics and epigenomics data. Furthermore, using K562 haplotype information, we produced an allele-specific CRISPR targeting map. This comprehensive whole-genome analysis serves as a resource for future studies that utilize K562 as well as a framework for the analysis of other cancer genomes.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
Present addresses: 1310X Genomics, Pleasanton, CA 94566, USA; 14Celsius Therapeutics, Cambridge, MA 02142, USA; 15Department of Life Sciences, The University of the West Indies, Saint Augustine, Trinidad and Tobago
ISSN:1088-9051
1549-5469
1549-5469
DOI:10.1101/gr.234948.118