Atlas of Transcription Factor Binding Sites from ENCODE DNase Hypersensitivity Data across 27 Tissue Types

Characterizing the tissue-specific binding sites of transcription factors (TFs) is essential to reconstruct gene regulatory networks and predict functions for non-coding genetic variation. DNase-seq footprinting enables the prediction of genome-wide binding sites for hundreds of TFs simultaneously....

Full description

Saved in:
Bibliographic Details
Published inCell reports (Cambridge) Vol. 32; no. 7; p. 108029
Main Authors Funk, Cory C., Casella, Alex M., Jung, Segun, Richards, Matthew A., Rodriguez, Alex, Shannon, Paul, Donovan-Maiye, Rory, Heavner, Ben, Chard, Kyle, Xiao, Yukai, Glusman, Gustavo, Ertekin-Taner, Nilufer, Golde, Todd E., Toga, Arthur, Hood, Leroy, Van Horn, John D., Kesselman, Carl, Foster, Ian, Madduri, Ravi, Price, Nathan D., Ament, Seth A.
Format Journal Article
LanguageEnglish
Published United States Elsevier Inc 18.08.2020
Elsevier
Subjects
Online AccessGet full text
ISSN2211-1247
2211-1247
DOI10.1016/j.celrep.2020.108029

Cover

More Information
Summary:Characterizing the tissue-specific binding sites of transcription factors (TFs) is essential to reconstruct gene regulatory networks and predict functions for non-coding genetic variation. DNase-seq footprinting enables the prediction of genome-wide binding sites for hundreds of TFs simultaneously. Despite the public availability of high-quality DNase-seq data from hundreds of samples, a comprehensive, up-to-date resource for the locations of genomic footprints is lacking. Here, we develop a scalable footprinting workflow using two state-of-the-art algorithms: Wellington and HINT. We apply our workflow to detect footprints in 192 ENCODE DNase-seq experiments and predict the genomic occupancy of 1,515 human TFs in 27 human tissues. We validate that these footprints overlap true-positive TF binding sites from ChIP-seq. We demonstrate that the locations, depth, and tissue specificity of footprints predict effects of genetic variants on gene expression and capture a substantial proportion of genetic risk for complex traits. [Display omitted] •Comprehensive map of TF occupancy in human tissues from DNase-seq footprints•Footprints contain genetic variants associated with changes in gene expression•Tissue-specific associations of footprints with genetic risk for complex traits DNase-seq footprinting provides a means to predict genome-wide binding sites for hundreds of transcription factors (TFs) simultaneously. Funk et al. analyze data from the ENCODE consortium to create a resource of footprints in 27 human tissues, demonstrating associations of tissue-specific TF occupancy with gene regulation and disease risk.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
National Institute on Aging (NIA)
National Institute of Mental Health (NIMH)
National Human Genome Research Institute (NHGRI)
National Institutes of Health (NIH)
AC02-06CH11357
USDOE
National Institute of General Medical Sciences (NIGMS)
Conceptualization, C.C.F., A.M.C., R.M., N.D.P., and S.A.A.; Methodology, A.M.C.; Software, C.C.F., A.M.C., S.J., M.A.R., A.R., P.S., R.D.-M., B.H., K.C., and Y.X.; Formal Analysis, C.C.F., A.M.C., M.A.R., R.D.-M., and S.A.A.; Data Curation, C.C.F., A.M.C., and S.A.A.; Writing – Original Draft, C.C.F., A.M.C., and S.A.A.; Writing – Review & Editing, all authors; Visualization, C.C.F., A.M.C., M.A.R., and S.A.A.; Supervision, R.M., N.D.P., and S.A.A.; Project Administration, R.M., N.D.P., and S.A.A.; Funding Acquisition, R.M., N.D.P., and S.A.A.
AUTHOR CONTRIBUTIONS
ISSN:2211-1247
2211-1247
DOI:10.1016/j.celrep.2020.108029