Atlas of Transcription Factor Binding Sites from ENCODE DNase Hypersensitivity Data across 27 Tissue Types
Characterizing the tissue-specific binding sites of transcription factors (TFs) is essential to reconstruct gene regulatory networks and predict functions for non-coding genetic variation. DNase-seq footprinting enables the prediction of genome-wide binding sites for hundreds of TFs simultaneously....
Saved in:
Published in | Cell reports (Cambridge) Vol. 32; no. 7; p. 108029 |
---|---|
Main Authors | , , , , , , , , , , , , , , , , , , , , |
Format | Journal Article |
Language | English |
Published |
United States
Elsevier Inc
18.08.2020
Elsevier |
Subjects | |
Online Access | Get full text |
ISSN | 2211-1247 2211-1247 |
DOI | 10.1016/j.celrep.2020.108029 |
Cover
Summary: | Characterizing the tissue-specific binding sites of transcription factors (TFs) is essential to reconstruct gene regulatory networks and predict functions for non-coding genetic variation. DNase-seq footprinting enables the prediction of genome-wide binding sites for hundreds of TFs simultaneously. Despite the public availability of high-quality DNase-seq data from hundreds of samples, a comprehensive, up-to-date resource for the locations of genomic footprints is lacking. Here, we develop a scalable footprinting workflow using two state-of-the-art algorithms: Wellington and HINT. We apply our workflow to detect footprints in 192 ENCODE DNase-seq experiments and predict the genomic occupancy of 1,515 human TFs in 27 human tissues. We validate that these footprints overlap true-positive TF binding sites from ChIP-seq. We demonstrate that the locations, depth, and tissue specificity of footprints predict effects of genetic variants on gene expression and capture a substantial proportion of genetic risk for complex traits.
[Display omitted]
•Comprehensive map of TF occupancy in human tissues from DNase-seq footprints•Footprints contain genetic variants associated with changes in gene expression•Tissue-specific associations of footprints with genetic risk for complex traits
DNase-seq footprinting provides a means to predict genome-wide binding sites for hundreds of transcription factors (TFs) simultaneously. Funk et al. analyze data from the ENCODE consortium to create a resource of footprints in 27 human tissues, demonstrating associations of tissue-specific TF occupancy with gene regulation and disease risk. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 National Institute on Aging (NIA) National Institute of Mental Health (NIMH) National Human Genome Research Institute (NHGRI) National Institutes of Health (NIH) AC02-06CH11357 USDOE National Institute of General Medical Sciences (NIGMS) Conceptualization, C.C.F., A.M.C., R.M., N.D.P., and S.A.A.; Methodology, A.M.C.; Software, C.C.F., A.M.C., S.J., M.A.R., A.R., P.S., R.D.-M., B.H., K.C., and Y.X.; Formal Analysis, C.C.F., A.M.C., M.A.R., R.D.-M., and S.A.A.; Data Curation, C.C.F., A.M.C., and S.A.A.; Writing – Original Draft, C.C.F., A.M.C., and S.A.A.; Writing – Review & Editing, all authors; Visualization, C.C.F., A.M.C., M.A.R., and S.A.A.; Supervision, R.M., N.D.P., and S.A.A.; Project Administration, R.M., N.D.P., and S.A.A.; Funding Acquisition, R.M., N.D.P., and S.A.A. AUTHOR CONTRIBUTIONS |
ISSN: | 2211-1247 2211-1247 |
DOI: | 10.1016/j.celrep.2020.108029 |