Meta‐analysis of massively parallel reporter assays enables prediction of regulatory function across cell types

Deciphering the potential of noncoding loci to influence gene regulation has been the subject of intense research, with important implications in understanding genetic underpinnings of human diseases. Massively parallel reporter assays (MPRAs) can measure regulatory activity of thousands of DNA sequ...

Full description

Saved in:
Bibliographic Details
Published inHuman mutation Vol. 40; no. 9; pp. 1299 - 1313
Main Authors Kreimer, Anat, Yan, Zhongxia, Ahituv, Nadav, Yosef, Nir
Format Journal Article
LanguageEnglish
Published United States John Wiley & Sons, Inc 01.09.2019
John Wiley and Sons Inc
Subjects
Online AccessGet full text
ISSN1059-7794
1098-1004
1098-1004
DOI10.1002/humu.23820

Cover

More Information
Summary:Deciphering the potential of noncoding loci to influence gene regulation has been the subject of intense research, with important implications in understanding genetic underpinnings of human diseases. Massively parallel reporter assays (MPRAs) can measure regulatory activity of thousands of DNA sequences and their variants in a single experiment. With increasing number of publically available MPRA data sets, one can now develop data‐driven models which, given a DNA sequence, predict its regulatory activity. Here, we performed a comprehensive meta‐analysis of several MPRA data sets in a variety of cellular contexts. We first applied an ensemble of methods to predict MPRA output in each context and observed that the most predictive features are consistent across data sets. We then demonstrate that predictive models trained in one cellular context can be used to predict MPRA output in another, with loss of accuracy attributed to cell‐type‐specific features. Finally, we show that our approach achieves top performance in the Fifth Critical Assessment of Genome Interpretation “Regulation Saturation” Challenge for predicting effects of single‐nucleotide variants. Overall, our analysis provides insights into how MPRA data can be leveraged to highlight functional regulatory regions throughout the genome and can guide effective design of future experiments by better prioritizing regions of interest.
Bibliography:Anat Kreimer and Zhongxia Yan contributed equally.
ObjectType-Article-2
SourceType-Scholarly Journals-1
content type line 14
ObjectType-Feature-3
ObjectType-Evidence Based Healthcare-1
ObjectType-Article-1
ObjectType-Feature-2
content type line 23
ISSN:1059-7794
1098-1004
1098-1004
DOI:10.1002/humu.23820