Predictive long-range allele-specific mapping of regulatory variants and target transcripts
Genome-wide association studies (GWASs) have identified a large number of noncoding associations, calling for systematic mapping to causal regulatory variants and their distal target genes. A widely used method, quantitative trait loci (QTL) mapping for chromatin or expression traits, suffers from s...
        Saved in:
      
    
          | Published in | PloS one Vol. 12; no. 4; p. e0175768 | 
|---|---|
| Main Authors | , , , | 
| Format | Journal Article | 
| Language | English | 
| Published | 
        United States
          Public Library of Science
    
        13.04.2017
     Public Library of Science (PLoS)  | 
| Subjects | |
| Online Access | Get full text | 
| ISSN | 1932-6203 1932-6203  | 
| DOI | 10.1371/journal.pone.0175768 | 
Cover
| Summary: | Genome-wide association studies (GWASs) have identified a large number of noncoding associations, calling for systematic mapping to causal regulatory variants and their distal target genes. A widely used method, quantitative trait loci (QTL) mapping for chromatin or expression traits, suffers from sample-to-sample experimental variation and trans-acting or environmental effects. Instead, alleles at heterozygous loci can be compared within a sample, thereby controlling for those confounding factors. Here we introduce a method for chromatin structure-based allele-specific pairing of regulatory variants and target transcripts. With phased genotypes, much of allele-specific expression could be explained by paired allelic cis-regulation across a long range. This approach showed approximately two times greater sensitivity than QTL mapping. There are cases in which allele imbalance cannot be tested because heterozygotes are not available among reference samples. Therefore, we employed a machine learning method to predict missing positive cases based on various features shared by observed allele-specific pairs. We showed that only 10 reference samples are sufficient to achieve high prediction accuracy with a low sampling variation. In conclusion, our method enables highly sensitive fine mapping and target identification for trait-associated variants based on a small number of reference samples. | 
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 Conceptualization: JKC.Data curation: KBL.Formal analysis: KBL SKL HB.Funding acquisition: JKC.Investigation: SKL.Methodology: SKL.Project administration: JKC.Resources: HB.Supervision: JKC.Writing – original draft: JKC.Writing – review & editing: JKC. Competing Interests: The authors have declared that no competing interests exist. These authors are co-first authors on this work.  | 
| ISSN: | 1932-6203 1932-6203  | 
| DOI: | 10.1371/journal.pone.0175768 |