Accuracy of imputation to infer unobserved APOE epsilon alleles in genome-wide genotyping data


Bibliographic Details
Published in European journal of human genetics : EJHG Vol. 22; no. 10; pp. 1239 - 1242
Main Authors Radmanesh, Farid, Devan, William J, Anderson, Christopher D, Rosand, Jonathan, Falcone, Guido J
Format Journal Article
Language English
Published England Nature Publishing Group 01.10.2014
ISSN 1018-4813, 1476-5438
DOI 10.1038/ejhg.2013.308

Summary: Apolipoprotein E, encoded by APOE, is the main apoprotein for catabolism of chylomicrons and very low density lipoprotein. Two common single-nucleotide polymorphisms (SNPs) in APOE, rs429358 and rs7412, determine the three epsilon alleles that are established genetic risk factors for late-onset Alzheimer's disease (AD), cerebral amyloid angiopathy, and intracerebral hemorrhage (ICH). These two SNPs are not present in most commercially available genome-wide genotyping arrays and cannot be inferred through imputation using HapMap reference panels. Therefore, these SNPs are often separately genotyped. Introduction of reference panels compiled from the 1000 Genomes project has made imputation of these variants possible. We compared the directly genotyped and imputed SNPs that define the APOE epsilon alleles to determine the accuracy of imputation for inference of unobserved epsilon alleles. We utilized genome-wide genotype data obtained from two cohorts of ICH and AD constituting subjects of European ancestry. Our data suggest that imputation is highly accurate, yields an acceptable proportion of missing data that is non-differentially distributed across case and control groups, and generates comparable results to genotyped data for hypothesis testing. Further, we explored the effect of imputation algorithm parameters and demonstrated that customization of these parameters yields an improved balance between accuracy and missing data for inferred genotypes.
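Illustrative note: the summary describes inferring epsilon alleles from rs429358 and rs7412 and comparing imputed calls with direct genotypes. The Python sketch below shows one way this could be expressed; it is not the authors' pipeline. The haplotype definitions (e2 = T-T, e3 = T-C, e4 = C-C on the forward strand) are the standard ones. The function names, the posterior-probability calling threshold of 0.9, and the sample values are assumptions made for illustration, since the summary does not state which imputation parameters were tuned.

```python
from typing import Optional, Sequence, Tuple

# Unphased genotype pair (rs429358, rs7412) -> APOE epsilon genotype.
# Haplotypes: e2 = T-T, e3 = T-C, e4 = C-C (forward-strand alleles).
EPSILON_BY_GENOTYPE = {
    ("TT", "TT"): "e2/e2",
    ("TT", "CT"): "e2/e3",
    ("TT", "CC"): "e3/e3",
    ("CT", "CT"): "e2/e4",  # assumes the very rare C-T haplotype is absent
    ("CT", "CC"): "e3/e4",
    ("CC", "CC"): "e4/e4",
}


def apoe_genotype(rs429358: str, rs7412: str) -> Optional[str]:
    """Map two unphased SNP genotypes (e.g. "CT", "CC") to an epsilon genotype.

    Returns None for combinations that cannot be built from e2, e3 and e4.
    """
    key = ("".join(sorted(rs429358.upper())), "".join(sorted(rs7412.upper())))
    return EPSILON_BY_GENOTYPE.get(key)


def call_from_posteriors(probs: Sequence[float],
                         threshold: float = 0.9) -> Optional[int]:
    """Hypothetical hard-call rule: return the index of the most probable
    genotype, or None (missing) if its posterior is below the threshold."""
    best = max(range(len(probs)), key=lambda i: probs[i])
    return best if probs[best] >= threshold else None


def accuracy_and_missingness(
    genotyped: Sequence[str], imputed: Sequence[Optional[str]]
) -> Tuple[Optional[float], float]:
    """Compare imputed calls with direct genotypes.

    Returns (concordance among called samples, proportion of missing calls).
    Concordance is None when no sample has an imputed call.
    """
    if not genotyped or len(genotyped) != len(imputed):
        raise ValueError("inputs must be non-empty and of equal length")
    called = [(g, i) for g, i in zip(genotyped, imputed) if i is not None]
    missing = 1 - len(called) / len(genotyped)
    if not called:
        return None, missing
    matches = sum(g == i for g, i in called)
    return matches / len(called), missing


if __name__ == "__main__":
    # Made-up example values, not data from the study.
    print(apoe_genotype("TC", "CC"))
    print(call_from_posteriors((0.50, 0.45, 0.05)))
    print(accuracy_and_missingness(
        ["e3/e3", "e3/e4", "e2/e3", "e4/e4"],
        ["e3/e3", "e3/e3", None, "e4/e4"],
    ))
```

Raising the threshold in such a rule would be expected to trade more missing calls for higher concordance among the calls that remain, which is the kind of balance the summary refers to.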
These authors contributed equally to this work.
Data used in preparation of this article was obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database (adni.loni.ucla.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data, but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at: http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf