A genotype calling algorithm for the Illumina BeadArray platform

Motivation: Large-scale genotyping relies on the use of unsupervised automated calling algorithms to assign genotypes to hybridization data. A number of such calling algorithms have been recently established for the Affymetrix GeneChip genotyping technology. Here, we present a fast and accurate geno...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics Vol. 23; no. 20; pp. 2741 - 2746
Main Authors Teo, Yik Y., Inouye, Michael, Small, Kerrin S., Gwilliam, Rhian, Deloukas, Panagiotis, Kwiatkowski, Dominic P., Clark, Taane G.
Format Journal Article
LanguageEnglish
Published Oxford Oxford University Press 15.10.2007
Oxford Publishing Limited (England)
Subjects
Online AccessGet full text
ISSN1367-4803
1367-4811
1460-2059
1367-4811
DOI10.1093/bioinformatics/btm443

Cover

More Information
Summary:Motivation: Large-scale genotyping relies on the use of unsupervised automated calling algorithms to assign genotypes to hybridization data. A number of such calling algorithms have been recently established for the Affymetrix GeneChip genotyping technology. Here, we present a fast and accurate genotype calling algorithm for the Illumina BeadArray genotyping platforms. As the technology moves towards assaying millions of genetic polymorphisms simultaneously, there is a need for an integrated and easy-to-use software for calling genotypes. Results: We have introduced a model-based genotype calling algorithm which does not rely on having prior training data or require computationally intensive procedures. The algorithm can assign genotypes to hybridization data from thousands of individuals simultaneously and pools information across multiple individuals to improve the calling. The method can accommodate variations in hybridization intensities which result in dramatic shifts of the position of the genotype clouds by identifying the optimal coordinates to initialize the algorithm. By incorporating the process of perturbation analysis, we can obtain a quality metric measuring the stability of the assigned genotype calls. We show that this quality metric can be used to identify SNPs with low call rates and accuracy. Availability: The C++ executable for the algorithm described here is available by request from the authors. Contact: teo@well.ox.ac.uk or tgc@well.ox.ac.uk
Bibliography:The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors.
istex:19C7D12E9B8EC95D74F294F1C29A734F406375BA
To whom correspondence should be addressed.
ark:/67375/HXZ-2R19P6XD-J
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:1367-4803
1367-4811
1460-2059
1367-4811
DOI:10.1093/bioinformatics/btm443