Investigation of the fine structure of European populations with applications to disease association studies

An investigation into fine-scale European population structure was carried out using high-density genetic variation on nearly 6000 individuals originating from across Europe. The individuals were collected as control samples and were genotyped with more than 300 000 SNPs in genome-wide association s...

Full description

Saved in:
Bibliographic Details
Published inEuropean journal of human genetics : EJHG Vol. 16; no. 12; pp. 1413 - 1429
Main Authors Heath, Simon C, Gut, Ivo G, Brennan, Paul, McKay, James D, Bencko, Vladimir, Fabianova, Eleonora, Foretova, Lenka, Georges, Michael, Janout, Vladimir, Kabesch, Michael, Krokan, Hans E, Elvestad, Maiken B, Lissowska, Jolanta, Mates, Dana, Rudnai, Peter, Skorpen, Frank, Schreiber, Stefan, Soria, José M, Syvänen, Ann-Christine, Meneton, Pierre, Herçberg, Serge, Galan, Pilar, Szeszenia-Dabrowska, Neonilia, Zaridze, David, Génin, Emmanuel, Cardon, Lon R, Lathrop, Mark
Format Journal Article Web Resource
LanguageEnglish
Published Cham Springer International Publishing 01.12.2008
Nature Publishing Group
Subjects
Online AccessGet full text
ISSN1018-4813
1476-5438
1476-5438
DOI10.1038/ejhg.2008.210

Cover

More Information
Summary:An investigation into fine-scale European population structure was carried out using high-density genetic variation on nearly 6000 individuals originating from across Europe. The individuals were collected as control samples and were genotyped with more than 300 000 SNPs in genome-wide association studies using the Illumina Infinium platform. A major East–West gradient from Russian (Moscow) samples to Spanish samples was identified as the first principal component (PC) of the genetic diversity. The second PC identified a North–South gradient from Norway and Sweden to Romania and Spain. Variation of frequencies at markers in three separate genomic regions, surrounding LCT , HLA and HERC2 , were strongly associated with this gradient. The next 18 PCs also accounted for a significant proportion of genetic diversity observed in the sample. We present a method to predict the ethnic origin of samples by comparing the sample genotypes with those from a reference set of samples of known origin. These predictions can be performed using just summary information on the known samples, and individual genotype data are not required. We discuss issues raised by these data and analyses for association studies including the matching of case-only cohorts to appropriate pre-collected control samples for genome-wide association studies.
Bibliography:ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-1
ObjectType-Feature-2
content type line 23
scopus-id:2-s2.0-56749169214
ISSN:1018-4813
1476-5438
1476-5438
DOI:10.1038/ejhg.2008.210