Admixture In Sickle Cell Disease Defined By Genome Wide Ancestry Informative Markers Varies By Geographic Region In The United States

Genome wide association studies (GWAS) are an important tool for identifying complex human disease loci. A key variable for GWAS is the degree of admixture or recent mixing of previously isolated populations. Undetected admixture can lead to spurious associations, although it may also be used as a s...

Full description

Saved in:
Bibliographic Details
Published inBlood Vol. 122; no. 21; p. 995
Main Authors Wang, Zhengyuan, Diaw, Lena, Dizon, Michael, Barr, Marci, Quinn, Meghan, Diggs, Dominique, Allen, Darlene, Seamon, Cassie, Oguhebe, Anna, Darbari, Deepika, Zhang, Yingze, Gladwin, Mark T., Hutchinson, Amy A., Burdette, Laurie, Hoppe, Carolyn, Taylor, James G.
Format Journal Article
LanguageEnglish
Published Elsevier Inc 15.11.2013
Online AccessGet full text
ISSN0006-4971
1528-0020
DOI10.1182/blood.V122.21.995.995

Cover

Abstract Genome wide association studies (GWAS) are an important tool for identifying complex human disease loci. A key variable for GWAS is the degree of admixture or recent mixing of previously isolated populations. Undetected admixture can lead to spurious associations, although it may also be used as a strategy for mapping by admixture linkage disequilibrium (MALD). Development of reagents to identify admixture and to perform MALD may help to identify genetic modifiers of sickle cell disease (SCD). We have examined ancestry and genetic admixture in SCD patients living in North America. We first examined ancestry from family histories for 649 adults recruited to the Bethesda Sickle Cell Cohort Study in the eastern US, where 61% of subjects are African American, 22% African, 12% Caribbean or South American and 1.8% of other origins. We hypothesized that the high proportion of subjects from the Africa ancestry group would diminish the expected degree of admixture in the Bethesda cohort compared to other SCD cohorts. To assess admixture at the genetic level, we identified ancestry informative markers (AIMs) from 3,804,602 SNPs genotyped by HapMap Phase III in CEU (European) and YRI (Nigeria) populations. These SNPs were combined with a published admixture mapping panel, yielding a new MALD panel of 2251 AIMs. We assessed the performance of our AIM panel by genotyping 221 HapMap individuals and 489 adults from the Bethesda cohort with an overall completion rate of 98.51% using Illumina iSelect arrays. HapMap samples had 98% concordance with publicly available data. Out of 2251 candidate markers, only 1806 successfully passed all quality controls (289 SNPs could not be genotyped; 82 were not AIMs; 74 had low concordance with HapMap). Using our HapMap genotype data, this admixture panel has a mean δ (difference in allele frequencies between populations) of 0.712 (SD 0.118) with an average inter-marker distance of 1.862 Mb. The utility of this 1806 marker panel for genome wide MALD in SCD was demonstrated with a proof of concept scan to map the known genetic locus underlying SCD in the Bethesda cohort. To compare admixture in the Bethesda cohort to other SCD populations, we defined a subset of AIMs (n=360) that distinguish 6 major geo-ethnic groups from the Human Genome Diversity Panel (n=952 individuals). These results suggest that these 360 markers can distinguish contributions from geo-ethnic ancestry groups besides African and European populations. Using these markers in the Bethesda cohort, principle components analysis (PCA) showed African American or Caribbean/South American subjects had a wider range of admixture compared to Africans or HapMap populations (CEU, YRI and CHB). We then compared admixture proportions in the Bethesda cohort to 2 additional SCD populations, including 469 anonymous samples from a newborn screen SCD cohort in the western US and 439 SCD adults from a clinical trial in North America and Europe (WalkPhasst). Comparison of these SCD cohorts to 3 HapMap populations using PCA showed nearly 50% of the variation is explained by 2 major vectors that distinguish European/African and Asian/Native American ancestry. Finally, we compared admixture proportions across all 3 SCD cohorts using Structure. Here, we observed significant differences in ancestry proportions arising from Africa, Europe and Asia/Americas (P<0.001 by ANOVA). Specifically, African ancestry proportions were 0.640, 0.700, and 0.510 from the Bethesda cohort, WalkPhasst, and western newborns (all pairwise comparisons P<0.01), respectively. Significantly, there were differences between the cohorts from different regions of the US, where the Bethesda cohort had 0.039 Asian/Native American ancestry compared to 0.085 in the western SCD newborns (P<0.001). We conclude that there are significant admixture differences in SCD populations from eastern and western regions of the US. Overall, SCD patients in North America have a variable degree of genetic admixture that could affect interpretation of candidate gene or GWA studies. Furthermore, the degree of admixture adjustment varies for SCD subjects from different geographic regions within the US. Finally, our proof of concept studies suggest SCD may be ideal for MALD whole genome scans to identify genetic modifiers. No relevant conflicts of interest to declare.
AbstractList Genome wide association studies (GWAS) are an important tool for identifying complex human disease loci. A key variable for GWAS is the degree of admixture or recent mixing of previously isolated populations. Undetected admixture can lead to spurious associations, although it may also be used as a strategy for mapping by admixture linkage disequilibrium (MALD). Development of reagents to identify admixture and to perform MALD may help to identify genetic modifiers of sickle cell disease (SCD). We have examined ancestry and genetic admixture in SCD patients living in North America. We first examined ancestry from family histories for 649 adults recruited to the Bethesda Sickle Cell Cohort Study in the eastern US, where 61% of subjects are African American, 22% African, 12% Caribbean or South American and 1.8% of other origins. We hypothesized that the high proportion of subjects from the Africa ancestry group would diminish the expected degree of admixture in the Bethesda cohort compared to other SCD cohorts. To assess admixture at the genetic level, we identified ancestry informative markers (AIMs) from 3,804,602 SNPs genotyped by HapMap Phase III in CEU (European) and YRI (Nigeria) populations. These SNPs were combined with a published admixture mapping panel, yielding a new MALD panel of 2251 AIMs. We assessed the performance of our AIM panel by genotyping 221 HapMap individuals and 489 adults from the Bethesda cohort with an overall completion rate of 98.51% using Illumina iSelect arrays. HapMap samples had 98% concordance with publicly available data. Out of 2251 candidate markers, only 1806 successfully passed all quality controls (289 SNPs could not be genotyped; 82 were not AIMs; 74 had low concordance with HapMap). Using our HapMap genotype data, this admixture panel has a mean δ (difference in allele frequencies between populations) of 0.712 (SD 0.118) with an average inter-marker distance of 1.862 Mb. The utility of this 1806 marker panel for genome wide MALD in SCD was demonstrated with a proof of concept scan to map the known genetic locus underlying SCD in the Bethesda cohort. To compare admixture in the Bethesda cohort to other SCD populations, we defined a subset of AIMs (n=360) that distinguish 6 major geo-ethnic groups from the Human Genome Diversity Panel (n=952 individuals). These results suggest that these 360 markers can distinguish contributions from geo-ethnic ancestry groups besides African and European populations. Using these markers in the Bethesda cohort, principle components analysis (PCA) showed African American or Caribbean/South American subjects had a wider range of admixture compared to Africans or HapMap populations (CEU, YRI and CHB). We then compared admixture proportions in the Bethesda cohort to 2 additional SCD populations, including 469 anonymous samples from a newborn screen SCD cohort in the western US and 439 SCD adults from a clinical trial in North America and Europe (WalkPhasst). Comparison of these SCD cohorts to 3 HapMap populations using PCA showed nearly 50% of the variation is explained by 2 major vectors that distinguish European/African and Asian/Native American ancestry. Finally, we compared admixture proportions across all 3 SCD cohorts using Structure. Here, we observed significant differences in ancestry proportions arising from Africa, Europe and Asia/Americas (P<0.001 by ANOVA). Specifically, African ancestry proportions were 0.640, 0.700, and 0.510 from the Bethesda cohort, WalkPhasst, and western newborns (all pairwise comparisons P<0.01), respectively. Significantly, there were differences between the cohorts from different regions of the US, where the Bethesda cohort had 0.039 Asian/Native American ancestry compared to 0.085 in the western SCD newborns (P<0.001). We conclude that there are significant admixture differences in SCD populations from eastern and western regions of the US. Overall, SCD patients in North America have a variable degree of genetic admixture that could affect interpretation of candidate gene or GWA studies. Furthermore, the degree of admixture adjustment varies for SCD subjects from different geographic regions within the US. Finally, our proof of concept studies suggest SCD may be ideal for MALD whole genome scans to identify genetic modifiers.
Genome wide association studies (GWAS) are an important tool for identifying complex human disease loci. A key variable for GWAS is the degree of admixture or recent mixing of previously isolated populations. Undetected admixture can lead to spurious associations, although it may also be used as a strategy for mapping by admixture linkage disequilibrium (MALD). Development of reagents to identify admixture and to perform MALD may help to identify genetic modifiers of sickle cell disease (SCD). We have examined ancestry and genetic admixture in SCD patients living in North America. We first examined ancestry from family histories for 649 adults recruited to the Bethesda Sickle Cell Cohort Study in the eastern US, where 61% of subjects are African American, 22% African, 12% Caribbean or South American and 1.8% of other origins. We hypothesized that the high proportion of subjects from the Africa ancestry group would diminish the expected degree of admixture in the Bethesda cohort compared to other SCD cohorts. To assess admixture at the genetic level, we identified ancestry informative markers (AIMs) from 3,804,602 SNPs genotyped by HapMap Phase III in CEU (European) and YRI (Nigeria) populations. These SNPs were combined with a published admixture mapping panel, yielding a new MALD panel of 2251 AIMs. We assessed the performance of our AIM panel by genotyping 221 HapMap individuals and 489 adults from the Bethesda cohort with an overall completion rate of 98.51% using Illumina iSelect arrays. HapMap samples had 98% concordance with publicly available data. Out of 2251 candidate markers, only 1806 successfully passed all quality controls (289 SNPs could not be genotyped; 82 were not AIMs; 74 had low concordance with HapMap). Using our HapMap genotype data, this admixture panel has a mean δ (difference in allele frequencies between populations) of 0.712 (SD 0.118) with an average inter-marker distance of 1.862 Mb. The utility of this 1806 marker panel for genome wide MALD in SCD was demonstrated with a proof of concept scan to map the known genetic locus underlying SCD in the Bethesda cohort. To compare admixture in the Bethesda cohort to other SCD populations, we defined a subset of AIMs (n=360) that distinguish 6 major geo-ethnic groups from the Human Genome Diversity Panel (n=952 individuals). These results suggest that these 360 markers can distinguish contributions from geo-ethnic ancestry groups besides African and European populations. Using these markers in the Bethesda cohort, principle components analysis (PCA) showed African American or Caribbean/South American subjects had a wider range of admixture compared to Africans or HapMap populations (CEU, YRI and CHB). We then compared admixture proportions in the Bethesda cohort to 2 additional SCD populations, including 469 anonymous samples from a newborn screen SCD cohort in the western US and 439 SCD adults from a clinical trial in North America and Europe (WalkPhasst). Comparison of these SCD cohorts to 3 HapMap populations using PCA showed nearly 50% of the variation is explained by 2 major vectors that distinguish European/African and Asian/Native American ancestry. Finally, we compared admixture proportions across all 3 SCD cohorts using Structure. Here, we observed significant differences in ancestry proportions arising from Africa, Europe and Asia/Americas (P<0.001 by ANOVA). Specifically, African ancestry proportions were 0.640, 0.700, and 0.510 from the Bethesda cohort, WalkPhasst, and western newborns (all pairwise comparisons P<0.01), respectively. Significantly, there were differences between the cohorts from different regions of the US, where the Bethesda cohort had 0.039 Asian/Native American ancestry compared to 0.085 in the western SCD newborns (P<0.001). We conclude that there are significant admixture differences in SCD populations from eastern and western regions of the US. Overall, SCD patients in North America have a variable degree of genetic admixture that could affect interpretation of candidate gene or GWA studies. Furthermore, the degree of admixture adjustment varies for SCD subjects from different geographic regions within the US. Finally, our proof of concept studies suggest SCD may be ideal for MALD whole genome scans to identify genetic modifiers. No relevant conflicts of interest to declare.
Author Diggs, Dominique
Zhang, Yingze
Taylor, James G.
Allen, Darlene
Burdette, Laurie
Seamon, Cassie
Hoppe, Carolyn
Barr, Marci
Oguhebe, Anna
Quinn, Meghan
Darbari, Deepika
Gladwin, Mark T.
Wang, Zhengyuan
Hutchinson, Amy A.
Diaw, Lena
Dizon, Michael
Author_xml – sequence: 1
  givenname: Zhengyuan
  surname: Wang
  fullname: Wang, Zhengyuan
  organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA
– sequence: 2
  givenname: Lena
  surname: Diaw
  fullname: Diaw, Lena
  organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA
– sequence: 3
  givenname: Michael
  surname: Dizon
  fullname: Dizon, Michael
  organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA
– sequence: 4
  givenname: Marci
  surname: Barr
  fullname: Barr, Marci
  organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA
– sequence: 5
  givenname: Meghan
  surname: Quinn
  fullname: Quinn, Meghan
  organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA
– sequence: 6
  givenname: Dominique
  surname: Diggs
  fullname: Diggs, Dominique
  organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA
– sequence: 7
  givenname: Darlene
  surname: Allen
  fullname: Allen, Darlene
  organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA
– sequence: 8
  givenname: Cassie
  surname: Seamon
  fullname: Seamon, Cassie
  organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA
– sequence: 9
  givenname: Anna
  surname: Oguhebe
  fullname: Oguhebe, Anna
  organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA
– sequence: 10
  givenname: Deepika
  surname: Darbari
  fullname: Darbari, Deepika
  organization: Pediatric Hematology and Oncology/ NHLBI Pulmonary and Vascular Medicine, Children's National Medical Center / National Heart, Lung, and Blood Institute, Washington, DC, USA
– sequence: 11
  givenname: Yingze
  surname: Zhang
  fullname: Zhang, Yingze
  organization: Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, PA, USA
– sequence: 12
  givenname: Mark T.
  surname: Gladwin
  fullname: Gladwin, Mark T.
  organization: Division of Pulmonary, Allergy, Critical Care Medicine, University of Pittsburgh, Pittsburgh, PA, USA
– sequence: 13
  givenname: Amy A.
  surname: Hutchinson
  fullname: Hutchinson, Amy A.
  organization: Core Genotyping Facility, National Cancer Institute, SAIC-Frederick, Inc., NCI-Frederick, Fredrick, MD, USA
– sequence: 14
  givenname: Laurie
  surname: Burdette
  fullname: Burdette, Laurie
  organization: Core Genotyping Facility, National Cancer Institute, SAIC-Frederick, Inc., NCI-Frederick, Frederick, MD, USA
– sequence: 15
  givenname: Carolyn
  surname: Hoppe
  fullname: Hoppe, Carolyn
  organization: Children's Hospital & Research Center Oakland, Oakland, CA, USA
– sequence: 16
  givenname: James G.
  surname: Taylor
  fullname: Taylor, James G.
  organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA
BookMark eNqFkM9OAjEQhxuDiYA-gklfYNe2S9ndk0FQJMGYyB-PTbedhcqyJe1K5AF8b7vi3cNkDpPvNzNfD3VqWwNCt5TElGbsrqis1fGaMhYzGuc5b-sCdSlnWUQIIx3UJYQMo0Ge0ivU8_6DEDpIGO-i75Hem6_m0wGe1Xhh1K4CPIaqwhPjQXrAEyhNDRo_nPAUarsH_G404FGtwDfuFLDSur1szBHwi3Q7cB6vpTPgz4jdOHnYGoXfYGNs3a5ZbgGvatOE1EUjG_DX6LKUlYebv95Hq6fH5fg5mr9OZ-PRPFKUMB7xVKflIC90woBrnoanCp6RTGsZxoQVTANVpEyTLC-TIS8oCyBJMgWQlxlP-oifc5Wz3jsoxcGZvXQnQYloXYpfl6J1KRgVwWNbgbs_cxCOOxpwwisDwYA2DlQjtDX_JPwAkm6Aug
ContentType Journal Article
Copyright 2013 American Society of Hematology
Copyright_xml – notice: 2013 American Society of Hematology
DBID 6I.
AAFTH
AAYXX
CITATION
DOI 10.1182/blood.V122.21.995.995
DatabaseName ScienceDirect Open Access Titles
Elsevier:ScienceDirect:Open Access
CrossRef
DatabaseTitle CrossRef
DatabaseTitleList CrossRef

DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Chemistry
Biology
Anatomy & Physiology
EISSN 1528-0020
EndPage 995
ExternalDocumentID 10_1182_blood_V122_21_995_995
S0006497119642771
GroupedDBID ---
-~X
.55
1CY
23N
2WC
34G
39C
4.4
53G
5GY
5RE
5VS
6I.
6J9
AAEDW
AAFTH
AAXUO
ABOCM
ABVKL
ACGFO
ADBBV
AENEX
AFOSN
AHPSJ
ALMA_UNASSIGNED_HOLDINGS
BAWUL
BTFSW
CS3
DIK
DU5
E3Z
EBS
EJD
EX3
F5P
FDB
FRP
GS5
GX1
IH2
K-O
KQ8
L7B
LSO
MJL
N9A
OK1
P2P
R.V
RHF
RHI
ROL
SJN
THE
TR2
TWZ
W2D
W8F
WH7
WOQ
WOW
X7M
YHG
YKV
ZA5
0R~
AALRI
AAYXX
ACVFH
ADCNI
ADVLN
AEUPX
AFPUW
AGCQF
AIGII
AITUG
AKBMS
AKRWK
AKYEP
AMRAJ
CITATION
H13
ID FETCH-LOGICAL-c1025-57d7f49bd32e5d57000b5808dda02502b2de1c0f7389f365b12c10038cee9f853
ISSN 0006-4971
IngestDate Tue Jul 01 03:02:38 EDT 2025
Fri Feb 23 02:42:40 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 21
Language English
License This article is made available under the Elsevier license.
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c1025-57d7f49bd32e5d57000b5808dda02502b2de1c0f7389f365b12c10038cee9f853
OpenAccessLink https://dx.doi.org/10.1182/blood.V122.21.995.995
PageCount 1
ParticipantIDs crossref_primary_10_1182_blood_V122_21_995_995
elsevier_sciencedirect_doi_10_1182_blood_V122_21_995_995
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2013-11-15
PublicationDateYYYYMMDD 2013-11-15
PublicationDate_xml – month: 11
  year: 2013
  text: 2013-11-15
  day: 15
PublicationDecade 2010
PublicationTitle Blood
PublicationYear 2013
Publisher Elsevier Inc
Publisher_xml – name: Elsevier Inc
SSID ssj0014325
Score 2.0920653
Snippet Genome wide association studies (GWAS) are an important tool for identifying complex human disease loci. A key variable for GWAS is the degree of admixture or...
SourceID crossref
elsevier
SourceType Index Database
Publisher
StartPage 995
Title Admixture In Sickle Cell Disease Defined By Genome Wide Ancestry Informative Markers Varies By Geographic Region In The United States
URI https://dx.doi.org/10.1182/blood.V122.21.995.995
Volume 122
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9NAEF6FIigXBCmI8tIeEJfIxl6vX8cmBbWIIiGlVcXFsr3rEok6VZRIuHd-DP-SmZ2147ZRRTnEcqzsbJL5PP5258XYu7QqPa1y6XhpohypvNBJwqhyfK1KT6q8KGPMHT76Gh0cy8-n4elg8KcXtbRaFm55uTGv5H-0CtdAr5glewfNdkLhApyDfuEIGobjP-kYK9_-Mi6AQ7hLZ-ilHU1wM26f3C5gTSpgkWo0brC-9PxcgxVQWIkErMNy0YxsMpIJH8KsHczmPTGrZxpC9axNrL2JW4ZppteJ6hW38E_be5726MmOfP-h67NmtYbh_iw3DqUvus7X1y5vxvGbHdbFwuYUlbP-FoUfYK4eJWl2ZjfCVnYEIm0tLZbG9oR3xRQL0cMcpU5by5pSL077kLbvbtr_BOvJmph_9wSkucJ34bNuN7pfb_vac7CLTjTrokRkRkyGYjLhZyACX_fYfREDTUP__7e1w0oGgppl2F9qk8VAzIeN32YzDepRm-kT9tiuSfgeAewpG-h6yHb26nw5P2_4e26ihI37ZcgejNuz7UnbK3DIHh7ZEI0d9rsDJT-sOYGSIyi5BSW3oOTjhhMoOYKSt6DkPVByC0pOoKQhLSg5gRKnAVByAiUnUD5jx58-TicHju314ZQ-dlQOYxVXMi1UIHSosOmCV4SJlyiVI0sXhVDaL70qBoJdBVFY-AIGekECJC-tgHM-Z1v1vNYvGM-DKipiJWVaCpnDAkhIraNCVWmkYfWc7zK3_euzCyrpkt2q8l2WtArKLC8lvpkB7G4f-vKuc71ij9Z30Gu2tVys9BugvcvirQHcXxtAq0s
linkProvider Colorado Alliance of Research Libraries
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Admixture+In+Sickle+Cell+Disease+Defined+By+Genome+Wide+Ancestry+Informative+Markers+Varies+By+Geographic+Region+In+The+United+States&rft.jtitle=Blood&rft.au=Wang%2C+Zhengyuan&rft.au=Diaw%2C+Lena&rft.au=Dizon%2C+Michael&rft.au=Barr%2C+Marci&rft.date=2013-11-15&rft.issn=0006-4971&rft.eissn=1528-0020&rft.volume=122&rft.issue=21&rft.spage=995&rft.epage=995&rft_id=info:doi/10.1182%2Fblood.V122.21.995.995&rft.externalDBID=n%2Fa&rft.externalDocID=10_1182_blood_V122_21_995_995
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0006-4971&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0006-4971&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0006-4971&client=summon