Admixture In Sickle Cell Disease Defined By Genome Wide Ancestry Informative Markers Varies By Geographic Region In The United States
Genome wide association studies (GWAS) are an important tool for identifying complex human disease loci. A key variable for GWAS is the degree of admixture or recent mixing of previously isolated populations. Undetected admixture can lead to spurious associations, although it may also be used as a s...
Saved in:
Published in | Blood Vol. 122; no. 21; p. 995 |
---|---|
Main Authors | , , , , , , , , , , , , , , , |
Format | Journal Article |
Language | English |
Published |
Elsevier Inc
15.11.2013
|
Online Access | Get full text |
ISSN | 0006-4971 1528-0020 |
DOI | 10.1182/blood.V122.21.995.995 |
Cover
Abstract | Genome wide association studies (GWAS) are an important tool for identifying complex human disease loci. A key variable for GWAS is the degree of admixture or recent mixing of previously isolated populations. Undetected admixture can lead to spurious associations, although it may also be used as a strategy for mapping by admixture linkage disequilibrium (MALD). Development of reagents to identify admixture and to perform MALD may help to identify genetic modifiers of sickle cell disease (SCD). We have examined ancestry and genetic admixture in SCD patients living in North America. We first examined ancestry from family histories for 649 adults recruited to the Bethesda Sickle Cell Cohort Study in the eastern US, where 61% of subjects are African American, 22% African, 12% Caribbean or South American and 1.8% of other origins. We hypothesized that the high proportion of subjects from the Africa ancestry group would diminish the expected degree of admixture in the Bethesda cohort compared to other SCD cohorts.
To assess admixture at the genetic level, we identified ancestry informative markers (AIMs) from 3,804,602 SNPs genotyped by HapMap Phase III in CEU (European) and YRI (Nigeria) populations. These SNPs were combined with a published admixture mapping panel, yielding a new MALD panel of 2251 AIMs. We assessed the performance of our AIM panel by genotyping 221 HapMap individuals and 489 adults from the Bethesda cohort with an overall completion rate of 98.51% using Illumina iSelect arrays. HapMap samples had 98% concordance with publicly available data. Out of 2251 candidate markers, only 1806 successfully passed all quality controls (289 SNPs could not be genotyped; 82 were not AIMs; 74 had low concordance with HapMap). Using our HapMap genotype data, this admixture panel has a mean δ (difference in allele frequencies between populations) of 0.712 (SD 0.118) with an average inter-marker distance of 1.862 Mb. The utility of this 1806 marker panel for genome wide MALD in SCD was demonstrated with a proof of concept scan to map the known genetic locus underlying SCD in the Bethesda cohort.
To compare admixture in the Bethesda cohort to other SCD populations, we defined a subset of AIMs (n=360) that distinguish 6 major geo-ethnic groups from the Human Genome Diversity Panel (n=952 individuals). These results suggest that these 360 markers can distinguish contributions from geo-ethnic ancestry groups besides African and European populations. Using these markers in the Bethesda cohort, principle components analysis (PCA) showed African American or Caribbean/South American subjects had a wider range of admixture compared to Africans or HapMap populations (CEU, YRI and CHB). We then compared admixture proportions in the Bethesda cohort to 2 additional SCD populations, including 469 anonymous samples from a newborn screen SCD cohort in the western US and 439 SCD adults from a clinical trial in North America and Europe (WalkPhasst). Comparison of these SCD cohorts to 3 HapMap populations using PCA showed nearly 50% of the variation is explained by 2 major vectors that distinguish European/African and Asian/Native American ancestry. Finally, we compared admixture proportions across all 3 SCD cohorts using Structure. Here, we observed significant differences in ancestry proportions arising from Africa, Europe and Asia/Americas (P<0.001 by ANOVA). Specifically, African ancestry proportions were 0.640, 0.700, and 0.510 from the Bethesda cohort, WalkPhasst, and western newborns (all pairwise comparisons P<0.01), respectively. Significantly, there were differences between the cohorts from different regions of the US, where the Bethesda cohort had 0.039 Asian/Native American ancestry compared to 0.085 in the western SCD newborns (P<0.001). We conclude that there are significant admixture differences in SCD populations from eastern and western regions of the US. Overall, SCD patients in North America have a variable degree of genetic admixture that could affect interpretation of candidate gene or GWA studies. Furthermore, the degree of admixture adjustment varies for SCD subjects from different geographic regions within the US. Finally, our proof of concept studies suggest SCD may be ideal for MALD whole genome scans to identify genetic modifiers.
No relevant conflicts of interest to declare. |
---|---|
AbstractList | Genome wide association studies (GWAS) are an important tool for identifying complex human disease loci. A key variable for GWAS is the degree of admixture or recent mixing of previously isolated populations. Undetected admixture can lead to spurious associations, although it may also be used as a strategy for mapping by admixture linkage disequilibrium (MALD). Development of reagents to identify admixture and to perform MALD may help to identify genetic modifiers of sickle cell disease (SCD). We have examined ancestry and genetic admixture in SCD patients living in North America. We first examined ancestry from family histories for 649 adults recruited to the Bethesda Sickle Cell Cohort Study in the eastern US, where 61% of subjects are African American, 22% African, 12% Caribbean or South American and 1.8% of other origins. We hypothesized that the high proportion of subjects from the Africa ancestry group would diminish the expected degree of admixture in the Bethesda cohort compared to other SCD cohorts.
To assess admixture at the genetic level, we identified ancestry informative markers (AIMs) from 3,804,602 SNPs genotyped by HapMap Phase III in CEU (European) and YRI (Nigeria) populations. These SNPs were combined with a published admixture mapping panel, yielding a new MALD panel of 2251 AIMs. We assessed the performance of our AIM panel by genotyping 221 HapMap individuals and 489 adults from the Bethesda cohort with an overall completion rate of 98.51% using Illumina iSelect arrays. HapMap samples had 98% concordance with publicly available data. Out of 2251 candidate markers, only 1806 successfully passed all quality controls (289 SNPs could not be genotyped; 82 were not AIMs; 74 had low concordance with HapMap). Using our HapMap genotype data, this admixture panel has a mean δ (difference in allele frequencies between populations) of 0.712 (SD 0.118) with an average inter-marker distance of 1.862 Mb. The utility of this 1806 marker panel for genome wide MALD in SCD was demonstrated with a proof of concept scan to map the known genetic locus underlying SCD in the Bethesda cohort.
To compare admixture in the Bethesda cohort to other SCD populations, we defined a subset of AIMs (n=360) that distinguish 6 major geo-ethnic groups from the Human Genome Diversity Panel (n=952 individuals). These results suggest that these 360 markers can distinguish contributions from geo-ethnic ancestry groups besides African and European populations. Using these markers in the Bethesda cohort, principle components analysis (PCA) showed African American or Caribbean/South American subjects had a wider range of admixture compared to Africans or HapMap populations (CEU, YRI and CHB). We then compared admixture proportions in the Bethesda cohort to 2 additional SCD populations, including 469 anonymous samples from a newborn screen SCD cohort in the western US and 439 SCD adults from a clinical trial in North America and Europe (WalkPhasst). Comparison of these SCD cohorts to 3 HapMap populations using PCA showed nearly 50% of the variation is explained by 2 major vectors that distinguish European/African and Asian/Native American ancestry. Finally, we compared admixture proportions across all 3 SCD cohorts using Structure. Here, we observed significant differences in ancestry proportions arising from Africa, Europe and Asia/Americas (P<0.001 by ANOVA). Specifically, African ancestry proportions were 0.640, 0.700, and 0.510 from the Bethesda cohort, WalkPhasst, and western newborns (all pairwise comparisons P<0.01), respectively. Significantly, there were differences between the cohorts from different regions of the US, where the Bethesda cohort had 0.039 Asian/Native American ancestry compared to 0.085 in the western SCD newborns (P<0.001). We conclude that there are significant admixture differences in SCD populations from eastern and western regions of the US. Overall, SCD patients in North America have a variable degree of genetic admixture that could affect interpretation of candidate gene or GWA studies. Furthermore, the degree of admixture adjustment varies for SCD subjects from different geographic regions within the US. Finally, our proof of concept studies suggest SCD may be ideal for MALD whole genome scans to identify genetic modifiers. Genome wide association studies (GWAS) are an important tool for identifying complex human disease loci. A key variable for GWAS is the degree of admixture or recent mixing of previously isolated populations. Undetected admixture can lead to spurious associations, although it may also be used as a strategy for mapping by admixture linkage disequilibrium (MALD). Development of reagents to identify admixture and to perform MALD may help to identify genetic modifiers of sickle cell disease (SCD). We have examined ancestry and genetic admixture in SCD patients living in North America. We first examined ancestry from family histories for 649 adults recruited to the Bethesda Sickle Cell Cohort Study in the eastern US, where 61% of subjects are African American, 22% African, 12% Caribbean or South American and 1.8% of other origins. We hypothesized that the high proportion of subjects from the Africa ancestry group would diminish the expected degree of admixture in the Bethesda cohort compared to other SCD cohorts. To assess admixture at the genetic level, we identified ancestry informative markers (AIMs) from 3,804,602 SNPs genotyped by HapMap Phase III in CEU (European) and YRI (Nigeria) populations. These SNPs were combined with a published admixture mapping panel, yielding a new MALD panel of 2251 AIMs. We assessed the performance of our AIM panel by genotyping 221 HapMap individuals and 489 adults from the Bethesda cohort with an overall completion rate of 98.51% using Illumina iSelect arrays. HapMap samples had 98% concordance with publicly available data. Out of 2251 candidate markers, only 1806 successfully passed all quality controls (289 SNPs could not be genotyped; 82 were not AIMs; 74 had low concordance with HapMap). Using our HapMap genotype data, this admixture panel has a mean δ (difference in allele frequencies between populations) of 0.712 (SD 0.118) with an average inter-marker distance of 1.862 Mb. The utility of this 1806 marker panel for genome wide MALD in SCD was demonstrated with a proof of concept scan to map the known genetic locus underlying SCD in the Bethesda cohort. To compare admixture in the Bethesda cohort to other SCD populations, we defined a subset of AIMs (n=360) that distinguish 6 major geo-ethnic groups from the Human Genome Diversity Panel (n=952 individuals). These results suggest that these 360 markers can distinguish contributions from geo-ethnic ancestry groups besides African and European populations. Using these markers in the Bethesda cohort, principle components analysis (PCA) showed African American or Caribbean/South American subjects had a wider range of admixture compared to Africans or HapMap populations (CEU, YRI and CHB). We then compared admixture proportions in the Bethesda cohort to 2 additional SCD populations, including 469 anonymous samples from a newborn screen SCD cohort in the western US and 439 SCD adults from a clinical trial in North America and Europe (WalkPhasst). Comparison of these SCD cohorts to 3 HapMap populations using PCA showed nearly 50% of the variation is explained by 2 major vectors that distinguish European/African and Asian/Native American ancestry. Finally, we compared admixture proportions across all 3 SCD cohorts using Structure. Here, we observed significant differences in ancestry proportions arising from Africa, Europe and Asia/Americas (P<0.001 by ANOVA). Specifically, African ancestry proportions were 0.640, 0.700, and 0.510 from the Bethesda cohort, WalkPhasst, and western newborns (all pairwise comparisons P<0.01), respectively. Significantly, there were differences between the cohorts from different regions of the US, where the Bethesda cohort had 0.039 Asian/Native American ancestry compared to 0.085 in the western SCD newborns (P<0.001). We conclude that there are significant admixture differences in SCD populations from eastern and western regions of the US. Overall, SCD patients in North America have a variable degree of genetic admixture that could affect interpretation of candidate gene or GWA studies. Furthermore, the degree of admixture adjustment varies for SCD subjects from different geographic regions within the US. Finally, our proof of concept studies suggest SCD may be ideal for MALD whole genome scans to identify genetic modifiers. No relevant conflicts of interest to declare. |
Author | Diggs, Dominique Zhang, Yingze Taylor, James G. Allen, Darlene Burdette, Laurie Seamon, Cassie Hoppe, Carolyn Barr, Marci Oguhebe, Anna Quinn, Meghan Darbari, Deepika Gladwin, Mark T. Wang, Zhengyuan Hutchinson, Amy A. Diaw, Lena Dizon, Michael |
Author_xml | – sequence: 1 givenname: Zhengyuan surname: Wang fullname: Wang, Zhengyuan organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA – sequence: 2 givenname: Lena surname: Diaw fullname: Diaw, Lena organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA – sequence: 3 givenname: Michael surname: Dizon fullname: Dizon, Michael organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA – sequence: 4 givenname: Marci surname: Barr fullname: Barr, Marci organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA – sequence: 5 givenname: Meghan surname: Quinn fullname: Quinn, Meghan organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA – sequence: 6 givenname: Dominique surname: Diggs fullname: Diggs, Dominique organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA – sequence: 7 givenname: Darlene surname: Allen fullname: Allen, Darlene organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA – sequence: 8 givenname: Cassie surname: Seamon fullname: Seamon, Cassie organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA – sequence: 9 givenname: Anna surname: Oguhebe fullname: Oguhebe, Anna organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA – sequence: 10 givenname: Deepika surname: Darbari fullname: Darbari, Deepika organization: Pediatric Hematology and Oncology/ NHLBI Pulmonary and Vascular Medicine, Children's National Medical Center / National Heart, Lung, and Blood Institute, Washington, DC, USA – sequence: 11 givenname: Yingze surname: Zhang fullname: Zhang, Yingze organization: Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, PA, USA – sequence: 12 givenname: Mark T. surname: Gladwin fullname: Gladwin, Mark T. organization: Division of Pulmonary, Allergy, Critical Care Medicine, University of Pittsburgh, Pittsburgh, PA, USA – sequence: 13 givenname: Amy A. surname: Hutchinson fullname: Hutchinson, Amy A. organization: Core Genotyping Facility, National Cancer Institute, SAIC-Frederick, Inc., NCI-Frederick, Fredrick, MD, USA – sequence: 14 givenname: Laurie surname: Burdette fullname: Burdette, Laurie organization: Core Genotyping Facility, National Cancer Institute, SAIC-Frederick, Inc., NCI-Frederick, Frederick, MD, USA – sequence: 15 givenname: Carolyn surname: Hoppe fullname: Hoppe, Carolyn organization: Children's Hospital & Research Center Oakland, Oakland, CA, USA – sequence: 16 givenname: James G. surname: Taylor fullname: Taylor, James G. organization: Hematology Branch, NHLBI, NIH, Bethesda, MD, USA |
BookMark | eNqFkM9OAjEQhxuDiYA-gklfYNe2S9ndk0FQJMGYyB-PTbedhcqyJe1K5AF8b7vi3cNkDpPvNzNfD3VqWwNCt5TElGbsrqis1fGaMhYzGuc5b-sCdSlnWUQIIx3UJYQMo0Ge0ivU8_6DEDpIGO-i75Hem6_m0wGe1Xhh1K4CPIaqwhPjQXrAEyhNDRo_nPAUarsH_G404FGtwDfuFLDSur1szBHwi3Q7cB6vpTPgz4jdOHnYGoXfYGNs3a5ZbgGvatOE1EUjG_DX6LKUlYebv95Hq6fH5fg5mr9OZ-PRPFKUMB7xVKflIC90woBrnoanCp6RTGsZxoQVTANVpEyTLC-TIS8oCyBJMgWQlxlP-oifc5Wz3jsoxcGZvXQnQYloXYpfl6J1KRgVwWNbgbs_cxCOOxpwwisDwYA2DlQjtDX_JPwAkm6Aug |
ContentType | Journal Article |
Copyright | 2013 American Society of Hematology |
Copyright_xml | – notice: 2013 American Society of Hematology |
DBID | 6I. AAFTH AAYXX CITATION |
DOI | 10.1182/blood.V122.21.995.995 |
DatabaseName | ScienceDirect Open Access Titles Elsevier:ScienceDirect:Open Access CrossRef |
DatabaseTitle | CrossRef |
DatabaseTitleList | CrossRef |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Medicine Chemistry Biology Anatomy & Physiology |
EISSN | 1528-0020 |
EndPage | 995 |
ExternalDocumentID | 10_1182_blood_V122_21_995_995 S0006497119642771 |
GroupedDBID | --- -~X .55 1CY 23N 2WC 34G 39C 4.4 53G 5GY 5RE 5VS 6I. 6J9 AAEDW AAFTH AAXUO ABOCM ABVKL ACGFO ADBBV AENEX AFOSN AHPSJ ALMA_UNASSIGNED_HOLDINGS BAWUL BTFSW CS3 DIK DU5 E3Z EBS EJD EX3 F5P FDB FRP GS5 GX1 IH2 K-O KQ8 L7B LSO MJL N9A OK1 P2P R.V RHF RHI ROL SJN THE TR2 TWZ W2D W8F WH7 WOQ WOW X7M YHG YKV ZA5 0R~ AALRI AAYXX ACVFH ADCNI ADVLN AEUPX AFPUW AGCQF AIGII AITUG AKBMS AKRWK AKYEP AMRAJ CITATION H13 |
ID | FETCH-LOGICAL-c1025-57d7f49bd32e5d57000b5808dda02502b2de1c0f7389f365b12c10038cee9f853 |
ISSN | 0006-4971 |
IngestDate | Tue Jul 01 03:02:38 EDT 2025 Fri Feb 23 02:42:40 EST 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 21 |
Language | English |
License | This article is made available under the Elsevier license. |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-c1025-57d7f49bd32e5d57000b5808dda02502b2de1c0f7389f365b12c10038cee9f853 |
OpenAccessLink | https://dx.doi.org/10.1182/blood.V122.21.995.995 |
PageCount | 1 |
ParticipantIDs | crossref_primary_10_1182_blood_V122_21_995_995 elsevier_sciencedirect_doi_10_1182_blood_V122_21_995_995 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 2013-11-15 |
PublicationDateYYYYMMDD | 2013-11-15 |
PublicationDate_xml | – month: 11 year: 2013 text: 2013-11-15 day: 15 |
PublicationDecade | 2010 |
PublicationTitle | Blood |
PublicationYear | 2013 |
Publisher | Elsevier Inc |
Publisher_xml | – name: Elsevier Inc |
SSID | ssj0014325 |
Score | 2.0920653 |
Snippet | Genome wide association studies (GWAS) are an important tool for identifying complex human disease loci. A key variable for GWAS is the degree of admixture or... |
SourceID | crossref elsevier |
SourceType | Index Database Publisher |
StartPage | 995 |
Title | Admixture In Sickle Cell Disease Defined By Genome Wide Ancestry Informative Markers Varies By Geographic Region In The United States |
URI | https://dx.doi.org/10.1182/blood.V122.21.995.995 |
Volume | 122 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9NAEF6FIigXBCmI8tIeEJfIxl6vX8cmBbWIIiGlVcXFsr3rEok6VZRIuHd-DP-SmZ2147ZRRTnEcqzsbJL5PP5258XYu7QqPa1y6XhpohypvNBJwqhyfK1KT6q8KGPMHT76Gh0cy8-n4elg8KcXtbRaFm55uTGv5H-0CtdAr5glewfNdkLhApyDfuEIGobjP-kYK9_-Mi6AQ7hLZ-ilHU1wM26f3C5gTSpgkWo0brC-9PxcgxVQWIkErMNy0YxsMpIJH8KsHczmPTGrZxpC9axNrL2JW4ZppteJ6hW38E_be5726MmOfP-h67NmtYbh_iw3DqUvus7X1y5vxvGbHdbFwuYUlbP-FoUfYK4eJWl2ZjfCVnYEIm0tLZbG9oR3xRQL0cMcpU5by5pSL077kLbvbtr_BOvJmph_9wSkucJ34bNuN7pfb_vac7CLTjTrokRkRkyGYjLhZyACX_fYfREDTUP__7e1w0oGgppl2F9qk8VAzIeN32YzDepRm-kT9tiuSfgeAewpG-h6yHb26nw5P2_4e26ihI37ZcgejNuz7UnbK3DIHh7ZEI0d9rsDJT-sOYGSIyi5BSW3oOTjhhMoOYKSt6DkPVByC0pOoKQhLSg5gRKnAVByAiUnUD5jx58-TicHju314ZQ-dlQOYxVXMi1UIHSosOmCV4SJlyiVI0sXhVDaL70qBoJdBVFY-AIGekECJC-tgHM-Z1v1vNYvGM-DKipiJWVaCpnDAkhIraNCVWmkYfWc7zK3_euzCyrpkt2q8l2WtArKLC8lvpkB7G4f-vKuc71ij9Z30Gu2tVys9BugvcvirQHcXxtAq0s |
linkProvider | Colorado Alliance of Research Libraries |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Admixture+In+Sickle+Cell+Disease+Defined+By+Genome+Wide+Ancestry+Informative+Markers+Varies+By+Geographic+Region+In+The+United+States&rft.jtitle=Blood&rft.au=Wang%2C+Zhengyuan&rft.au=Diaw%2C+Lena&rft.au=Dizon%2C+Michael&rft.au=Barr%2C+Marci&rft.date=2013-11-15&rft.issn=0006-4971&rft.eissn=1528-0020&rft.volume=122&rft.issue=21&rft.spage=995&rft.epage=995&rft_id=info:doi/10.1182%2Fblood.V122.21.995.995&rft.externalDBID=n%2Fa&rft.externalDocID=10_1182_blood_V122_21_995_995 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0006-4971&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0006-4971&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0006-4971&client=summon |