The Rare Diseases Pilot for the 100,000 Genomes Project: Findings in Known and New Genes by Analysis of 3,549 Whole Genome Sequenced Samples from Patients and Relatives with Haematological, Haemostasis and Immune Disorders
The Rare Diseases Pilot study of the 100,000 Genomes Project had two objectives. Firstly, to identify the DNA variants underlying unresolved Mendelian disorders. Secondly, to develop an accredited framework for delivering whole genome sequencing (WGS) results across a national healthcare system. Fro...
Saved in:
Published in | Blood Vol. 132; no. Supplement 1; p. 504 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
Elsevier Inc
29.11.2018
|
Online Access | Get full text |
ISSN | 0006-4971 1528-0020 |
DOI | 10.1182/blood-2018-99-119068 |
Cover
Abstract | The Rare Diseases Pilot study of the 100,000 Genomes Project had two objectives. Firstly, to identify the DNA variants underlying unresolved Mendelian disorders. Secondly, to develop an accredited framework for delivering whole genome sequencing (WGS) results across a national healthcare system. From February 2014 to June 2017, 13,037 individuals with a rare disease and their relatives were recruited at 57 National Health Service (NHS) hospitals in the UK and 26 non-UK hospitals using standardized eligibility criteria for 12 rare disease domains. This cohort includes cases with haematology (n=1021), immunology (n=1359) and haemostasis disorders (n=1169).
With informed consent, clinical and laboratory data were collected and coded into a single research database using Human Phenotype Ontology (HPO) terms and 13,037 samples of DNA were Illumina WGS analysed to clinical standard at a mean depths >30X in all samples and 90% of the reference genome was covered at 19X minimum in all samples. The pilot resource contains over 165 million unique variants with 91.5%, 8,5% and 5.6% single nucleotide variants (SNVs), short insertions / deletions and large deletions of the 10,258 genetically independent samples with 47% of variants previously unobserved in large scale genome datasets (e.g. TopMED, gnomAD, UK10K). Across all domains 2,067 unique diagnostic-grade genes (DGGs) were curated to clinical standards to support pertinent finding reporting by 12 multi-disciplinary teams (MDTs) with domain-relevant clinical and genetic expertise. Over 1,300 MDT reports assigning pathogenic or likely pathogenic causal variants have been returned to referring clinicians, with the diagnostic yield ranging from 1.6 to 53.8%, depending on the extent of genetic screening pre-enrolment and the importance of the non-genetic component of the disorder (e.g. in immune disorders). About 30% of the causal variants identified have never been reported (absent from the Human Gene Mutation Database v.2018.1); interestingly, 51 variants have been reported in 11 DGGs linked to phenotypes belonging to different domains. A comparison with standard whole exome sequencing results revealed WGS to have at least 12.5% superiority in sensitivity for detecting known pathogenic variants.
For the haematology, immunology and haemostasis domains 330 causal variants were reported in 83 DGGs, revealing novel modes of inheritance (Sivapalaratnam et al Blood 2016), and entire new clinical phenotypes linked to mutations in ABCC4, GNE, KDSR and STIM1 amongst other DGGs. The genotype and HPO-coded phenotypes of all pilot cases were analysed with BeviMed, a rapid and scalable Bayesian association test (Greene et al AJHG 2017) to identify causal variants in hitherto unknown genes. This identified more than 30 genes with posterior probabilities indicating a high likelihood of being implicated in underlying as yet unresolved Mendelian disorders. Results from co-segregation and cell biology studies have already corroborated this statistical inference and 15 novel genes acquired DGG status. Including a new method for the analysis of the ‘gene-regulatory’ elements we also identified an example of a causal variant in such an element controlling the function of both GATA1 and HDAC6 resulting in a severe syndromic pathology characterized by abnormal erythropoiesis and megakaryopoiesis.
In conclusion, the pilot of the 100,000 Genomes Project has shown the feasibility of using WGS across a national health system, such as the NHS, to deliver a molecular diagnosis for patients with rare inherited diseases and how a national genotype/HPO-coded phenotype resource provides a powerful platform for the identification of novel diagnostic-grade genes.
No relevant conflicts of interest to declare. |
---|---|
AbstractList | The Rare Diseases Pilot study of the 100,000 Genomes Project had two objectives. Firstly, to identify the DNA variants underlying unresolved Mendelian disorders. Secondly, to develop an accredited framework for delivering whole genome sequencing (WGS) results across a national healthcare system. From February 2014 to June 2017, 13,037 individuals with a rare disease and their relatives were recruited at 57 National Health Service (NHS) hospitals in the UK and 26 non-UK hospitals using standardized eligibility criteria for 12 rare disease domains. This cohort includes cases with haematology (n=1021), immunology (n=1359) and haemostasis disorders (n=1169).
With informed consent, clinical and laboratory data were collected and coded into a single research database using Human Phenotype Ontology (HPO) terms and 13,037 samples of DNA were Illumina WGS analysed to clinical standard at a mean depths >30X in all samples and 90% of the reference genome was covered at 19X minimum in all samples. The pilot resource contains over 165 million unique variants with 91.5%, 8,5% and 5.6% single nucleotide variants (SNVs), short insertions / deletions and large deletions of the 10,258 genetically independent samples with 47% of variants previously unobserved in large scale genome datasets (e.g. TopMED, gnomAD, UK10K). Across all domains 2,067 unique diagnostic-grade genes (DGGs) were curated to clinical standards to support pertinent finding reporting by 12 multi-disciplinary teams (MDTs) with domain-relevant clinical and genetic expertise. Over 1,300 MDT reports assigning pathogenic or likely pathogenic causal variants have been returned to referring clinicians, with the diagnostic yield ranging from 1.6 to 53.8%, depending on the extent of genetic screening pre-enrolment and the importance of the non-genetic component of the disorder (e.g. in immune disorders). About 30% of the causal variants identified have never been reported (absent from the Human Gene Mutation Database v.2018.1); interestingly, 51 variants have been reported in 11 DGGs linked to phenotypes belonging to different domains. A comparison with standard whole exome sequencing results revealed WGS to have at least 12.5% superiority in sensitivity for detecting known pathogenic variants.
For the haematology, immunology and haemostasis domains 330 causal variants were reported in 83 DGGs, revealing novel modes of inheritance (Sivapalaratnam et al Blood 2016), and entire new clinical phenotypes linked to mutations in ABCC4, GNE, KDSR and STIM1 amongst other DGGs. The genotype and HPO-coded phenotypes of all pilot cases were analysed with BeviMed, a rapid and scalable Bayesian association test (Greene et al AJHG 2017) to identify causal variants in hitherto unknown genes. This identified more than 30 genes with posterior probabilities indicating a high likelihood of being implicated in underlying as yet unresolved Mendelian disorders. Results from co-segregation and cell biology studies have already corroborated this statistical inference and 15 novel genes acquired DGG status. Including a new method for the analysis of the 'gene-regulatory' elements we also identified an example of a causal variant in such an element controlling the function of both GATA1 and HDAC6 resulting in a severe syndromic pathology characterized by abnormal erythropoiesis and megakaryopoiesis.
In conclusion, the pilot of the 100,000 Genomes Project has shown the feasibility of using WGS across a national health system, such as the NHS, to deliver a molecular diagnosis for patients with rare inherited diseases and how a national genotype/HPO-coded phenotype resource provides a powerful platform for the identification of novel diagnostic-grade genes. The Rare Diseases Pilot study of the 100,000 Genomes Project had two objectives. Firstly, to identify the DNA variants underlying unresolved Mendelian disorders. Secondly, to develop an accredited framework for delivering whole genome sequencing (WGS) results across a national healthcare system. From February 2014 to June 2017, 13,037 individuals with a rare disease and their relatives were recruited at 57 National Health Service (NHS) hospitals in the UK and 26 non-UK hospitals using standardized eligibility criteria for 12 rare disease domains. This cohort includes cases with haematology (n=1021), immunology (n=1359) and haemostasis disorders (n=1169). With informed consent, clinical and laboratory data were collected and coded into a single research database using Human Phenotype Ontology (HPO) terms and 13,037 samples of DNA were Illumina WGS analysed to clinical standard at a mean depths >30X in all samples and 90% of the reference genome was covered at 19X minimum in all samples. The pilot resource contains over 165 million unique variants with 91.5%, 8,5% and 5.6% single nucleotide variants (SNVs), short insertions / deletions and large deletions of the 10,258 genetically independent samples with 47% of variants previously unobserved in large scale genome datasets (e.g. TopMED, gnomAD, UK10K). Across all domains 2,067 unique diagnostic-grade genes (DGGs) were curated to clinical standards to support pertinent finding reporting by 12 multi-disciplinary teams (MDTs) with domain-relevant clinical and genetic expertise. Over 1,300 MDT reports assigning pathogenic or likely pathogenic causal variants have been returned to referring clinicians, with the diagnostic yield ranging from 1.6 to 53.8%, depending on the extent of genetic screening pre-enrolment and the importance of the non-genetic component of the disorder (e.g. in immune disorders). About 30% of the causal variants identified have never been reported (absent from the Human Gene Mutation Database v.2018.1); interestingly, 51 variants have been reported in 11 DGGs linked to phenotypes belonging to different domains. A comparison with standard whole exome sequencing results revealed WGS to have at least 12.5% superiority in sensitivity for detecting known pathogenic variants. For the haematology, immunology and haemostasis domains 330 causal variants were reported in 83 DGGs, revealing novel modes of inheritance (Sivapalaratnam et al Blood 2016), and entire new clinical phenotypes linked to mutations in ABCC4, GNE, KDSR and STIM1 amongst other DGGs. The genotype and HPO-coded phenotypes of all pilot cases were analysed with BeviMed, a rapid and scalable Bayesian association test (Greene et al AJHG 2017) to identify causal variants in hitherto unknown genes. This identified more than 30 genes with posterior probabilities indicating a high likelihood of being implicated in underlying as yet unresolved Mendelian disorders. Results from co-segregation and cell biology studies have already corroborated this statistical inference and 15 novel genes acquired DGG status. Including a new method for the analysis of the ‘gene-regulatory’ elements we also identified an example of a causal variant in such an element controlling the function of both GATA1 and HDAC6 resulting in a severe syndromic pathology characterized by abnormal erythropoiesis and megakaryopoiesis. In conclusion, the pilot of the 100,000 Genomes Project has shown the feasibility of using WGS across a national health system, such as the NHS, to deliver a molecular diagnosis for patients with rare inherited diseases and how a national genotype/HPO-coded phenotype resource provides a powerful platform for the identification of novel diagnostic-grade genes. No relevant conflicts of interest to declare. |
Author | Bioresource, Nihr Sivapalaratnam, Suthesh |
Author_xml | – sequence: 1 givenname: Suthesh surname: Sivapalaratnam fullname: Sivapalaratnam, Suthesh organization: Center for Immunobiology, QMUL, London, United Kingdom – sequence: 2 givenname: Nihr surname: Bioresource fullname: Bioresource, Nihr organization: NIHR, Cambridge CB2 0PT, United Kingdom |
BookMark | eNqFUctu1DAUtVCRmBb-gIU_YALXHmcSd4FUFfoQFVRtEcvIsW86rhy72G5H87N8C06mKxawsq_O4-rcc0gOfPBIyHsGHxhr-cfehWAqDqytpKwYk7BuX5EFq3lbAXA4IAsAWFdCNuwNOUzpAYCJFa8X5PfdBumNikg_24QqYaLX1oVMhxBpLhgDWBYxPUcfxgmN4QF1PqZn1hvr7xO1nn71Yeup8oZ-w-1ELcR-R0-8crtkEw0DXS1rIenPTXD44kVv8dcTeo2G3qrx0RXNEMNIr1W26HOa_W7QlfG5YFubN_RC4ahycOHeauWW8xxSVtOSiX45jk9-zhKiwZjekteDcgnfvbxH5MfZl7vTi-rq-_nl6clVpVldrmbWfSOxZkILAUavYGh5ywUOjWpl3TdgGt4IKB9hGPRcQwtc6obr1aCGdbM6ImLvq2NIKeLQPUY7qrjrGHRTR93cUTd11EnZ7TsqsuO_ZNrmkjf4HJV1_xN_2ouxBHu2GLuk7XxPG0tDnQn23wZ_AAMNsSY |
CitedBy_id | crossref_primary_10_3389_fgene_2020_527484 |
ContentType | Journal Article |
Copyright | 2018 American Society of Hematology |
Copyright_xml | – notice: 2018 American Society of Hematology |
DBID | 6I. AAFTH AAYXX CITATION |
DOI | 10.1182/blood-2018-99-119068 |
DatabaseName | ScienceDirect Open Access Titles Elsevier:ScienceDirect:Open Access CrossRef |
DatabaseTitle | CrossRef |
DatabaseTitleList | CrossRef |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Medicine Chemistry Biology Anatomy & Physiology |
EISSN | 1528-0020 |
EndPage | 504 |
ExternalDocumentID | 10_1182_blood_2018_99_119068 S000649711936553X |
GroupedDBID | --- -~X .55 1CY 23N 2WC 34G 39C 4.4 53G 5GY 5RE 5VS 6I. 6J9 AAEDW AAFTH AAXUO ABOCM ABVKL ACGFO ADBBV AENEX AFOSN AHPSJ ALMA_UNASSIGNED_HOLDINGS AMRAJ BAWUL BTFSW CS3 DIK DU5 E3Z EBS EJD EX3 F5P FDB FRP GS5 GX1 IH2 K-O KQ8 L7B LSO MJL N9A OK1 P2P R.V RHF RHI ROL SJN THE TR2 TWZ W2D W8F WH7 WOQ WOW X7M YHG YKV ZA5 0R~ AALRI AAYXX ACVFH ADCNI ADVLN AEUPX AFETI AFPUW AGCQF AIGII AITUG AKBMS AKRWK AKYEP CITATION H13 |
ID | FETCH-LOGICAL-c1518-d6b79e514c440dc30f82824ef7a895b70d72740b704d10b2c08029c72c3faf673 |
ISSN | 0006-4971 |
IngestDate | Thu Apr 24 23:09:37 EDT 2025 Tue Jul 01 00:22:48 EDT 2025 Fri Feb 23 02:41:27 EST 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | Supplement 1 |
Language | English |
License | This article is made available under the Elsevier license. |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-c1518-d6b79e514c440dc30f82824ef7a895b70d72740b704d10b2c08029c72c3faf673 |
OpenAccessLink | https://dx.doi.org/10.1182/blood-2018-99-119068 |
PageCount | 1 |
ParticipantIDs | crossref_primary_10_1182_blood_2018_99_119068 crossref_citationtrail_10_1182_blood_2018_99_119068 elsevier_sciencedirect_doi_10_1182_blood_2018_99_119068 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 2018-11-29 |
PublicationDateYYYYMMDD | 2018-11-29 |
PublicationDate_xml | – month: 11 year: 2018 text: 2018-11-29 day: 29 |
PublicationDecade | 2010 |
PublicationTitle | Blood |
PublicationYear | 2018 |
Publisher | Elsevier Inc |
Publisher_xml | – name: Elsevier Inc |
SSID | ssj0014325 |
Score | 2.2793736 |
Snippet | The Rare Diseases Pilot study of the 100,000 Genomes Project had two objectives. Firstly, to identify the DNA variants underlying unresolved Mendelian... |
SourceID | crossref elsevier |
SourceType | Enrichment Source Index Database Publisher |
StartPage | 504 |
Title | The Rare Diseases Pilot for the 100,000 Genomes Project: Findings in Known and New Genes by Analysis of 3,549 Whole Genome Sequenced Samples from Patients and Relatives with Haematological, Haemostasis and Immune Disorders |
URI | https://dx.doi.org/10.1182/blood-2018-99-119068 |
Volume | 132 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnZ1bb9MwFICtMsTlBUHHxLjpPCBe2kDiOInDGxtMFdJQgU3sLcrFEZVKMzUp0vix_Ap-AOfYTppCNRgvURolJ678xT4-PhfGnmVSha7iqVOUReaIEj_FOI8zR6LuIRXOMKGu3nD8Ppycindnwdlg8LPntbRqshf5961xJf_Tq3gN-5WiZK_Qs51QvIDn2L94xB7G4z_38Udy3Xpjtlnq0XQ2r5rOddCjEsxk06fs0hWlZZoawwvZAY5mOqBF-8NS8WvjlUz-jpSKuia9tJ-xxEdJuMobfaaCulYejjTGERuV1pTSDNcmXGVqkrXWNvpxrpOL2zi6SUpZYtshV898eKVCJZVepL2TKWRFdXlB641957ktbq-tQrNvONXPKXn5wnD9ifz1687CfTCrlnZ7wlD_Zdm3cniSwv2sKaQduUOqhmc4VHawpuzaLnc3RvO1uXRF4TTn1g1_5PWG6cCUPLYzvv3152QiKTmtCSDQbYpjbFbsmjpAm7m7f5tTO09HvcaSPNFSEpKSxHFipFxj13mEGh-5EnxY730Jn5u6G_Yf24BPlPJyW1u2K1Q9JenkLrtjVzfw2qB6jw3UYsh2EaOm-noBz0H7G-uNnCG7cdCe3Tpsqw4O2c1j6-yxy34g3kB4Q4s3aLwB8QbsZ0C8x9h-sHCDhfsVtGjDbAEabUCuANEGjTZkF9CiDVUJ_hjBBg22lQUd2GDBBgIbWrC1vA5sILBhE-wx9LDWtxusocP6Pjs9entyOHFsORInR7VYOkWYRTEOXyIXwi1y3y0ll1yoMkplHGSRW-BaQLh4IgrPzTjl8OdxHvHcL1McBP09trOoFuoBgzwVoafI1hFwUfJQloEQApeOkVIoW-0zv-3TJLe5-qlkzDy5jKd95nRPnZtcNX-5P2pxSay-bfToBL-AS598eMU3PWK31x_1Y7bTLFfqCSrzTfZUs_8Luf_zkg |
linkProvider | Colorado Alliance of Research Libraries |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+Rare+Diseases+Pilot+for+the+100%2C000+Genomes+Project%3A+Findings+in+Known+and+New+Genes+by+Analysis+of+3%2C549+Whole+Genome+Sequenced+Samples+from+Patients+and+Relatives+with+Haematological%2C+Haemostasis+and+Immune+Disorders&rft.jtitle=Blood&rft.au=Sivapalaratnam%2C+Suthesh&rft.au=Bioresource%2C+Nihr&rft.date=2018-11-29&rft.issn=0006-4971&rft.eissn=1528-0020&rft.volume=132&rft.issue=Supplement+1&rft.spage=504&rft.epage=504&rft_id=info:doi/10.1182%2Fblood-2018-99-119068&rft.externalDBID=n%2Fa&rft.externalDocID=10_1182_blood_2018_99_119068 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0006-4971&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0006-4971&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0006-4971&client=summon |