The Rare Diseases Pilot for the 100,000 Genomes Project: Findings in Known and New Genes by Analysis of 3,549 Whole Genome Sequenced Samples from Patients and Relatives with Haematological, Haemostasis and Immune Disorders

The Rare Diseases Pilot study of the 100,000 Genomes Project had two objectives. Firstly, to identify the DNA variants underlying unresolved Mendelian disorders. Secondly, to develop an accredited framework for delivering whole genome sequencing (WGS) results across a national healthcare system. Fro...

Full description

Saved in:
Bibliographic Details
Published inBlood Vol. 132; no. Supplement 1; p. 504
Main Authors Sivapalaratnam, Suthesh, Bioresource, Nihr
Format Journal Article
LanguageEnglish
Published Elsevier Inc 29.11.2018
Online AccessGet full text
ISSN0006-4971
1528-0020
DOI10.1182/blood-2018-99-119068

Cover

Abstract The Rare Diseases Pilot study of the 100,000 Genomes Project had two objectives. Firstly, to identify the DNA variants underlying unresolved Mendelian disorders. Secondly, to develop an accredited framework for delivering whole genome sequencing (WGS) results across a national healthcare system. From February 2014 to June 2017, 13,037 individuals with a rare disease and their relatives were recruited at 57 National Health Service (NHS) hospitals in the UK and 26 non-UK hospitals using standardized eligibility criteria for 12 rare disease domains. This cohort includes cases with haematology (n=1021), immunology (n=1359) and haemostasis disorders (n=1169). With informed consent, clinical and laboratory data were collected and coded into a single research database using Human Phenotype Ontology (HPO) terms and 13,037 samples of DNA were Illumina WGS analysed to clinical standard at a mean depths >30X in all samples and 90% of the reference genome was covered at 19X minimum in all samples. The pilot resource contains over 165 million unique variants with 91.5%, 8,5% and 5.6% single nucleotide variants (SNVs), short insertions / deletions and large deletions of the 10,258 genetically independent samples with 47% of variants previously unobserved in large scale genome datasets (e.g. TopMED, gnomAD, UK10K). Across all domains 2,067 unique diagnostic-grade genes (DGGs) were curated to clinical standards to support pertinent finding reporting by 12 multi-disciplinary teams (MDTs) with domain-relevant clinical and genetic expertise. Over 1,300 MDT reports assigning pathogenic or likely pathogenic causal variants have been returned to referring clinicians, with the diagnostic yield ranging from 1.6 to 53.8%, depending on the extent of genetic screening pre-enrolment and the importance of the non-genetic component of the disorder (e.g. in immune disorders). About 30% of the causal variants identified have never been reported (absent from the Human Gene Mutation Database v.2018.1); interestingly, 51 variants have been reported in 11 DGGs linked to phenotypes belonging to different domains. A comparison with standard whole exome sequencing results revealed WGS to have at least 12.5% superiority in sensitivity for detecting known pathogenic variants. For the haematology, immunology and haemostasis domains 330 causal variants were reported in 83 DGGs, revealing novel modes of inheritance (Sivapalaratnam et al Blood 2016), and entire new clinical phenotypes linked to mutations in ABCC4, GNE, KDSR and STIM1 amongst other DGGs. The genotype and HPO-coded phenotypes of all pilot cases were analysed with BeviMed, a rapid and scalable Bayesian association test (Greene et al AJHG 2017) to identify causal variants in hitherto unknown genes. This identified more than 30 genes with posterior probabilities indicating a high likelihood of being implicated in underlying as yet unresolved Mendelian disorders. Results from co-segregation and cell biology studies have already corroborated this statistical inference and 15 novel genes acquired DGG status. Including a new method for the analysis of the ‘gene-regulatory’ elements we also identified an example of a causal variant in such an element controlling the function of both GATA1 and HDAC6 resulting in a severe syndromic pathology characterized by abnormal erythropoiesis and megakaryopoiesis. In conclusion, the pilot of the 100,000 Genomes Project has shown the feasibility of using WGS across a national health system, such as the NHS, to deliver a molecular diagnosis for patients with rare inherited diseases and how a national genotype/HPO-coded phenotype resource provides a powerful platform for the identification of novel diagnostic-grade genes. No relevant conflicts of interest to declare.
AbstractList The Rare Diseases Pilot study of the 100,000 Genomes Project had two objectives. Firstly, to identify the DNA variants underlying unresolved Mendelian disorders. Secondly, to develop an accredited framework for delivering whole genome sequencing (WGS) results across a national healthcare system. From February 2014 to June 2017, 13,037 individuals with a rare disease and their relatives were recruited at 57 National Health Service (NHS) hospitals in the UK and 26 non-UK hospitals using standardized eligibility criteria for 12 rare disease domains. This cohort includes cases with haematology (n=1021), immunology (n=1359) and haemostasis disorders (n=1169). With informed consent, clinical and laboratory data were collected and coded into a single research database using Human Phenotype Ontology (HPO) terms and 13,037 samples of DNA were Illumina WGS analysed to clinical standard at a mean depths >30X in all samples and 90% of the reference genome was covered at 19X minimum in all samples. The pilot resource contains over 165 million unique variants with 91.5%, 8,5% and 5.6% single nucleotide variants (SNVs), short insertions / deletions and large deletions of the 10,258 genetically independent samples with 47% of variants previously unobserved in large scale genome datasets (e.g. TopMED, gnomAD, UK10K). Across all domains 2,067 unique diagnostic-grade genes (DGGs) were curated to clinical standards to support pertinent finding reporting by 12 multi-disciplinary teams (MDTs) with domain-relevant clinical and genetic expertise. Over 1,300 MDT reports assigning pathogenic or likely pathogenic causal variants have been returned to referring clinicians, with the diagnostic yield ranging from 1.6 to 53.8%, depending on the extent of genetic screening pre-enrolment and the importance of the non-genetic component of the disorder (e.g. in immune disorders). About 30% of the causal variants identified have never been reported (absent from the Human Gene Mutation Database v.2018.1); interestingly, 51 variants have been reported in 11 DGGs linked to phenotypes belonging to different domains. A comparison with standard whole exome sequencing results revealed WGS to have at least 12.5% superiority in sensitivity for detecting known pathogenic variants. For the haematology, immunology and haemostasis domains 330 causal variants were reported in 83 DGGs, revealing novel modes of inheritance (Sivapalaratnam et al Blood 2016), and entire new clinical phenotypes linked to mutations in ABCC4, GNE, KDSR and STIM1 amongst other DGGs. The genotype and HPO-coded phenotypes of all pilot cases were analysed with BeviMed, a rapid and scalable Bayesian association test (Greene et al AJHG 2017) to identify causal variants in hitherto unknown genes. This identified more than 30 genes with posterior probabilities indicating a high likelihood of being implicated in underlying as yet unresolved Mendelian disorders. Results from co-segregation and cell biology studies have already corroborated this statistical inference and 15 novel genes acquired DGG status. Including a new method for the analysis of the 'gene-regulatory' elements we also identified an example of a causal variant in such an element controlling the function of both GATA1 and HDAC6 resulting in a severe syndromic pathology characterized by abnormal erythropoiesis and megakaryopoiesis. In conclusion, the pilot of the 100,000 Genomes Project has shown the feasibility of using WGS across a national health system, such as the NHS, to deliver a molecular diagnosis for patients with rare inherited diseases and how a national genotype/HPO-coded phenotype resource provides a powerful platform for the identification of novel diagnostic-grade genes.
The Rare Diseases Pilot study of the 100,000 Genomes Project had two objectives. Firstly, to identify the DNA variants underlying unresolved Mendelian disorders. Secondly, to develop an accredited framework for delivering whole genome sequencing (WGS) results across a national healthcare system. From February 2014 to June 2017, 13,037 individuals with a rare disease and their relatives were recruited at 57 National Health Service (NHS) hospitals in the UK and 26 non-UK hospitals using standardized eligibility criteria for 12 rare disease domains. This cohort includes cases with haematology (n=1021), immunology (n=1359) and haemostasis disorders (n=1169). With informed consent, clinical and laboratory data were collected and coded into a single research database using Human Phenotype Ontology (HPO) terms and 13,037 samples of DNA were Illumina WGS analysed to clinical standard at a mean depths >30X in all samples and 90% of the reference genome was covered at 19X minimum in all samples. The pilot resource contains over 165 million unique variants with 91.5%, 8,5% and 5.6% single nucleotide variants (SNVs), short insertions / deletions and large deletions of the 10,258 genetically independent samples with 47% of variants previously unobserved in large scale genome datasets (e.g. TopMED, gnomAD, UK10K). Across all domains 2,067 unique diagnostic-grade genes (DGGs) were curated to clinical standards to support pertinent finding reporting by 12 multi-disciplinary teams (MDTs) with domain-relevant clinical and genetic expertise. Over 1,300 MDT reports assigning pathogenic or likely pathogenic causal variants have been returned to referring clinicians, with the diagnostic yield ranging from 1.6 to 53.8%, depending on the extent of genetic screening pre-enrolment and the importance of the non-genetic component of the disorder (e.g. in immune disorders). About 30% of the causal variants identified have never been reported (absent from the Human Gene Mutation Database v.2018.1); interestingly, 51 variants have been reported in 11 DGGs linked to phenotypes belonging to different domains. A comparison with standard whole exome sequencing results revealed WGS to have at least 12.5% superiority in sensitivity for detecting known pathogenic variants. For the haematology, immunology and haemostasis domains 330 causal variants were reported in 83 DGGs, revealing novel modes of inheritance (Sivapalaratnam et al Blood 2016), and entire new clinical phenotypes linked to mutations in ABCC4, GNE, KDSR and STIM1 amongst other DGGs. The genotype and HPO-coded phenotypes of all pilot cases were analysed with BeviMed, a rapid and scalable Bayesian association test (Greene et al AJHG 2017) to identify causal variants in hitherto unknown genes. This identified more than 30 genes with posterior probabilities indicating a high likelihood of being implicated in underlying as yet unresolved Mendelian disorders. Results from co-segregation and cell biology studies have already corroborated this statistical inference and 15 novel genes acquired DGG status. Including a new method for the analysis of the ‘gene-regulatory’ elements we also identified an example of a causal variant in such an element controlling the function of both GATA1 and HDAC6 resulting in a severe syndromic pathology characterized by abnormal erythropoiesis and megakaryopoiesis. In conclusion, the pilot of the 100,000 Genomes Project has shown the feasibility of using WGS across a national health system, such as the NHS, to deliver a molecular diagnosis for patients with rare inherited diseases and how a national genotype/HPO-coded phenotype resource provides a powerful platform for the identification of novel diagnostic-grade genes. No relevant conflicts of interest to declare.
Author Bioresource, Nihr
Sivapalaratnam, Suthesh
Author_xml – sequence: 1
  givenname: Suthesh
  surname: Sivapalaratnam
  fullname: Sivapalaratnam, Suthesh
  organization: Center for Immunobiology, QMUL, London, United Kingdom
– sequence: 2
  givenname: Nihr
  surname: Bioresource
  fullname: Bioresource, Nihr
  organization: NIHR, Cambridge CB2 0PT, United Kingdom
BookMark eNqFUctu1DAUtVCRmBb-gIU_YALXHmcSd4FUFfoQFVRtEcvIsW86rhy72G5H87N8C06mKxawsq_O4-rcc0gOfPBIyHsGHxhr-cfehWAqDqytpKwYk7BuX5EFq3lbAXA4IAsAWFdCNuwNOUzpAYCJFa8X5PfdBumNikg_24QqYaLX1oVMhxBpLhgDWBYxPUcfxgmN4QF1PqZn1hvr7xO1nn71Yeup8oZ-w-1ELcR-R0-8crtkEw0DXS1rIenPTXD44kVv8dcTeo2G3qrx0RXNEMNIr1W26HOa_W7QlfG5YFubN_RC4ahycOHeauWW8xxSVtOSiX45jk9-zhKiwZjekteDcgnfvbxH5MfZl7vTi-rq-_nl6clVpVldrmbWfSOxZkILAUavYGh5ywUOjWpl3TdgGt4IKB9hGPRcQwtc6obr1aCGdbM6ImLvq2NIKeLQPUY7qrjrGHRTR93cUTd11EnZ7TsqsuO_ZNrmkjf4HJV1_xN_2ouxBHu2GLuk7XxPG0tDnQn23wZ_AAMNsSY
CitedBy_id crossref_primary_10_3389_fgene_2020_527484
ContentType Journal Article
Copyright 2018 American Society of Hematology
Copyright_xml – notice: 2018 American Society of Hematology
DBID 6I.
AAFTH
AAYXX
CITATION
DOI 10.1182/blood-2018-99-119068
DatabaseName ScienceDirect Open Access Titles
Elsevier:ScienceDirect:Open Access
CrossRef
DatabaseTitle CrossRef
DatabaseTitleList CrossRef

DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Chemistry
Biology
Anatomy & Physiology
EISSN 1528-0020
EndPage 504
ExternalDocumentID 10_1182_blood_2018_99_119068
S000649711936553X
GroupedDBID ---
-~X
.55
1CY
23N
2WC
34G
39C
4.4
53G
5GY
5RE
5VS
6I.
6J9
AAEDW
AAFTH
AAXUO
ABOCM
ABVKL
ACGFO
ADBBV
AENEX
AFOSN
AHPSJ
ALMA_UNASSIGNED_HOLDINGS
AMRAJ
BAWUL
BTFSW
CS3
DIK
DU5
E3Z
EBS
EJD
EX3
F5P
FDB
FRP
GS5
GX1
IH2
K-O
KQ8
L7B
LSO
MJL
N9A
OK1
P2P
R.V
RHF
RHI
ROL
SJN
THE
TR2
TWZ
W2D
W8F
WH7
WOQ
WOW
X7M
YHG
YKV
ZA5
0R~
AALRI
AAYXX
ACVFH
ADCNI
ADVLN
AEUPX
AFETI
AFPUW
AGCQF
AIGII
AITUG
AKBMS
AKRWK
AKYEP
CITATION
H13
ID FETCH-LOGICAL-c1518-d6b79e514c440dc30f82824ef7a895b70d72740b704d10b2c08029c72c3faf673
ISSN 0006-4971
IngestDate Thu Apr 24 23:09:37 EDT 2025
Tue Jul 01 00:22:48 EDT 2025
Fri Feb 23 02:41:27 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue Supplement 1
Language English
License This article is made available under the Elsevier license.
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c1518-d6b79e514c440dc30f82824ef7a895b70d72740b704d10b2c08029c72c3faf673
OpenAccessLink https://dx.doi.org/10.1182/blood-2018-99-119068
PageCount 1
ParticipantIDs crossref_primary_10_1182_blood_2018_99_119068
crossref_citationtrail_10_1182_blood_2018_99_119068
elsevier_sciencedirect_doi_10_1182_blood_2018_99_119068
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2018-11-29
PublicationDateYYYYMMDD 2018-11-29
PublicationDate_xml – month: 11
  year: 2018
  text: 2018-11-29
  day: 29
PublicationDecade 2010
PublicationTitle Blood
PublicationYear 2018
Publisher Elsevier Inc
Publisher_xml – name: Elsevier Inc
SSID ssj0014325
Score 2.2793736
Snippet The Rare Diseases Pilot study of the 100,000 Genomes Project had two objectives. Firstly, to identify the DNA variants underlying unresolved Mendelian...
SourceID crossref
elsevier
SourceType Enrichment Source
Index Database
Publisher
StartPage 504
Title The Rare Diseases Pilot for the 100,000 Genomes Project: Findings in Known and New Genes by Analysis of 3,549 Whole Genome Sequenced Samples from Patients and Relatives with Haematological, Haemostasis and Immune Disorders
URI https://dx.doi.org/10.1182/blood-2018-99-119068
Volume 132
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnZ1bb9MwFICtMsTlBUHHxLjpPCBe2kDiOInDGxtMFdJQgU3sLcrFEZVKMzUp0vix_Ap-AOfYTppCNRgvURolJ678xT4-PhfGnmVSha7iqVOUReaIEj_FOI8zR6LuIRXOMKGu3nD8Ppycindnwdlg8LPntbRqshf5961xJf_Tq3gN-5WiZK_Qs51QvIDn2L94xB7G4z_38Udy3Xpjtlnq0XQ2r5rOddCjEsxk06fs0hWlZZoawwvZAY5mOqBF-8NS8WvjlUz-jpSKuia9tJ-xxEdJuMobfaaCulYejjTGERuV1pTSDNcmXGVqkrXWNvpxrpOL2zi6SUpZYtshV898eKVCJZVepL2TKWRFdXlB641957ktbq-tQrNvONXPKXn5wnD9ifz1687CfTCrlnZ7wlD_Zdm3cniSwv2sKaQduUOqhmc4VHawpuzaLnc3RvO1uXRF4TTn1g1_5PWG6cCUPLYzvv3152QiKTmtCSDQbYpjbFbsmjpAm7m7f5tTO09HvcaSPNFSEpKSxHFipFxj13mEGh-5EnxY730Jn5u6G_Yf24BPlPJyW1u2K1Q9JenkLrtjVzfw2qB6jw3UYsh2EaOm-noBz0H7G-uNnCG7cdCe3Tpsqw4O2c1j6-yxy34g3kB4Q4s3aLwB8QbsZ0C8x9h-sHCDhfsVtGjDbAEabUCuANEGjTZkF9CiDVUJ_hjBBg22lQUd2GDBBgIbWrC1vA5sILBhE-wx9LDWtxusocP6Pjs9entyOHFsORInR7VYOkWYRTEOXyIXwi1y3y0ll1yoMkplHGSRW-BaQLh4IgrPzTjl8OdxHvHcL1McBP09trOoFuoBgzwVoafI1hFwUfJQloEQApeOkVIoW-0zv-3TJLe5-qlkzDy5jKd95nRPnZtcNX-5P2pxSay-bfToBL-AS598eMU3PWK31x_1Y7bTLFfqCSrzTfZUs_8Luf_zkg
linkProvider Colorado Alliance of Research Libraries
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+Rare+Diseases+Pilot+for+the+100%2C000+Genomes+Project%3A+Findings+in+Known+and+New+Genes+by+Analysis+of+3%2C549+Whole+Genome+Sequenced+Samples+from+Patients+and+Relatives+with+Haematological%2C+Haemostasis+and+Immune+Disorders&rft.jtitle=Blood&rft.au=Sivapalaratnam%2C+Suthesh&rft.au=Bioresource%2C+Nihr&rft.date=2018-11-29&rft.issn=0006-4971&rft.eissn=1528-0020&rft.volume=132&rft.issue=Supplement+1&rft.spage=504&rft.epage=504&rft_id=info:doi/10.1182%2Fblood-2018-99-119068&rft.externalDBID=n%2Fa&rft.externalDocID=10_1182_blood_2018_99_119068
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0006-4971&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0006-4971&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0006-4971&client=summon