Computational analysis of 10,860 phenotypic annotations in individuals with SCN2A-related disorders

Pathogenic variants in SCN2A cause a wide range of neurodevelopmental phenotypes. Reports of genotype–phenotype correlations are often anecdotal, and the available phenotypic data have not been systematically analyzed. We extracted phenotypic information from primary descriptions of SCN2A-related di...

Full description

Saved in:
Bibliographic Details
Published inGenetics in medicine Vol. 23; no. 7; pp. 1263 - 1272
Main Authors Crawford, Katherine, Xian, Julie, Helbig, Katherine L., Galer, Peter D., Parthasarathy, Shridhar, Lewis-Smith, David, Kaufman, Michael C., Fitch, Eryn, Ganesan, Shiva, O’Brien, Margaret, Codoni, Veronica, Ellis, Colin A., Conway, Laura J., Taylor, Deanne, Krause, Roland, Helbig, Ingo
Format Journal Article
LanguageEnglish
Published New York Elsevier Inc 01.07.2021
Nature Publishing Group US
Elsevier Limited
Subjects
Online AccessGet full text
ISSN1098-3600
1530-0366
1530-0366
DOI10.1038/s41436-021-01120-1

Cover

More Information
Summary:Pathogenic variants in SCN2A cause a wide range of neurodevelopmental phenotypes. Reports of genotype–phenotype correlations are often anecdotal, and the available phenotypic data have not been systematically analyzed. We extracted phenotypic information from primary descriptions of SCN2A-related disorders in the literature between 2001 and 2019, which we coded in Human Phenotype Ontology (HPO) terms. With higher-level phenotype terms inferred by the HPO structure, we assessed the frequencies of clinical features and investigated the association of these features with variant classes and locations within the NaV1.2 protein. We identified 413 unrelated individuals and derived a total of 10,860 HPO terms with 562 unique terms. Protein-truncating variants were associated with autism and behavioral abnormalities. Missense variants were associated with neonatal onset, epileptic spasms, and seizures, regardless of type. Phenotypic similarity was identified in 8/62 recurrent SCN2A variants. Three independent principal components accounted for 33% of the phenotypic variance, allowing for separation of gain-of-function versus loss-of-function variants with good performance. Our work shows that translating clinical features into a computable format using a standardized language allows for quantitative phenotype analysis, mapping the phenotypic landscape of SCN2A-related disorders in unprecedented detail and revealing genotype–phenotype correlations along a multidimensional spectrum.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:1098-3600
1530-0366
1530-0366
DOI:10.1038/s41436-021-01120-1