P261 Early and accurate diagnosis of patients with axial spondyloarthritis using machine learning: a predictive analysis from electronic health records in the United Kingdom

Abstract Background/Aims On average, there is a delay of 6.7 years between symptom onset and diagnosis of axial spondyloarthritis (axSpA). Since traditional approaches to improving early axSpA identification have had limited success, predictive automated analyses using patient records may help allev...

Full description

Saved in:
Bibliographic Details
Published inRheumatology (Oxford, England) Vol. 61; no. Supplement_1
Main Authors Sengupta, Raj, Narasimham, Shruti, Mato, Borja S, Meglic, Matic, Perella, Chiara, Pamies, Paula, Emery, Paul
Format Journal Article
LanguageEnglish
Published Oxford University Press 23.04.2022
Online AccessGet full text
ISSN1462-0324
1462-0332
1462-0332
DOI10.1093/rheumatology/keac133.260

Cover

Abstract Abstract Background/Aims On average, there is a delay of 6.7 years between symptom onset and diagnosis of axial spondyloarthritis (axSpA). Since traditional approaches to improving early axSpA identification have had limited success, predictive automated analyses using patient records may help alleviate the burden on healthcare providers. We report results from a machine learning (ML) algorithm developed with UK electronic health records (EHRs) Clinical Practice Research Datalink (CPRD) data to estimate the probability or likelihood of a patient being diagnosed with axSpA based on prior clinical indicators and patient history. Methods Primary care UK EHR data - CPRD GOLD was used to identify patients with axSpA and healthy controls (HC). Patients aged ≥18 years with first diagnosis date of axSpA within the identification period (01-Jan-2005 to 31-Dec-2018) and fulfilling CPRD research acceptability criteria were included. Data pertaining to clinical presentation, consultation, referral, test, and therapy history were extracted for individual patients prior to diagnosis of axSpA. A total of 5,090 patients with axSpA satisfied the acceptability criteria. HC were randomly sampled to create a subset of one unique HC matched to each patient with axSpA, resulting in 5,089 HC. ML usable features derived from the total population (patients with axSpA and HC) numbered 820. After using a further exclusion criterion for the patients with axSpA and HC who had ≥1 of 820 usable features, the final dataset included 7,813 patients (3,902 with axSpA and 3,911 HC). This combined dataset was randomly split (67:33) into a train (n = 5237) and a test (n = 2576) dataset. A random forest (RF) model was trained on the train dataset. Cross-validation was performed for hyper-parameter tuning of the RF classifier. Once the model was trained, accuracy, precision, and F-1 scores were obtained with the test dataset. Results The RF-based algorithm resulted in a high level of accuracy (88.12%), with precision of 0.95 for patients with axSpA and 0.83 for HC. The RF algorithm identified 89 best clinical predictors (out of 820 used as inputs) that differentiated between patient and HC such as: total number of tests, total number of referrals, first age of consultation, first symptom age, and number of low back pain symptoms. The model sensitivity was 0.75 and positive predictive value was 80.88%. The model specificity was 0.96 and negative predictive value was 82.56%. Conclusion The ML algorithm demonstrated a high level of accuracy and precision in the identification of possible cases of axSpA, which may be useful in reducing the delay in diagnosis. Previous studies have successfully demonstrated automated cohort identification of axSpA in large datasets, with only a few using ML-based approaches for diagnosis from patient medical history. While our model supports previous work in axSpA, it needs further validation in routine clinical practice (exploration ongoing). Disclosure R. Sengupta: Honoraria; AbbVie, Biogen, Celgene, Lilly, MSD, Novartis, Roche, UCB. Grants/research support; AbbVie, Celgene, Novartis, UCB. S. Narasimham: Shareholder/stock ownership; Novartis. Other; Employee of Novartis. B.S. Mato: Shareholder/stock ownership; Novartis. Other; Employee of Novartis. M. Meglic: Other; Employee of Novartis. C. Perella: Other; Employee of Novartis. P. Pamies: Other; Employee of Novartis. P. Emery: Consultancies; AbbVie, Astra-Zeneca, BMS, Boehringer Ingelheim, Celltrion, Gilead, Janssen, MSD, Lilly, Novartis, Pfizer, Roche, Samsung, UCB. Grants/research support; AbbVie, BMS, Lilly, Novartis, Pfizer, Roche, Samsung.
AbstractList Abstract Background/Aims On average, there is a delay of 6.7 years between symptom onset and diagnosis of axial spondyloarthritis (axSpA). Since traditional approaches to improving early axSpA identification have had limited success, predictive automated analyses using patient records may help alleviate the burden on healthcare providers. We report results from a machine learning (ML) algorithm developed with UK electronic health records (EHRs) Clinical Practice Research Datalink (CPRD) data to estimate the probability or likelihood of a patient being diagnosed with axSpA based on prior clinical indicators and patient history. Methods Primary care UK EHR data - CPRD GOLD was used to identify patients with axSpA and healthy controls (HC). Patients aged ≥18 years with first diagnosis date of axSpA within the identification period (01-Jan-2005 to 31-Dec-2018) and fulfilling CPRD research acceptability criteria were included. Data pertaining to clinical presentation, consultation, referral, test, and therapy history were extracted for individual patients prior to diagnosis of axSpA. A total of 5,090 patients with axSpA satisfied the acceptability criteria. HC were randomly sampled to create a subset of one unique HC matched to each patient with axSpA, resulting in 5,089 HC. ML usable features derived from the total population (patients with axSpA and HC) numbered 820. After using a further exclusion criterion for the patients with axSpA and HC who had ≥1 of 820 usable features, the final dataset included 7,813 patients (3,902 with axSpA and 3,911 HC). This combined dataset was randomly split (67:33) into a train (n = 5237) and a test (n = 2576) dataset. A random forest (RF) model was trained on the train dataset. Cross-validation was performed for hyper-parameter tuning of the RF classifier. Once the model was trained, accuracy, precision, and F-1 scores were obtained with the test dataset. Results The RF-based algorithm resulted in a high level of accuracy (88.12%), with precision of 0.95 for patients with axSpA and 0.83 for HC. The RF algorithm identified 89 best clinical predictors (out of 820 used as inputs) that differentiated between patient and HC such as: total number of tests, total number of referrals, first age of consultation, first symptom age, and number of low back pain symptoms. The model sensitivity was 0.75 and positive predictive value was 80.88%. The model specificity was 0.96 and negative predictive value was 82.56%. Conclusion The ML algorithm demonstrated a high level of accuracy and precision in the identification of possible cases of axSpA, which may be useful in reducing the delay in diagnosis. Previous studies have successfully demonstrated automated cohort identification of axSpA in large datasets, with only a few using ML-based approaches for diagnosis from patient medical history. While our model supports previous work in axSpA, it needs further validation in routine clinical practice (exploration ongoing). Disclosure R. Sengupta: Honoraria; AbbVie, Biogen, Celgene, Lilly, MSD, Novartis, Roche, UCB. Grants/research support; AbbVie, Celgene, Novartis, UCB. S. Narasimham: Shareholder/stock ownership; Novartis. Other; Employee of Novartis. B.S. Mato: Shareholder/stock ownership; Novartis. Other; Employee of Novartis. M. Meglic: Other; Employee of Novartis. C. Perella: Other; Employee of Novartis. P. Pamies: Other; Employee of Novartis. P. Emery: Consultancies; AbbVie, Astra-Zeneca, BMS, Boehringer Ingelheim, Celltrion, Gilead, Janssen, MSD, Lilly, Novartis, Pfizer, Roche, Samsung, UCB. Grants/research support; AbbVie, BMS, Lilly, Novartis, Pfizer, Roche, Samsung.
Author Pamies, Paula
Meglic, Matic
Sengupta, Raj
Narasimham, Shruti
Perella, Chiara
Mato, Borja S
Emery, Paul
Author_xml – sequence: 1
  givenname: Raj
  surname: Sengupta
  fullname: Sengupta, Raj
– sequence: 2
  givenname: Shruti
  surname: Narasimham
  fullname: Narasimham, Shruti
– sequence: 3
  givenname: Borja S
  surname: Mato
  fullname: Mato, Borja S
– sequence: 4
  givenname: Matic
  surname: Meglic
  fullname: Meglic, Matic
– sequence: 5
  givenname: Chiara
  surname: Perella
  fullname: Perella, Chiara
– sequence: 6
  givenname: Paula
  surname: Pamies
  fullname: Pamies, Paula
– sequence: 7
  givenname: Paul
  surname: Emery
  fullname: Emery, Paul
BookMark eNqVkNFKHTEQhkOxUD3tO8wLHE02u-vZXhSKWCsK9qJeL2MyOZs2myxJtrp3Qh_Id_JJjBwppVf2aoZh_u-H74Dt-eCJMRD8UPBOHsWB5hFzcGG7HP0kVELKw6rlb9i-qNtqzaWs9v7sVf2OHaT0g3PeCLnZZw_fqlY83v8-xegWQK8BlZojZgJtcetDsgmCgQmzJZ8T3No8AN5ZdJCm4PXiAsY8RJvL45ys38KIarCewBFGXw4fAWGKpK3K9heVEnTLM9bEMAI5UjkGbxUMhK7AI6kQdQLrIQ8E195m0nBRQDqM79lbgy7Rh5e5YtdfTr-ffF1fXp2dn3y-XCvRbfha35iN4Cga0dZIx1ILhTd1axSaY91KLVtBQpq6Ew01RsimNU3Nibqiy2hDcsW6HXf2Ey636Fw_RTtiXHrB-2fz_d_m-xfzfTFfsptdVsWQUiTzP9FP_0SVzUV98Dmida8ByB0gzNPra58AUWPAxw
CitedBy_id crossref_primary_10_3390_s23094455
crossref_primary_10_1016_j_berh_2023_101870
ContentType Journal Article
Copyright The Author(s) 2022. Published by Oxford University Press on behalf of the British Society for Rheumatology. All rights reserved. For permissions, please email: journals.permissions@oup.com. 2022
Copyright_xml – notice: The Author(s) 2022. Published by Oxford University Press on behalf of the British Society for Rheumatology. All rights reserved. For permissions, please email: journals.permissions@oup.com. 2022
DBID AAYXX
CITATION
ADTOC
UNPAY
DOI 10.1093/rheumatology/keac133.260
DatabaseName CrossRef
Unpaywall for CDI: Periodical Content
Unpaywall
DatabaseTitle CrossRef
DatabaseTitleList
Database_xml – sequence: 1
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
DocumentTitleAlternate British Society for Rheumatology Annual Conference 2022 Abstracts
EISSN 1462-0332
ExternalDocumentID 10.1093/rheumatology/keac133.260
10_1093_rheumatology_keac133_260
GroupedDBID ---
-E4
.2P
.GJ
.I3
.XZ
.ZR
08P
0R~
18M
1TH
29P
2WC
354
3O-
4.4
48X
53G
5RE
5VS
5WA
5WD
70D
AABZA
AACZT
AAGKA
AAJKP
AAJQQ
AAMDB
AAMVS
AAOGV
AAPGJ
AAPNW
AAPQZ
AAPXW
AARHZ
AASNB
AAUAY
AAUQX
AAVAP
AAWDT
AAWTL
ABEUO
ABIXL
ABJNI
ABKDP
ABLJU
ABNHQ
ABNKS
ABOCM
ABPTD
ABQLI
ABQNK
ABQTQ
ABXVV
ABZBJ
ACFRR
ACGFO
ACGFS
ACPRK
ACUFI
ACUTJ
ACUTO
ACYHN
ADBBV
ADEYI
ADEZT
ADGZP
ADHKW
ADHZD
ADIPN
ADJQC
ADOCK
ADQBN
ADRIX
ADRTK
ADVEK
ADYVW
ADZXQ
AEGPL
AEGXH
AEJOX
AEKSI
AEMDU
AENEX
AENZO
AEPUE
AETBJ
AEWNT
AFFNX
AFFZL
AFIYH
AFOFC
AFXAL
AFXEN
AGINJ
AGKEF
AGSYK
AGUTN
AHMBA
AHXPO
AIAGR
AIJHB
AJEEA
AKWXX
ALMA_UNASSIGNED_HOLDINGS
ALUQC
APIBT
APWMN
AQDSO
ATGXG
ATTQO
AXUDD
AZFZN
BAWUL
BAYMD
BCRHZ
BEYMZ
BHONS
BTRTY
BVRKM
C45
CAG
CDBKE
COF
CS3
CZ4
DAKXR
DIK
DILTD
DU5
D~K
E3Z
EBD
EBS
EE~
EIHJH
EJD
EMOBN
ENERS
F5P
F9B
FECEO
FEDTE
FLUFQ
FOEOM
FOTVD
FQBLK
GAUVT
GJXCC
GX1
H13
H5~
HAR
HVGLF
HW0
HZ~
IOX
J21
KAQDR
KBUDW
KC5
KOP
KQ8
KSI
KSN
L7B
M-Z
M49
MHKGH
N9A
NGC
NOMLY
NOYVH
NTWIH
NU-
NVLIB
O0~
O9-
OAUYM
OAWHX
OBH
OCZFY
ODMLO
OHH
OJQWA
OJZSN
OK1
OPAEJ
OVD
OWPYF
O~Y
P2P
PAFKI
PB-
PEELM
PQQKQ
Q1.
Q5Y
R44
RD5
RHF
RIG
RNI
ROL
ROX
RUSNO
RW1
RXO
RZF
RZO
SV3
TCURE
TEORI
TJX
TMA
TR2
VVN
W8F
WOQ
X7H
YAYTL
YKOAZ
YXANX
ZGI
ZKX
ZY1
~91
AAYXX
ABDFA
ABEJV
ABGNP
ABVGC
ADNBA
AEMQT
AHGBF
AHMMS
AJBYB
AJNCP
ALXQX
CITATION
JXSIZ
ABNGD
ACUKT
ADMTO
ADTOC
AEHUL
AGORE
AGQPQ
UNPAY
ID FETCH-LOGICAL-c1980-dbf810a15164ae73d1cab46fcaf7d63d361e13f4915e5f1356f540ee9033fdfe3
IEDL.DBID UNPAY
ISSN 1462-0324
1462-0332
IngestDate Tue Aug 19 21:46:24 EDT 2025
Thu Apr 24 22:59:07 EDT 2025
Wed Oct 01 04:43:24 EDT 2025
Wed Aug 28 03:19:00 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue Supplement_1
Language English
License This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)
https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c1980-dbf810a15164ae73d1cab46fcaf7d63d361e13f4915e5f1356f540ee9033fdfe3
OpenAccessLink https://proxy.k.utb.cz/login?url=https://academic.oup.com/rheumatology/article-pdf/61/Supplement_1/keac133.260/43436175/keac133.260.pdf
ParticipantIDs unpaywall_primary_10_1093_rheumatology_keac133_260
crossref_primary_10_1093_rheumatology_keac133_260
crossref_citationtrail_10_1093_rheumatology_keac133_260
oup_primary_10_1093_rheumatology_keac133_260
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2022-04-23
PublicationDateYYYYMMDD 2022-04-23
PublicationDate_xml – month: 04
  year: 2022
  text: 2022-04-23
  day: 23
PublicationDecade 2020
PublicationTitle Rheumatology (Oxford, England)
PublicationYear 2022
Publisher Oxford University Press
Publisher_xml – name: Oxford University Press
SSID ssj0005138
Score 2.3756204
Snippet Abstract Background/Aims On average, there is a delay of 6.7 years between symptom onset and diagnosis of axial spondyloarthritis (axSpA). Since traditional...
SourceID unpaywall
crossref
oup
SourceType Open Access Repository
Enrichment Source
Index Database
Publisher
Title P261 Early and accurate diagnosis of patients with axial spondyloarthritis using machine learning: a predictive analysis from electronic health records in the United Kingdom
URI https://academic.oup.com/rheumatology/article-pdf/61/Supplement_1/keac133.260/43436175/keac133.260.pdf
UnpaywallVersion publishedVersion
Volume 61
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAFT
  databaseName: Open Access Digital Library
  customDbUrl:
  eissn: 1462-0332
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0005138
  issn: 1462-0324
  databaseCode: KQ8
  dateStart: 19960101
  isFulltext: true
  titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html
  providerName: Colorado Alliance of Research Libraries
– providerCode: PRVBFR
  databaseName: Free Medical Journals
  customDbUrl:
  eissn: 1462-0332
  dateEnd: 20241101
  omitProxy: true
  ssIdentifier: ssj0005138
  issn: 1462-0324
  databaseCode: DIK
  dateStart: 19960101
  isFulltext: true
  titleUrlDefault: http://www.freemedicaljournals.com
  providerName: Flying Publisher
– providerCode: PRVFQY
  databaseName: GFMER Free Medical Journals
  customDbUrl:
  eissn: 1462-0332
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0005138
  issn: 1462-0324
  databaseCode: GX1
  dateStart: 0
  isFulltext: true
  titleUrlDefault: http://www.gfmer.ch/Medical_journals/Free_medical.php
  providerName: Geneva Foundation for Medical Education and Research
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3NbtQwELboVoITUH5EK6jm0CNO4tpxkt4qRKmQWvXASsspsmO7rbrNrnY3gvZUiQOPwzvxJIxjb9lyKkjcoihjx__fjGe-IWSHZ97kLkpa6NxRIRpBVcEdNVmuG585rerpi4-O5eFQfBzlo-ht4WNhVPQKT5YhDbMz2yF2623MaexNOjUulSztE1_2VrSapRe4haDClSBAT324JB7N-erLBIXWyLrMEbMPyPrw-GT_cwg92qUZD9lvwzNfuvygtn-3_pXi7pxjITbuUddO1dUXNR6vHFEHT8j3ZeOCZ8pF0i100lz_wfv4_1v_lDyOKBf2Q1Eb5IFtn5GHR_Ee_zn5cYIq1c-bbz29MqjWgGqazrNWgAm-f-dzmDiIrK9z8OZiUF9xsYD36TVX4wn-51nPyATed_8ULnu3UAsxD8bpHiiYznyVfkfHSgL7CviAGvid_QdCJCgEa9UczltATAwBi4NP-WImly_I8OD9p3eHNKaOoA2ryowa7UqWKYQzUihbcMMapYV0jXKFkdxg_1jGnahYbnPHeC4dQldrKxx9Z5zlL8mgnbT2FQFRFcxq3Au1QOVTS0TEhfXhuFKWZaXlJimWE6FuIq-6T-8xrsP9Pq9XB7GOY1LjmGwSdis5Ddwi95B5i3PjLz7fvZ2U9xba-heh12SwmHX2DUK0hd4max9GbDuusl8k7UW3
linkProvider Unpaywall
linkToUnpaywall http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwELZgK8GJN2oRoDlwxElcO07CrUJUFVKrHlipnCI_26rb7Gp3I9qekDjwc_hP_BLGsbdsORUkblGUseP3N-OZbwh5w4tgchc1rXTpqRBGUFVxT21RahMypzUDffH-gdwbi49H5VHytgixMCp5hWerkIb5iesRuw025jz1Jp1Zn0uWD4kvBytay_Iz3EJQ4coQoOchXBKP5nL9ZYZCd8mGLBGzj8jG-OBw53MMPdqmBY_Zb-MzX7n8oLZ_s_614m6cYzE27n7fzdTlFzWZrB1Ruw_J91XjomfKWdYvdWau_uB9_P-tf0QeJJQLO7Gox-SO656Qe_vpHv8p-XGIKtXPr98GemVQnQVlTB9YK8BG37_TBUw9JNbXBQRzMagLXCwQfHrt5WSK_3kyMDJB8N0_hvPBLdRByoNx_A4UzOahyrCjYyWRfQVCQA38zv4DMRIUorVqAacdICaGiMUhpHyx0_NnZLz74dP7PZpSR1DDmrqgVvuaFQrhjBTKVdwyo7SQ3ihfWckt9o9j3IuGla70jJfSI3R1rsHR99Y7_pyMumnnNgmIpmJO416oBSqfWiIirlwIx5Wyrhstt0i1mgitSbzqIb3HpI33-7xdH8Q2jUmLY7JF2LXkLHKL3ELmLc6Nv_h8-3pS3lroxb8IvSSj5bx3rxCiLfXrtL5-AYhNRMY
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=P261%E2%80%83Early+and+accurate+diagnosis+of+patients+with+axial+spondyloarthritis+using+machine+learning%3A+a+predictive+analysis+from+electronic+health+records+in+the+United+Kingdom&rft.jtitle=Rheumatology+%28Oxford%2C+England%29&rft.au=Sengupta%2C+Raj&rft.au=Narasimham%2C+Shruti&rft.au=Mato%2C+Borja+S&rft.au=Meglic%2C+Matic&rft.date=2022-04-23&rft.issn=1462-0324&rft.eissn=1462-0332&rft.volume=61&rft.issue=Supplement_1&rft_id=info:doi/10.1093%2Frheumatology%2Fkeac133.260&rft.externalDBID=n%2Fa&rft.externalDocID=10_1093_rheumatology_keac133_260
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1462-0324&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1462-0324&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1462-0324&client=summon