P261 Early and accurate diagnosis of patients with axial spondyloarthritis using machine learning: a predictive analysis from electronic health records in the United Kingdom
Abstract Background/Aims On average, there is a delay of 6.7 years between symptom onset and diagnosis of axial spondyloarthritis (axSpA). Since traditional approaches to improving early axSpA identification have had limited success, predictive automated analyses using patient records may help allev...
Saved in:
| Published in | Rheumatology (Oxford, England) Vol. 61; no. Supplement_1 |
|---|---|
| Main Authors | , , , , , , |
| Format | Journal Article |
| Language | English |
| Published |
Oxford University Press
23.04.2022
|
| Online Access | Get full text |
| ISSN | 1462-0324 1462-0332 1462-0332 |
| DOI | 10.1093/rheumatology/keac133.260 |
Cover
| Abstract | Abstract
Background/Aims
On average, there is a delay of 6.7 years between symptom onset and diagnosis of axial spondyloarthritis (axSpA). Since traditional approaches to improving early axSpA identification have had limited success, predictive automated analyses using patient records may help alleviate the burden on healthcare providers. We report results from a machine learning (ML) algorithm developed with UK electronic health records (EHRs) Clinical Practice Research Datalink (CPRD) data to estimate the probability or likelihood of a patient being diagnosed with axSpA based on prior clinical indicators and patient history.
Methods
Primary care UK EHR data - CPRD GOLD was used to identify patients with axSpA and healthy controls (HC). Patients aged ≥18 years with first diagnosis date of axSpA within the identification period (01-Jan-2005 to 31-Dec-2018) and fulfilling CPRD research acceptability criteria were included. Data pertaining to clinical presentation, consultation, referral, test, and therapy history were extracted for individual patients prior to diagnosis of axSpA. A total of 5,090 patients with axSpA satisfied the acceptability criteria. HC were randomly sampled to create a subset of one unique HC matched to each patient with axSpA, resulting in 5,089 HC. ML usable features derived from the total population (patients with axSpA and HC) numbered 820. After using a further exclusion criterion for the patients with axSpA and HC who had ≥1 of 820 usable features, the final dataset included 7,813 patients (3,902 with axSpA and 3,911 HC). This combined dataset was randomly split (67:33) into a train (n = 5237) and a test (n = 2576) dataset. A random forest (RF) model was trained on the train dataset. Cross-validation was performed for hyper-parameter tuning of the RF classifier. Once the model was trained, accuracy, precision, and F-1 scores were obtained with the test dataset.
Results
The RF-based algorithm resulted in a high level of accuracy (88.12%), with precision of 0.95 for patients with axSpA and 0.83 for HC. The RF algorithm identified 89 best clinical predictors (out of 820 used as inputs) that differentiated between patient and HC such as: total number of tests, total number of referrals, first age of consultation, first symptom age, and number of low back pain symptoms. The model sensitivity was 0.75 and positive predictive value was 80.88%. The model specificity was 0.96 and negative predictive value was 82.56%.
Conclusion
The ML algorithm demonstrated a high level of accuracy and precision in the identification of possible cases of axSpA, which may be useful in reducing the delay in diagnosis. Previous studies have successfully demonstrated automated cohort identification of axSpA in large datasets, with only a few using ML-based approaches for diagnosis from patient medical history. While our model supports previous work in axSpA, it needs further validation in routine clinical practice (exploration ongoing).
Disclosure
R. Sengupta: Honoraria; AbbVie, Biogen, Celgene, Lilly, MSD, Novartis, Roche, UCB. Grants/research support; AbbVie, Celgene, Novartis, UCB. S. Narasimham: Shareholder/stock ownership; Novartis. Other; Employee of Novartis. B.S. Mato: Shareholder/stock ownership; Novartis. Other; Employee of Novartis. M. Meglic: Other; Employee of Novartis. C. Perella: Other; Employee of Novartis. P. Pamies: Other; Employee of Novartis. P. Emery: Consultancies; AbbVie, Astra-Zeneca, BMS, Boehringer Ingelheim, Celltrion, Gilead, Janssen, MSD, Lilly, Novartis, Pfizer, Roche, Samsung, UCB. Grants/research support; AbbVie, BMS, Lilly, Novartis, Pfizer, Roche, Samsung. |
|---|---|
| AbstractList | Abstract
Background/Aims
On average, there is a delay of 6.7 years between symptom onset and diagnosis of axial spondyloarthritis (axSpA). Since traditional approaches to improving early axSpA identification have had limited success, predictive automated analyses using patient records may help alleviate the burden on healthcare providers. We report results from a machine learning (ML) algorithm developed with UK electronic health records (EHRs) Clinical Practice Research Datalink (CPRD) data to estimate the probability or likelihood of a patient being diagnosed with axSpA based on prior clinical indicators and patient history.
Methods
Primary care UK EHR data - CPRD GOLD was used to identify patients with axSpA and healthy controls (HC). Patients aged ≥18 years with first diagnosis date of axSpA within the identification period (01-Jan-2005 to 31-Dec-2018) and fulfilling CPRD research acceptability criteria were included. Data pertaining to clinical presentation, consultation, referral, test, and therapy history were extracted for individual patients prior to diagnosis of axSpA. A total of 5,090 patients with axSpA satisfied the acceptability criteria. HC were randomly sampled to create a subset of one unique HC matched to each patient with axSpA, resulting in 5,089 HC. ML usable features derived from the total population (patients with axSpA and HC) numbered 820. After using a further exclusion criterion for the patients with axSpA and HC who had ≥1 of 820 usable features, the final dataset included 7,813 patients (3,902 with axSpA and 3,911 HC). This combined dataset was randomly split (67:33) into a train (n = 5237) and a test (n = 2576) dataset. A random forest (RF) model was trained on the train dataset. Cross-validation was performed for hyper-parameter tuning of the RF classifier. Once the model was trained, accuracy, precision, and F-1 scores were obtained with the test dataset.
Results
The RF-based algorithm resulted in a high level of accuracy (88.12%), with precision of 0.95 for patients with axSpA and 0.83 for HC. The RF algorithm identified 89 best clinical predictors (out of 820 used as inputs) that differentiated between patient and HC such as: total number of tests, total number of referrals, first age of consultation, first symptom age, and number of low back pain symptoms. The model sensitivity was 0.75 and positive predictive value was 80.88%. The model specificity was 0.96 and negative predictive value was 82.56%.
Conclusion
The ML algorithm demonstrated a high level of accuracy and precision in the identification of possible cases of axSpA, which may be useful in reducing the delay in diagnosis. Previous studies have successfully demonstrated automated cohort identification of axSpA in large datasets, with only a few using ML-based approaches for diagnosis from patient medical history. While our model supports previous work in axSpA, it needs further validation in routine clinical practice (exploration ongoing).
Disclosure
R. Sengupta: Honoraria; AbbVie, Biogen, Celgene, Lilly, MSD, Novartis, Roche, UCB. Grants/research support; AbbVie, Celgene, Novartis, UCB. S. Narasimham: Shareholder/stock ownership; Novartis. Other; Employee of Novartis. B.S. Mato: Shareholder/stock ownership; Novartis. Other; Employee of Novartis. M. Meglic: Other; Employee of Novartis. C. Perella: Other; Employee of Novartis. P. Pamies: Other; Employee of Novartis. P. Emery: Consultancies; AbbVie, Astra-Zeneca, BMS, Boehringer Ingelheim, Celltrion, Gilead, Janssen, MSD, Lilly, Novartis, Pfizer, Roche, Samsung, UCB. Grants/research support; AbbVie, BMS, Lilly, Novartis, Pfizer, Roche, Samsung. |
| Author | Pamies, Paula Meglic, Matic Sengupta, Raj Narasimham, Shruti Perella, Chiara Mato, Borja S Emery, Paul |
| Author_xml | – sequence: 1 givenname: Raj surname: Sengupta fullname: Sengupta, Raj – sequence: 2 givenname: Shruti surname: Narasimham fullname: Narasimham, Shruti – sequence: 3 givenname: Borja S surname: Mato fullname: Mato, Borja S – sequence: 4 givenname: Matic surname: Meglic fullname: Meglic, Matic – sequence: 5 givenname: Chiara surname: Perella fullname: Perella, Chiara – sequence: 6 givenname: Paula surname: Pamies fullname: Pamies, Paula – sequence: 7 givenname: Paul surname: Emery fullname: Emery, Paul |
| BookMark | eNqVkNFKHTEQhkOxUD3tO8wLHE02u-vZXhSKWCsK9qJeL2MyOZs2myxJtrp3Qh_Id_JJjBwppVf2aoZh_u-H74Dt-eCJMRD8UPBOHsWB5hFzcGG7HP0kVELKw6rlb9i-qNtqzaWs9v7sVf2OHaT0g3PeCLnZZw_fqlY83v8-xegWQK8BlZojZgJtcetDsgmCgQmzJZ8T3No8AN5ZdJCm4PXiAsY8RJvL45ys38KIarCewBFGXw4fAWGKpK3K9heVEnTLM9bEMAI5UjkGbxUMhK7AI6kQdQLrIQ8E195m0nBRQDqM79lbgy7Rh5e5YtdfTr-ffF1fXp2dn3y-XCvRbfha35iN4Cga0dZIx1ILhTd1axSaY91KLVtBQpq6Ew01RsimNU3Nibqiy2hDcsW6HXf2Ey636Fw_RTtiXHrB-2fz_d_m-xfzfTFfsptdVsWQUiTzP9FP_0SVzUV98Dmida8ByB0gzNPra58AUWPAxw |
| CitedBy_id | crossref_primary_10_3390_s23094455 crossref_primary_10_1016_j_berh_2023_101870 |
| ContentType | Journal Article |
| Copyright | The Author(s) 2022. Published by Oxford University Press on behalf of the British Society for Rheumatology. All rights reserved. For permissions, please email: journals.permissions@oup.com. 2022 |
| Copyright_xml | – notice: The Author(s) 2022. Published by Oxford University Press on behalf of the British Society for Rheumatology. All rights reserved. For permissions, please email: journals.permissions@oup.com. 2022 |
| DBID | AAYXX CITATION ADTOC UNPAY |
| DOI | 10.1093/rheumatology/keac133.260 |
| DatabaseName | CrossRef Unpaywall for CDI: Periodical Content Unpaywall |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: UNPAY name: Unpaywall url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/ sourceTypes: Open Access Repository |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Medicine |
| DocumentTitleAlternate | British Society for Rheumatology Annual Conference 2022 Abstracts |
| EISSN | 1462-0332 |
| ExternalDocumentID | 10.1093/rheumatology/keac133.260 10_1093_rheumatology_keac133_260 |
| GroupedDBID | --- -E4 .2P .GJ .I3 .XZ .ZR 08P 0R~ 18M 1TH 29P 2WC 354 3O- 4.4 48X 53G 5RE 5VS 5WA 5WD 70D AABZA AACZT AAGKA AAJKP AAJQQ AAMDB AAMVS AAOGV AAPGJ AAPNW AAPQZ AAPXW AARHZ AASNB AAUAY AAUQX AAVAP AAWDT AAWTL ABEUO ABIXL ABJNI ABKDP ABLJU ABNHQ ABNKS ABOCM ABPTD ABQLI ABQNK ABQTQ ABXVV ABZBJ ACFRR ACGFO ACGFS ACPRK ACUFI ACUTJ ACUTO ACYHN ADBBV ADEYI ADEZT ADGZP ADHKW ADHZD ADIPN ADJQC ADOCK ADQBN ADRIX ADRTK ADVEK ADYVW ADZXQ AEGPL AEGXH AEJOX AEKSI AEMDU AENEX AENZO AEPUE AETBJ AEWNT AFFNX AFFZL AFIYH AFOFC AFXAL AFXEN AGINJ AGKEF AGSYK AGUTN AHMBA AHXPO AIAGR AIJHB AJEEA AKWXX ALMA_UNASSIGNED_HOLDINGS ALUQC APIBT APWMN AQDSO ATGXG ATTQO AXUDD AZFZN BAWUL BAYMD BCRHZ BEYMZ BHONS BTRTY BVRKM C45 CAG CDBKE COF CS3 CZ4 DAKXR DIK DILTD DU5 D~K E3Z EBD EBS EE~ EIHJH EJD EMOBN ENERS F5P F9B FECEO FEDTE FLUFQ FOEOM FOTVD FQBLK GAUVT GJXCC GX1 H13 H5~ HAR HVGLF HW0 HZ~ IOX J21 KAQDR KBUDW KC5 KOP KQ8 KSI KSN L7B M-Z M49 MHKGH N9A NGC NOMLY NOYVH NTWIH NU- NVLIB O0~ O9- OAUYM OAWHX OBH OCZFY ODMLO OHH OJQWA OJZSN OK1 OPAEJ OVD OWPYF O~Y P2P PAFKI PB- PEELM PQQKQ Q1. Q5Y R44 RD5 RHF RIG RNI ROL ROX RUSNO RW1 RXO RZF RZO SV3 TCURE TEORI TJX TMA TR2 VVN W8F WOQ X7H YAYTL YKOAZ YXANX ZGI ZKX ZY1 ~91 AAYXX ABDFA ABEJV ABGNP ABVGC ADNBA AEMQT AHGBF AHMMS AJBYB AJNCP ALXQX CITATION JXSIZ ABNGD ACUKT ADMTO ADTOC AEHUL AGORE AGQPQ UNPAY |
| ID | FETCH-LOGICAL-c1980-dbf810a15164ae73d1cab46fcaf7d63d361e13f4915e5f1356f540ee9033fdfe3 |
| IEDL.DBID | UNPAY |
| ISSN | 1462-0324 1462-0332 |
| IngestDate | Tue Aug 19 21:46:24 EDT 2025 Thu Apr 24 22:59:07 EDT 2025 Wed Oct 01 04:43:24 EDT 2025 Wed Aug 28 03:19:00 EDT 2024 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | Supplement_1 |
| Language | English |
| License | This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model) https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c1980-dbf810a15164ae73d1cab46fcaf7d63d361e13f4915e5f1356f540ee9033fdfe3 |
| OpenAccessLink | https://proxy.k.utb.cz/login?url=https://academic.oup.com/rheumatology/article-pdf/61/Supplement_1/keac133.260/43436175/keac133.260.pdf |
| ParticipantIDs | unpaywall_primary_10_1093_rheumatology_keac133_260 crossref_primary_10_1093_rheumatology_keac133_260 crossref_citationtrail_10_1093_rheumatology_keac133_260 oup_primary_10_1093_rheumatology_keac133_260 |
| ProviderPackageCode | CITATION AAYXX |
| PublicationCentury | 2000 |
| PublicationDate | 2022-04-23 |
| PublicationDateYYYYMMDD | 2022-04-23 |
| PublicationDate_xml | – month: 04 year: 2022 text: 2022-04-23 day: 23 |
| PublicationDecade | 2020 |
| PublicationTitle | Rheumatology (Oxford, England) |
| PublicationYear | 2022 |
| Publisher | Oxford University Press |
| Publisher_xml | – name: Oxford University Press |
| SSID | ssj0005138 |
| Score | 2.3756204 |
| Snippet | Abstract
Background/Aims
On average, there is a delay of 6.7 years between symptom onset and diagnosis of axial spondyloarthritis (axSpA). Since traditional... |
| SourceID | unpaywall crossref oup |
| SourceType | Open Access Repository Enrichment Source Index Database Publisher |
| Title | P261 Early and accurate diagnosis of patients with axial spondyloarthritis using machine learning: a predictive analysis from electronic health records in the United Kingdom |
| URI | https://academic.oup.com/rheumatology/article-pdf/61/Supplement_1/keac133.260/43436175/keac133.260.pdf |
| UnpaywallVersion | publishedVersion |
| Volume | 61 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVAFT databaseName: Open Access Digital Library customDbUrl: eissn: 1462-0332 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0005138 issn: 1462-0324 databaseCode: KQ8 dateStart: 19960101 isFulltext: true titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html providerName: Colorado Alliance of Research Libraries – providerCode: PRVBFR databaseName: Free Medical Journals customDbUrl: eissn: 1462-0332 dateEnd: 20241101 omitProxy: true ssIdentifier: ssj0005138 issn: 1462-0324 databaseCode: DIK dateStart: 19960101 isFulltext: true titleUrlDefault: http://www.freemedicaljournals.com providerName: Flying Publisher – providerCode: PRVFQY databaseName: GFMER Free Medical Journals customDbUrl: eissn: 1462-0332 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0005138 issn: 1462-0324 databaseCode: GX1 dateStart: 0 isFulltext: true titleUrlDefault: http://www.gfmer.ch/Medical_journals/Free_medical.php providerName: Geneva Foundation for Medical Education and Research |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3NbtQwELboVoITUH5EK6jm0CNO4tpxkt4qRKmQWvXASsspsmO7rbrNrnY3gvZUiQOPwzvxJIxjb9lyKkjcoihjx__fjGe-IWSHZ97kLkpa6NxRIRpBVcEdNVmuG585rerpi4-O5eFQfBzlo-ht4WNhVPQKT5YhDbMz2yF2623MaexNOjUulSztE1_2VrSapRe4haDClSBAT324JB7N-erLBIXWyLrMEbMPyPrw-GT_cwg92qUZD9lvwzNfuvygtn-3_pXi7pxjITbuUddO1dUXNR6vHFEHT8j3ZeOCZ8pF0i100lz_wfv4_1v_lDyOKBf2Q1Eb5IFtn5GHR_Ee_zn5cYIq1c-bbz29MqjWgGqazrNWgAm-f-dzmDiIrK9z8OZiUF9xsYD36TVX4wn-51nPyATed_8ULnu3UAsxD8bpHiiYznyVfkfHSgL7CviAGvid_QdCJCgEa9UczltATAwBi4NP-WImly_I8OD9p3eHNKaOoA2ryowa7UqWKYQzUihbcMMapYV0jXKFkdxg_1jGnahYbnPHeC4dQldrKxx9Z5zlL8mgnbT2FQFRFcxq3Au1QOVTS0TEhfXhuFKWZaXlJimWE6FuIq-6T-8xrsP9Pq9XB7GOY1LjmGwSdis5Ddwi95B5i3PjLz7fvZ2U9xba-heh12SwmHX2DUK0hd4max9GbDuusl8k7UW3 |
| linkProvider | Unpaywall |
| linkToUnpaywall | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwELZgK8GJN2oRoDlwxElcO07CrUJUFVKrHlipnCI_26rb7Gp3I9qekDjwc_hP_BLGsbdsORUkblGUseP3N-OZbwh5w4tgchc1rXTpqRBGUFVxT21RahMypzUDffH-gdwbi49H5VHytgixMCp5hWerkIb5iesRuw025jz1Jp1Zn0uWD4kvBytay_Iz3EJQ4coQoOchXBKP5nL9ZYZCd8mGLBGzj8jG-OBw53MMPdqmBY_Zb-MzX7n8oLZ_s_614m6cYzE27n7fzdTlFzWZrB1Ruw_J91XjomfKWdYvdWau_uB9_P-tf0QeJJQLO7Gox-SO656Qe_vpHv8p-XGIKtXPr98GemVQnQVlTB9YK8BG37_TBUw9JNbXBQRzMagLXCwQfHrt5WSK_3kyMDJB8N0_hvPBLdRByoNx_A4UzOahyrCjYyWRfQVCQA38zv4DMRIUorVqAacdICaGiMUhpHyx0_NnZLz74dP7PZpSR1DDmrqgVvuaFQrhjBTKVdwyo7SQ3ihfWckt9o9j3IuGla70jJfSI3R1rsHR99Y7_pyMumnnNgmIpmJO416oBSqfWiIirlwIx5Wyrhstt0i1mgitSbzqIb3HpI33-7xdH8Q2jUmLY7JF2LXkLHKL3ELmLc6Nv_h8-3pS3lroxb8IvSSj5bx3rxCiLfXrtL5-AYhNRMY |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=P261%E2%80%83Early+and+accurate+diagnosis+of+patients+with+axial+spondyloarthritis+using+machine+learning%3A+a+predictive+analysis+from+electronic+health+records+in+the+United+Kingdom&rft.jtitle=Rheumatology+%28Oxford%2C+England%29&rft.au=Sengupta%2C+Raj&rft.au=Narasimham%2C+Shruti&rft.au=Mato%2C+Borja+S&rft.au=Meglic%2C+Matic&rft.date=2022-04-23&rft.issn=1462-0324&rft.eissn=1462-0332&rft.volume=61&rft.issue=Supplement_1&rft_id=info:doi/10.1093%2Frheumatology%2Fkeac133.260&rft.externalDBID=n%2Fa&rft.externalDocID=10_1093_rheumatology_keac133_260 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1462-0324&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1462-0324&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1462-0324&client=summon |