Development and external validation of tools for categorizing diagnosis codes in international hospital data

The Clinical Classification Software Refined (CCSR) is a tool that groups many thousands of International Classification of Diseases 10th Revision (ICD-10) diagnosis codes into approximately 500 clinically meaningful categories, simplifying analyses. However, CCSR was developed for use in the United...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of medical informatics (Shannon, Ireland) Vol. 189; p. 105508
Main Authors Malecki, Sarah L., Loffler, Anne, Tamming, Daniel, Dyrby Johansen, Niklas, Biering-Sørensen, Tor, Fralick, Michael, Sohail, Shahmir, Shi, Jessica, Roberts, Surain B, Colacci, Michael, Ismail, Marwa, Razak, Fahad, Verma, Amol A.
Format Journal Article
LanguageEnglish
Published Ireland Elsevier B.V 01.09.2024
Subjects
Online AccessGet full text
ISSN1386-5056
1872-8243
1872-8243
DOI10.1016/j.ijmedinf.2024.105508

Cover

More Information
Summary:The Clinical Classification Software Refined (CCSR) is a tool that groups many thousands of International Classification of Diseases 10th Revision (ICD-10) diagnosis codes into approximately 500 clinically meaningful categories, simplifying analyses. However, CCSR was developed for use in the United States and may not work well with other country-specific ICD-10 coding systems. We developed an algorithm for semi-automated matching of Canadian ICD-10 codes (ICD-10-CA) to CCSR categories using discharge diagnoses from adult admissions at 7 hospitals between Apr 1, 2010 and Dec 31, 2020, and manually validated the results. We then externally validated our approach using inpatient hospital encounters in Denmark from 2017 to 2018. There were 383,972 Canadian hospital admissions with 5,186 distinct ICD-10-CA diagnosis codes and 1,855,837 Danish encounters with 4,612 ICD-10 diagnosis codes. Only 46.6% of Canadian codes and 49.4% of Danish codes could be directly categorized using the official CCSR tool. Our algorithm facilitated the mapping of 98.5% of all Canadian codes and 97.7% of Danish codes. Validation of our algorithm by clinicians demonstrated excellent accuracy (97.1% and 97.0% in Canadian and Danish data, respectively). Without our algorithm, many common conditions did not match directly to a CCSR category, such as 96.6% of hospital admissions for heart failure. The GEMINI CCSR matching algorithm (available as an open-source package at https://github.com/GEMINI-Medicine/gemini-ccsr) improves the categorization of Canadian and Danish ICD-10 codes into clinically coherent categories compared to the original CCSR tool. We expect this approach to generalize well to other countries and enable a wide range of research and quality measurement applications.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ObjectType-Undefined-3
ISSN:1386-5056
1872-8243
1872-8243
DOI:10.1016/j.ijmedinf.2024.105508