Identifying Pneumonia Sub-types from Electronic Health Records Using Rule-based Algorithms

International Classification of Disease (ICD) coding for pneumonia classification is based on causal organism or use of general pneumonia codes, creating challenges for epidemiological evaluations, where pneumonia is standardly subtyped by settings, exposures and time of emergence. Pneumonia subtype...

Full description

Saved in:
Bibliographic Details
Published inMethods of information in medicine
Main Authors Hegde, Harshad, Glurich, Ingrid, Panny, Aloksagar, Vedre, Jayanth G, VanWormer, Jeffrey J, Berg, Richard, Scannapieco, Frank A, Miecznikowski, Jeffrey, Acharya, Amit
Format Journal Article
LanguageEnglish
Published Germany 17.03.2022
Online AccessGet more information
ISSN2511-705X
DOI10.1055/a-1801-2718

Cover

Abstract International Classification of Disease (ICD) coding for pneumonia classification is based on causal organism or use of general pneumonia codes, creating challenges for epidemiological evaluations, where pneumonia is standardly subtyped by settings, exposures and time of emergence. Pneumonia subtype classification requires data available in electronic health records (EHR), frequently in non-structured formats including radiological interpretation or clinical notes that complicate electronic classification. The current study undertook development of a rule-based pneumonia subtyping algorithm for stratifying pneumonia by the setting in which it emerged using information documented in the EHR. Pneumonia subtype classification was developed by interrogating patient information within the EHR of a large private Health System. ICD coding was mined in the EHR applying requirements for 'rule of two' pneumonia-related codes or one ICD code and radiologically-confirmed pneumonia validated by natural language processing and/or documented antibiotic prescriptions. A rule-based algorithm flow chart was created to support sub-classification based on features including symptomatic patient point of entry into the healthcare system timing of pneumonia emergence and identification of clinical, laboratory or medication orders that informed definition of the pneumonia sub-classification algorithm. Data from 65,904 study-eligible patients with 91,998 episodes of pneumonia diagnoses documented by 380,509 encounters were analyzed, while 8,611 episodes were excluded following NLP classification of pneumonia status as 'negative' or 'unknown'. Subtyping of 83,387 episodes identified: community acquired (54.5%), hospital-acquired (20%), aspiration-related (10.7%), healthcare-acquired (5%), ventilator-associated (0.4%) cases, and 9.4% were not classifiable by the algorithm. Study outcome indicated capacity to achieve electronic pneumonia subtype classification based on interrogation of big data available in the EHR. Examination of portability of the algorithm to achieve rule-based pneumonia classification in other health systems remains to be explored.
AbstractList International Classification of Disease (ICD) coding for pneumonia classification is based on causal organism or use of general pneumonia codes, creating challenges for epidemiological evaluations, where pneumonia is standardly subtyped by settings, exposures and time of emergence. Pneumonia subtype classification requires data available in electronic health records (EHR), frequently in non-structured formats including radiological interpretation or clinical notes that complicate electronic classification. The current study undertook development of a rule-based pneumonia subtyping algorithm for stratifying pneumonia by the setting in which it emerged using information documented in the EHR. Pneumonia subtype classification was developed by interrogating patient information within the EHR of a large private Health System. ICD coding was mined in the EHR applying requirements for 'rule of two' pneumonia-related codes or one ICD code and radiologically-confirmed pneumonia validated by natural language processing and/or documented antibiotic prescriptions. A rule-based algorithm flow chart was created to support sub-classification based on features including symptomatic patient point of entry into the healthcare system timing of pneumonia emergence and identification of clinical, laboratory or medication orders that informed definition of the pneumonia sub-classification algorithm. Data from 65,904 study-eligible patients with 91,998 episodes of pneumonia diagnoses documented by 380,509 encounters were analyzed, while 8,611 episodes were excluded following NLP classification of pneumonia status as 'negative' or 'unknown'. Subtyping of 83,387 episodes identified: community acquired (54.5%), hospital-acquired (20%), aspiration-related (10.7%), healthcare-acquired (5%), ventilator-associated (0.4%) cases, and 9.4% were not classifiable by the algorithm. Study outcome indicated capacity to achieve electronic pneumonia subtype classification based on interrogation of big data available in the EHR. Examination of portability of the algorithm to achieve rule-based pneumonia classification in other health systems remains to be explored.
Author Berg, Richard
Scannapieco, Frank A
Hegde, Harshad
Acharya, Amit
Panny, Aloksagar
Vedre, Jayanth G
VanWormer, Jeffrey J
Miecznikowski, Jeffrey
Glurich, Ingrid
Author_xml – sequence: 1
  givenname: Harshad
  surname: Hegde
  fullname: Hegde, Harshad
  organization: Center for Oral and Systemic Health, Marshfield Clinic Research Institute, Marshfield, United States
– sequence: 2
  givenname: Ingrid
  surname: Glurich
  fullname: Glurich, Ingrid
  organization: Center for Oral and Systemic Health, Marshfield Clinic Research Institute, Marshfield, United States
– sequence: 3
  givenname: Aloksagar
  surname: Panny
  fullname: Panny, Aloksagar
  organization: Center for Oral and Systemic Health, Marshfield Clinic Research Institute, Marshfield, United States
– sequence: 4
  givenname: Jayanth G
  surname: Vedre
  fullname: Vedre, Jayanth G
  organization: Critical Care Medicine Department, Marshfield Clinic Health System, Marshfield, United States
– sequence: 5
  givenname: Jeffrey J
  surname: VanWormer
  fullname: VanWormer, Jeffrey J
  organization: Center for Oral and Systemic Health, Marshfield Clinic Research Institute, Marshfield, United States
– sequence: 6
  givenname: Richard
  surname: Berg
  fullname: Berg, Richard
  organization: Office of Research Computing and Analytics, Marshfield Clinic Research Institute, Marshfield, United States
– sequence: 7
  givenname: Frank A
  surname: Scannapieco
  fullname: Scannapieco, Frank A
  organization: Department of Oral Biology, School of Dental Medicine, State University of New York at Buffalo, Buffalo, United States
– sequence: 8
  givenname: Jeffrey
  surname: Miecznikowski
  fullname: Miecznikowski, Jeffrey
  organization: Department of Biostatistics, School of Public Health and Health Professions, State University of New York at Buffalo, Buffalo, United States
– sequence: 9
  givenname: Amit
  surname: Acharya
  fullname: Acharya, Amit
  organization: Advocate Aurora Research Institute, Advocate Aurora Health Inc, Downers Grove, United States
BackLink https://www.ncbi.nlm.nih.gov/pubmed/35299265$$D View this record in MEDLINE/PubMed
BookMark eNo1j01Lw0AURQdR7Ieu3Mv8gdH3Jplksiyl2kJBqRXETZlJXtpIMgkzySL_3oq6uovLuZw7Y5eudcTYHcIDglKPRqAGFDJFfcGmUiGKFNTHhM1C-AIArSG-ZpNIySyTiZqyz01Brq_KsXJH_upoaFpXGf42WNGPHQVe-rbhq5ry3p-bnK_J1P2J7yhvfRH4e_gBd0NNwppABV_Ux9ZX_akJN-yqNHWg27-cs_3Tar9ci-3L82a52Iou1SjKKCODuT57GyUjbeIYpDRSJcpCFsdoySRRpnKLEjCPIbVkkxQ0FjJJEOWc3f_OdoNtqDh0vmqMHw__H-U37HZR_w
ContentType Journal Article
Copyright Thieme. All rights reserved.
Copyright_xml – notice: Thieme. All rights reserved.
DBID NPM
DOI 10.1055/a-1801-2718
DatabaseName PubMed
DatabaseTitle PubMed
DatabaseTitleList PubMed
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
DeliveryMethod no_fulltext_linktorsrc
Discipline Medicine
EISSN 2511-705X
ExternalDocumentID 35299265
Genre Journal Article
GroupedDBID ---
0R~
123
4.4
5RE
AAWTL
ABCQX
ABJNI
ABOCM
ACGFS
AENEX
AHRSK
ALMA_UNASSIGNED_HOLDINGS
C45
CS3
DU5
EBS
F5P
H13
L7B
NPM
OK1
OVD
RTC
RTE
TEORI
ID FETCH-LOGICAL-p781-f39ea1c8180a5238a44022a2565b09441bea6395cb1201c407beb67081d266112
IngestDate Thu Jan 02 22:54:56 EST 2025
IsPeerReviewed true
IsScholarly true
Language English
License Thieme. All rights reserved.
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-p781-f39ea1c8180a5238a44022a2565b09441bea6395cb1201c407beb67081d266112
PMID 35299265
ParticipantIDs pubmed_primary_35299265
PublicationCentury 2000
PublicationDate 2022-Mar-17
PublicationDateYYYYMMDD 2022-03-17
PublicationDate_xml – month: 03
  year: 2022
  text: 2022-Mar-17
  day: 17
PublicationDecade 2020
PublicationPlace Germany
PublicationPlace_xml – name: Germany
PublicationTitle Methods of information in medicine
PublicationTitleAlternate Methods Inf Med
PublicationYear 2022
SSID ssj0008804
Score 2.2919877
Snippet International Classification of Disease (ICD) coding for pneumonia classification is based on causal organism or use of general pneumonia codes, creating...
SourceID pubmed
SourceType Index Database
Title Identifying Pneumonia Sub-types from Electronic Health Records Using Rule-based Algorithms
URI https://www.ncbi.nlm.nih.gov/pubmed/35299265
hasFullText
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1NS8NAEF38APEifn_LHrzJarZmm-ZYpFIERaRK8SK72U1brGmx9aC_3pnMNilFRb2EkiVtyHuZfbPdN8PYMep4G9lYhDYIRJjE8ErVaqFQgTRWJzWIm2gUvr6pNu_Dq7Zql9ttc3fJ2JwmH1_6Sv6DKpwDXNEl-wdkiy-FE_AZ8IUjIAzHX2FMLltyKt1m7g1-uKcxFghcWR2Rd6RRNrrxniNKOUcntF3g7q3vBE5m9qTe7wxee-Our2A-afSUN5mm2rRZYXbEhZLZP-abrkNVfJuQLne1LTb39LF2UZfiUee1Vwzc6ix7J5_N4HmkO7rYKvzgLK2OX-l3AL_re4D59QlIbXGzG82hLo9jmMSIKFDtLyN2oLC4hRYSpkpRiSgYT2E3fMnBA50YxxVqK_Hz6Ez57MnQPJuPIuztcYPLOX6qhuAVetMm3MfZ1F0ss6XJlTMJRy48WqtsxWcMvE7wr7E5l62zpWv_6DfY4xQLeMECXrCAIwt4yQJOLOCeBTxnAS9ZwEsWbLLWZaN10RS-Y4YYRjUp0vPYaZmgfV8r0GI6DAEQDapWGUjjQ2mcBkWqEiNB9yWQyxtnqhGoQos6TVa22EI2yNwO47KqKmkiz5MUFHcVrpcuts4qLOaeBibdZdv0UJ6GVBXlafK49r4d2WfLJUEO2GIKr6E7BE03Nkc5Lp-xd0zL
linkProvider National Library of Medicine
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Identifying+Pneumonia+Sub-types+from+Electronic+Health+Records+Using+Rule-based+Algorithms&rft.jtitle=Methods+of+information+in+medicine&rft.au=Hegde%2C+Harshad&rft.au=Glurich%2C+Ingrid&rft.au=Panny%2C+Aloksagar&rft.au=Vedre%2C+Jayanth+G&rft.date=2022-03-17&rft.eissn=2511-705X&rft_id=info:doi/10.1055%2Fa-1801-2718&rft_id=info%3Apmid%2F35299265&rft_id=info%3Apmid%2F35299265&rft.externalDocID=35299265