An algorithm to predict data completeness in oncology electronic medical records for comparative effectiveness research

•Electronic health record (EHR) discontinuity can lead to misclassification bias.•Patients with high continuity in an EHR may have less misclassification.•We constructed an algorithm that identifies high EHR-continuity in oncology patients. Electronic health record (EHR) discontinuity (missing out-o...

Full description

Saved in:
Bibliographic Details
Published inAnnals of epidemiology Vol. 76; pp. 143 - 149
Main Authors Merola, David, Schneeweiss, Sebastian, Schrag, Deborah, Lii, Joyce, Lin, Kueiyu Joshua
Format Journal Article
LanguageEnglish
Published United States Elsevier Inc 01.12.2022
Subjects
Online AccessGet full text
ISSN1047-2797
1873-2585
1873-2585
DOI10.1016/j.annepidem.2022.07.007

Cover

More Information
Summary:•Electronic health record (EHR) discontinuity can lead to misclassification bias.•Patients with high continuity in an EHR may have less misclassification.•We constructed an algorithm that identifies high EHR-continuity in oncology patients. Electronic health record (EHR) discontinuity (missing out-of-network encounters) can lead to information bias. We sought to construct an algorithm that identifies high EHR-continuity among oncology patients. Using a linked Medicare-EHR database and regression, we sought to 1) measure how often Medicare claims for outpatient encounters were substantiated by visits recorded in the EHR, and 2) predict continuity ratio, defined as the yearly proportion of outpatient encounters reported to Medicare that were captured by EHR data. The prediction model...s performance was evaluated with the coefficient of determination and Spearman...s correlation. We quantified variable misclassification by decile of continuity ratio using standardized difference and sensitivity. A total of 79,678 subjects met all eligibility criteria. Predicted and observed continuity was highly correlated (σSpearman=0.86). On average across all variables measured, MSD was reduced by a factor of 1/7th and sensitivity was improved 35-fold comparing subjects in the highest vs. lowest decile of CR. In the oncology population, restricting EHR-based study cohorts to subjects with high continuity may reduce misclassification without greatly impacting representativeness. Further work is needed to elucidate the best manner of implementing continuity prediction rules in cohort studies.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1047-2797
1873-2585
1873-2585
DOI:10.1016/j.annepidem.2022.07.007