An algorithm to predict data completeness in oncology electronic medical records for comparative effectiveness research
Published in | Annals of epidemiology Vol. 76; pp. 143 - 149 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published | United States, Elsevier Inc, 01.12.2022 |
Subjects | |
ISSN | 1047-2797, 1873-2585 |
DOI | 10.1016/j.annepidem.2022.07.007 |
Summary: | • Electronic health record (EHR) discontinuity can lead to misclassification bias.
• Patients with high continuity in an EHR may have less misclassification.
• We constructed an algorithm that identifies high EHR-continuity in oncology patients.
Electronic health record (EHR) discontinuity (missing out-of-network encounters) can lead to information bias. We sought to construct an algorithm that identifies high EHR-continuity among oncology patients.
Using a linked Medicare-EHR database and regression, we sought to 1) measure how often Medicare claims for outpatient encounters were substantiated by visits recorded in the EHR, and 2) predict continuity ratio, defined as the yearly proportion of outpatient encounters reported to Medicare that were captured by EHR data. The prediction model's performance was evaluated with the coefficient of determination and Spearman's correlation. We quantified variable misclassification by decile of continuity ratio using standardized difference and sensitivity.
A total of 79,678 subjects met all eligibility criteria. Predicted and observed continuity were highly correlated (Spearman's ρ = 0.86). On average across all variables measured, the mean standardized difference (MSD) was reduced by a factor of 1/7th and sensitivity was improved 35-fold comparing subjects in the highest vs. lowest decile of continuity ratio (CR).
In the oncology population, restricting EHR-based study cohorts to subjects with high continuity may reduce misclassification without greatly impacting representativeness. Further work is needed to elucidate the best manner of implementing continuity prediction rules in cohort studies. |
---|---|