Identification of smoking using Medicare data - a validation study of claims-based algorithms

Purpose This study examined the accuracy of claims‐based algorithms to identify smoking against self‐reported smoking data. Methods Medicare patients enrolled in the Brigham and Women's Hospital Rheumatoid Arthritis Sequential Study were identified. For each patient, self‐reported smoking statu...

Full description

Saved in:

Bibliographic Details
Published in	Pharmacoepidemiology and drug safety Vol. 25; no. 4; pp. 472 - 475
Main Authors	Desai, Rishi J, Solomon, Daniel H, Shadick, Nancy, Iannaccone, Christine, Kim, Seoyoung C
Format	Journal Article
Language	English
Published	England Blackwell Publishing Ltd 01.04.2016 Wiley Subscription Services, Inc
Subjects	Aged Algorithms claims-based algorithm Databases, Factual - statistics & numerical data Female Humans Insurance claims Male Medicare Medicare - statistics & numerical data Middle Aged pharmacoepidemiology Predictive Value of Tests Prospective Studies Self Report Sensitivity and Specificity Smoking Smoking - epidemiology United States validation claims-based algorithm validation smoking pharmacoepidemiology
Online Access	Get full text
ISSN	1053-8569 1099-1557 1099-1557
DOI	10.1002/pds.3953

Cover

More Information
Summary:	Purpose This study examined the accuracy of claims‐based algorithms to identify smoking against self‐reported smoking data. Methods Medicare patients enrolled in the Brigham and Women's Hospital Rheumatoid Arthritis Sequential Study were identified. For each patient, self‐reported smoking status was extracted from Women's Hospital Rheumatoid Arthritis Sequential Study and the date of this measurement was defined as the index‐date. Two algorithms identified smoking in Medicare claims: (i) only using diagnoses and procedure codes and (ii) using anti‐smoking prescriptions in addition to diagnoses and procedure codes. Both algorithms were implemented: first, only using 365‐days pre‐index claims and then using all available pre‐index claims. Considering self‐reported smoking status as the gold standard, we calculated specificity, sensitivity, positive predictive value, negative predictive value (NPV), and area under the curve (AUC). Results A total of 128 patients were included in this study, of which 48% reported smoking. The algorithm only using diagnosis and procedure codes had the lowest sensitivity (9.8%, 95%CI 2.4%–17.3%), NPV (54.9%, 95%CI 46.1%–63.9%), and AUC (0.55, 95%CI 0.51–0.59) when applied in the period of 365 days pre‐index. Incorporating pharmacy claims and using all available pre‐index information improved the sensitivity (27.9%, 95%CI 16.6%–39.1%), NPV (60.4%, 95%CI 51.3%–69.5%), and AUC (0.64, 95%CI 0.58–0.70). The specificity and positive predictive value was 100% for all the algorithms tested. Conclusion Claims‐based algorithms can identify smokers with limited sensitivity but very high specificity. In the absence of other reliable means, use of a claims‐based algorithm to identify smoking could be cautiously considered in observational studies. Copyright © 2016 John Wiley & Sons, Ltd.
Bibliography:	ArticleID:PDS3953 ark:/67375/WNG-R6SN2ZPC-0 istex:8FA81E91DD96EB69C50AE87C435A7F5978A605AF ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 ObjectType-Undefined-3
ISSN:	1053-8569 1099-1557 1099-1557
DOI:	10.1002/pds.3953