A comparison of 12 algorithms for matching on the propensity score
Propensity‐score matching is increasingly being used to reduce the confounding that can occur in observational studies examining the effects of treatments or interventions on outcomes. We used Monte Carlo simulations to examine the following algorithms for forming matched pairs of treated and untrea...
Saved in:
| Published in | Statistics in medicine Vol. 33; no. 6; pp. 1057 - 1069 |
|---|---|
| Main Author | |
| Format | Journal Article |
| Language | English |
| Published |
England
Blackwell Publishing Ltd
15.03.2014
Wiley Subscription Services, Inc BlackWell Publishing Ltd |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0277-6715 1097-0258 1097-0258 |
| DOI | 10.1002/sim.6004 |
Cover
| Summary: | Propensity‐score matching is increasingly being used to reduce the confounding that can occur in observational studies examining the effects of treatments or interventions on outcomes. We used Monte Carlo simulations to examine the following algorithms for forming matched pairs of treated and untreated subjects: optimal matching, greedy nearest neighbor matching without replacement, and greedy nearest neighbor matching without replacement within specified caliper widths. For each of the latter two algorithms, we examined four different sub‐algorithms defined by the order in which treated subjects were selected for matching to an untreated subject: lowest to highest propensity score, highest to lowest propensity score, best match first, and random order. We also examined matching with replacement. We found that (i) nearest neighbor matching induced the same balance in baseline covariates as did optimal matching; (ii) when at least some of the covariates were continuous, caliper matching tended to induce balance on baseline covariates that was at least as good as the other algorithms; (iii) caliper matching tended to result in estimates of treatment effect with less bias compared with optimal and nearest neighbor matching; (iv) optimal and nearest neighbor matching resulted in estimates of treatment effect with negligibly less variability than did caliper matching; (v) caliper matching had amongst the best performance when assessed using mean squared error; (vi) the order in which treated subjects were selected for matching had at most a modest effect on estimation; and (vii) matching with replacement did not have superior performance compared with caliper matching without replacement. © 2013 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd. |
|---|---|
| Bibliography: | Supporting info item istex:B0886D5E685F8C0ACE6D22699D991A3B5300FDC0 ark:/67375/WNG-S4XRRF1D-G ArticleID:SIM6004 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-2 content type line 23 |
| ISSN: | 0277-6715 1097-0258 1097-0258 |
| DOI: | 10.1002/sim.6004 |