The Use of Bootstrapping when Using Propensity-Score Matching without Replacement: A Simulation Study

Austin, Peter C; Small, Dylan S

The Use of Bootstrapping when Using Propensity-Score Matching without Replacement: A Simulation Study

Files

Austin_Small_The_use_of_bootstrapping_when_using_propensity_score_matching_without_replacement.pdf (909.17 KB)

Penn collection

Statistics Papers

Subject

propensity score
propensity-score matching
bootstrap
variance estimation
Monte Carlo simulations
matching
Business
Business Analytics
Management Sciences and Quantitative Methods
Statistics and Probability

Permalink

https://repository.upenn.edu/handle/20.500.14332/48027

View all metadata

Author

Austin, Peter C

Small, Dylan S

Abstract

Propensity‐score matching is frequently used to estimate the effect of treatments, exposures, and interventions when using observational data. An important issue when using propensity‐score matching is how to estimate the standard error of the estimated treatment effect. Accurate variance estimation permits construction of confidence intervals that have the advertised coverage rates and tests of statistical significance that have the correct type I error rates. There is disagreement in the literature as to how standard errors should be estimated. The bootstrap is a commonly used resampling method that permits estimation of the sampling variability of estimated parameters. Bootstrap methods are rarely used in conjunction with propensity‐score matching. We propose two different bootstrap methods for use when using propensity‐score matching without replacement and examined their performance with a series of Monte Carlo simulations. The first method involved drawing bootstrap samples from the matched pairs in the propensity‐score‐matched sample. The second method involved drawing bootstrap samples from the original sample and estimating the propensity score separately in each bootstrap sample and creating a matched sample within each of these bootstrap samples. The former approach was found to result in estimates of the standard error that were closer to the empirical standard deviation of the sampling distribution of estimated effects.

Publication date

2014-08-04

Collection

Reports