Evaluating large-scale propensity score performance through real-world and synthetic data experiments

Y Tian, MJ Schuemie… - International journal of …, 2018 - academic.oup.com
International journal of epidemiology, 2018academic.oup.com
Background Propensity score adjustment is a popular approach for confounding control in
observational studies. Reliable frameworks are needed to determine relative propensity
score performance in large-scale studies, and to establish optimal propensity score model
selection methods. Methods We detail a propensity score evaluation framework that
includes synthetic and real-world data experiments. Our synthetic experimental design
extends the 'plasmode'framework and simulates survival data under known effect sizes, and …
Background
Propensity score adjustment is a popular approach for confounding control in observational studies. Reliable frameworks are needed to determine relative propensity score performance in large-scale studies, and to establish optimal propensity score model selection methods.
Methods
We detail a propensity score evaluation framework that includes synthetic and real-world data experiments. Our synthetic experimental design extends the ‘plasmode’ framework and simulates survival data under known effect sizes, and our real-world experiments use a set of negative control outcomes with presumed null effect sizes. In reproductions of two published cohort studies, we compare two propensity score estimation methods that contrast in their model selection approach: -regularized regression that conducts a penalized likelihood regression, and the ‘high-dimensional propensity score’ (hdPS) that employs a univariate covariate screen. We evaluate methods on a range of outcome-dependent and outcome-independent metrics.
Results
-regularization propensity score methods achieve superior model fit, covariate balance and negative control bias reduction compared with the hdPS. Simulation results are mixed and fluctuate with simulation parameters, revealing a limitation of simulation under the proportional hazards framework. Including regularization with the hdPS reduces commonly reported non-convergence issues but has little effect on propensity score performance.
Conclusions
-regularization incorporates all covariates simultaneously into the propensity score model and offers propensity score performance superior to the hdPS marginal screen.
Oxford University Press