17
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Propensity score analysis with partially observed covariates: How should multiple imputation be used?

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Inverse probability of treatment weighting is a popular propensity score-based approach to estimate marginal treatment effects in observational studies at risk of confounding bias. A major issue when estimating the propensity score is the presence of partially observed covariates. Multiple imputation is a natural approach to handle missing data on covariates: covariates are imputed and a propensity score analysis is performed in each imputed dataset to estimate the treatment effect. The treatment effect estimates from each imputed dataset are then combined to obtain an overall estimate. We call this method MIte. However, an alternative approach has been proposed, in which the propensity scores are combined across the imputed datasets (MIps). Therefore, there are remaining uncertainties about how to implement multiple imputation for propensity score analysis: (a) should we apply Rubin’s rules to the inverse probability of treatment weighting treatment effect estimates or to the propensity score estimates themselves? (b) does the outcome have to be included in the imputation model? (c) how should we estimate the variance of the inverse probability of treatment weighting estimator after multiple imputation? We studied the consistency and balancing properties of the MIte and MIps estimators and performed a simulation study to empirically assess their performance for the analysis of a binary outcome. We also compared the performance of these methods to complete case analysis and the missingness pattern approach, which uses a different propensity score model for each pattern of missingness, and a third multiple imputation approach in which the propensity score parameters are combined rather than the propensity scores themselves (MIpar). Under a missing at random mechanism, complete case and missingness pattern analyses were biased in most cases for estimating the marginal treatment effect, whereas multiple imputation approaches were approximately unbiased as long as the outcome was included in the imputation model. Only MIte was unbiased in all the studied scenarios and Rubin’s rules provided good variance estimates for MIte. The propensity score estimated in the MIte approach showed good balancing properties. In conclusion, when using multiple imputation in the inverse probability of treatment weighting context, MIte with the outcome included in the imputation model is the preferred approach.

          Related collections

          Most cited references30

          • Record: found
          • Abstract: found
          • Article: not found

          Randomized, Controlled Trials, Observational Studies, and the Hierarchy of Research Designs

          New England Journal of Medicine, 342(25), 1887-1892
            Bookmark
            • Record: found
            • Abstract: not found
            • Book: not found

            Causal Inference for Statistics, Social, and Biomedical Sciences

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Using the outcome for imputation of missing predictor values was preferred.

              Epidemiologic studies commonly estimate associations between predictors (risk factors) and outcome. Most software automatically exclude subjects with missing values. This commonly causes bias because missing values seldom occur completely at random (MCAR) but rather selectively based on other (observed) variables, missing at random (MAR). Multiple imputation (MI) of missing predictor values using all observed information including outcome is advocated to deal with selective missing values. This seems a self-fulfilling prophecy. We tested this hypothesis using data from a study on diagnosis of pulmonary embolism. We selected five predictors of pulmonary embolism without missing values. Their regression coefficients and standard errors (SEs) estimated from the original sample were considered as "true" values. We assigned missing values to these predictors--both MCAR and MAR--and repeated this 1,000 times using simulations. Per simulation we multiple imputed the missing values without and with the outcome, and compared the regression coefficients and SEs to the truth. Regression coefficients based on MI including outcome were close to the truth. MI without outcome yielded very biased--underestimated--coefficients. SEs and coverage of the 90% confidence intervals were not different between MI with and without outcome. Results were the same for MCAR and MAR. For all types of missing values, imputation of missing predictor values using the outcome is preferred over imputation without outcome and is no self-fulfilling prophecy.
                Bookmark

                Author and article information

                Journal
                Stat Methods Med Res
                Stat Methods Med Res
                SMM
                spsmm
                Statistical Methods in Medical Research
                SAGE Publications (Sage UK: London, England )
                0962-2802
                1477-0334
                02 June 2017
                January 2019
                : 28
                : 1
                : 3-19
                Affiliations
                [1 ]Department of Medical Statistics, London School of Hygiene and Tropical Medicine, UK
                [2 ]MRC Biostatistics Unit, Cambridge Institute for Public Health, Cambridge, UK
                [3 ]London Hub for Trials Methodology Research, MRC Clinical Trials Unit, UCL, London, UK
                [4 ]Department of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, UK
                [5 ]IMS Health, Real-World Evidence Solutions, UK
                [6 ]SBIM Biostatistics and Medical Information, Hôpital Saint-Louis, France
                [7 ]ECSTRA Team (Epidémiologie Clinique et Statistiques pour la Recherche en Santé), UMR 1153 INSERM, Université Paris Diderot, France
                [8 ]Farr Institute of Health Informatics, London University College, London, UK
                Author notes
                [*]Clémence Leyrat, Department of Medical Statistics, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK. Email: clemence.leyrat@ 123456lshtm.ac.uk
                Article
                10.1177_0962280217713032
                10.1177/0962280217713032
                6313366
                28573919
                7730ea2a-f5da-47f9-88b6-7789174b9a4e
                © The Author(s) 2017

                This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License ( http://www.creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages ( https://us.sagepub.com/en-us/nam/open-access-at-sage).

                History
                Categories
                Articles

                missing covariates,chained equations,rubin’s rules,inverse probability of treatment weighting,missingness pattern

                Comments

                Comment on this article