19
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Regression with missing Ys: An improved strategy for analyzing multiply imputed data

      Preprint

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          When fitting a generalized linear model -- such as a linear regression, a logistic regression, or a hierarchical linear model -- analysts often wonder how to handle missing values of the dependent variable Y. If missing values have been filled in using multiple imputation, the usual advice is to use the imputed Y values in analysis. We show, however, that using imputed Ys can add needless noise to the estimates. Better estimates can usually be obtained using a modified strategy that we call multiple imputation, then deletion (MID). Under MID, all cases are used for imputation, but following imputation cases with imputed Y values are excluded from the analysis. When there is something wrong with the imputed Y values, MID protects the estimates from the problematic imputations. And when the imputed Y values are acceptable, MID usually offers somewhat more efficient estimates than an ordinary MI strategy.

          Related collections

          Author and article information

          Journal
          2016-05-03
          Article
          10.1111/j.1467-9531.2007.00180.x
          1605.01095
          3be0548d-b999-4227-a9cc-e6c9a7bca60d

          http://arxiv.org/licenses/nonexclusive-distrib/1.0/

          History
          Custom metadata
          Sociological Methodology (2007) volume 37, pp. 83-117
          stat.ME

          Methodology
          Methodology

          Comments

          Comment on this article