140
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Evaluation of two-fold fully conditional specification multiple imputation for longitudinal electronic health record data

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Most implementations of multiple imputation (MI) of missing data are designed for simple rectangular data structures ignoring temporal ordering of data. Therefore, when applying MI to longitudinal data with intermittent patterns of missing data, some alternative strategies must be considered. One approach is to divide data into time blocks and implement MI independently at each block. An alternative approach is to include all time blocks in the same MI model. With increasing numbers of time blocks, this approach is likely to break down because of co-linearity and over-fitting. The new two-fold fully conditional specification (FCS) MI algorithm addresses these issues, by only conditioning on measurements, which are local in time. We describe and report the results of a novel simulation study to critically evaluate the two-fold FCS algorithm and its suitability for imputation of longitudinal electronic health records. After generating a full data set, approximately 70% of selected continuous and categorical variables were made missing completely at random in each of ten time blocks. Subsequently, we applied a simple time-to-event model. We compared efficiency of estimated coefficients from a complete records analysis, MI of data in the baseline time block and the two-fold FCS algorithm. The results show that the two-fold FCS algorithm maximises the use of data available, with the gain relative to baseline MI depending on the strength of correlations within and between variables. Using this approach also increases plausibility of the missing at random assumption by using repeated measures over time of variables whose baseline values may be missing.

          Related collections

          Most cited references29

          • Record: found
          • Abstract: found
          • Article: not found
          Is Open Access

          Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls

          Most studies have some missing data. Jonathan Sterne and colleagues describe the appropriate use and reporting of the multiple imputation approach to dealing with them
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Multiple imputation of discrete and continuous data by fully conditional specification.

            The goal of multiple imputation is to provide valid inferences for statistical estimates from incomplete data. To achieve that goal, imputed values should preserve the structure in the data, as well as the uncertainty about this structure, and include any knowledge about the process that generated the missing data. Two approaches for imputing multivariate data exist: joint modeling (JM) and fully conditional specification (FCS). JM is based on parametric statistical theory, and leads to imputation procedures whose statistical properties are known. JM is theoretically sound, but the joint model may lack flexibility needed to represent typical data features, potentially leading to bias. FCS is a semi-parametric and flexible alternative that specifies the multivariate model by a series of conditional models, one for each incomplete variable. FCS provides tremendous flexibility and is easy to apply, but its statistical properties are difficult to establish. Simulation work shows that FCS behaves very well in the cases studied. The present paper reviews and compares the approaches. JM and FCS were applied to pubertal development data of 3801 Dutch girls that had missing data on menarche (two categories), breast development (five categories) and pubic hair development (six stages). Imputations for these data were created under two models: a multivariate normal model with rounding and a conditionally specified discrete model. The JM approach introduced biases in the reference curves, whereas FCS did not. The paper concludes that FCS is a useful and easily applied flexible alternative to JM when no convenient and realistic joint distribution can be specified.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Risk of myocardial infarction and stroke after acute infection or vaccination.

              There is evidence that chronic inflammation may promote atherosclerotic disease. We tested the hypothesis that acute infection and vaccination increase the short-term risk of vascular events. We undertook within-person comparisons, using the case-series method, to study the risks of myocardial infarction and stroke after common vaccinations and naturally occurring infections. The study was based on the United Kingdom General Practice Research Database, which contains computerized medical records of more than 5 million patients. A total of 20,486 persons with a first myocardial infarction and 19,063 persons with a first stroke who received influenza vaccine were included in the analysis. There was no increase in the risk of myocardial infarction or stroke in the period after influenza, tetanus, or pneumococcal vaccination. However, the risks of both events were substantially higher after a diagnosis of systemic respiratory tract infection and were highest during the first three days (incidence ratio for myocardial infarction, 4.95; 95 percent confidence interval, 4.43 to 5.53; incidence ratio for stroke, 3.19; 95 percent confidence interval, 2.81 to 3.62). The risks then gradually fell during the following weeks. The risks were raised significantly but to a lesser degree after a diagnosis of urinary tract infection. The findings for recurrent myocardial infarctions and stroke were similar to those for first events. Our findings provide support for the concept that acute infections are associated with a transient increase in the risk of vascular events. By contrast, influenza, tetanus, and pneumococcal vaccinations do not produce a detectable increase in the risk of vascular events. Copyright 2004 Massachusetts Medical Society.
                Bookmark

                Author and article information

                Journal
                Stat Med
                Stat Med
                sim
                Statistics in Medicine
                BlackWell Publishing Ltd (Oxford, UK )
                0277-6715
                1097-0258
                20 September 2014
                30 April 2014
                : 33
                : 21
                : 3725-3737
                Affiliations
                [a ]Department of Primary Care and Population Health, University College London (UCL) London, U.K.
                [b ]Department of Medical Statistics, London School of Hygiene and Tropical Medicine London, U.K.
                [c ]MRC Biostatistics Unit Cambridge, U.K.
                [d ]MRC Clinical Trials Unit, Aviation House Kingsway, London, U.K.
                Author notes
                *Correspondence to: Irene Petersen, Department of Primary Care and Population Health, University of College London (UCL), Rowland Hill Street, London NW3 2PF, U.K.
                Article
                10.1002/sim.6184
                4285297
                24782349
                e53e74fa-3046-4857-ab38-f3fee1d11063
                © 2014 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd.

                This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.

                History
                : 05 May 2013
                : 03 April 2014
                Categories
                Research Articles

                Biostatistics
                multiple imputation,missing data,partially observed,longitudinal electronic health records

                Comments

                Comment on this article