13
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Health indicator recording in UK primary care electronic health records: key implications for handling missing data

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Clinical databases are increasingly used for health research; many of them capture information on common health indicators including height, weight, blood pressure, cholesterol level, smoking status, and alcohol consumption. However, these are often not recorded on a regular basis; missing data are ubiquitous. We described the recording of health indicators in UK primary care and evaluated key implications for handling missing data.

          Methods

          We examined the recording of health indicators in The Health Improvement Network (THIN) UK primary care database over time, by demographic variables (age and sex) and chronic diseases (diabetes, myocardial infarction, and stroke). Using weight as an example, we fitted linear and logistic regression models to examine the associations of weight measurements and the probability of having weight recorded with individuals’ demographic characteristics and chronic diseases.

          Results

          In total, 6,345,851 individuals aged 18–99 years contributed data to THIN between 2000 and 2015. Women aged 18–65 years were more likely than men of the same age to have health indicators recorded; this gap narrowed after age 65. About 60–80% of individuals had their height, weight, blood pressure, smoking status, and alcohol consumption recorded during the first year of registration. In the years following registration, these proportions fell to 10%–40%. Individuals with chronic diseases were more likely to have health indicators recorded, particularly after the introduction of a General Practitioner incentive scheme. Individuals’ demographic characteristics and chronic diseases were associated with both observed weight measurements and missingness in weight.

          Conclusion

          Missing data in common health indicators will affect statistical analysis in health research studies. A single analysis of primary care data using the available information alone may be misleading. Multiple imputation of missing values accounting for demographic characteristics and disease status is recommended but should be considered and implemented carefully. Sensitivity analysis exploring alternative assumptions for missing data should also be evaluated.

          Related collections

          Most cited references16

          • Record: found
          • Abstract: found
          • Article: not found

          Generalisability of The Health Improvement Network (THIN) database: demographics, chronic disease prevalence and mortality rates.

          The degree of generalisability of patient databases to the general population is important for interpreting database research. This report describes the representativeness of The Health Improvement Network (THIN), a UK primary care database, of the UK population. Demographics, deprivation (Townsend), Quality and Outcomes Framework (QOF) condition prevalence and deaths from THIN were compared with national statistical and QOF 2006/2007 data. Demographics were similar although THIN contained fewer people aged under 25 years. Condition prevalence was comparable, e.g. 3.5% diabetes prevalence in THIN, 3.7% nationally. More THIN patients lived in the most affluent areas (23.5% in THIN, 20% nationally). Between 1990 and 2009, standardised mortality ratio ranged from 0.81 (95% CI: 0.39-1.49; 1990) to 0.93 (95% CI: 0.48-1.64; 1995). Adjusting for demographics/deprivation, the 2006 THIN death rate was 9.08/1000 population close to the national death rate of 9.4/1000 population. THIN is generalisable to the UK for demographics, major condition prevalence and death rates adjusted for demographics and deprivation.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Multiple imputation: current perspectives.

            This paper provides an overview of multiple imputation and current perspectives on its use in medical research. We begin with a brief review of the problem of handling missing data in general and place multiple imputation in this context, emphasizing its relevance for longitudinal clinical trials and observational studies with missing covariates. We outline how multiple imputation proceeds in practice and then sketch its rationale. We explore the problem of obtaining proper imputations in some detail and distinguish two main classes of approach, methods based on fully multivariate models, and those that iterate conditional univariate models. We show how the use of so-called uncongenial imputation models are particularly valuable for sensitivity analyses and also for certain analyses in clinical trial settings. We also touch upon other forms of sensitivity analysis that use multiple imputation. Finally, we give some open questions that the increasing use of multiple imputation has thrown up, which we believe are useful directions for future research.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Use of multiple imputation in the epidemiologic literature.

              The authors attempted to catalog the use of procedures to impute missing data in the epidemiologic literature and to determine the degree to which imputed results differed in practice from unimputed results. The full text of articles published in 2005 and 2006 in four leading epidemiologic journals was searched for the text imput. Sixteen articles utilizing multiple imputation, inverse probability weighting, or the expectation-maximization algorithm to impute missing data were found. The small number of relevant manuscripts and diversity of detail provided precluded systematic analysis of the use of imputation procedures. To form a bridge between current and future practice, the authors suggest details that should be included in articles that utilize these procedures.
                Bookmark

                Author and article information

                Journal
                Clin Epidemiol
                Clin Epidemiol
                Clinical Epidemiology
                Clinical Epidemiology
                Dove Medical Press
                1179-1349
                2019
                11 February 2019
                : 11
                : 157-167
                Affiliations
                [1 ]Department of Primary Care and Population Health, University College London, London NW3 2PF, UK, i.petersen@ 123456ucl.ac.uk
                [2 ]Department of Clinical Epidemiology, Aarhus University, 8200 Aarhus N, Denmark, i.petersen@ 123456ucl.ac.uk
                [3 ]Department of Health Sciences, University of Leicester, Leicester LE1 7RH, UK
                [4 ]Department of Population Health Sciences, Bristol Medical School, University of Bristol, Bristol BS8 2PS, UK
                [5 ]MRC Clinical Trials Unit at UCL, London WC1V 6LJ, UK
                [6 ]Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London WC1E 7HT, UK
                Author notes
                Correspondence: Irene Petersen, Department of Primary Care and Population Health, Upper Third Floor, UCL Medical School (Royal Free Campus), Rowland Hill Street, London NW3 2PF, UK, Tel +44 207 794 0500; ext 34395, Email i.petersen@ 123456ucl.ac.uk
                Article
                clep-11-157
                10.2147/CLEP.S191437
                6377050
                30809103
                822ed8bd-ee57-47d9-a305-2dadfb87c92d
                © 2019 Petersen et al. This work is published by Dove Medical Press Limited, and licensed under a Creative Commons Attribution License

                The full terms of the License are available at http://creativecommons.org/licenses/by/4.0/. The license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                Categories
                Original Research

                Public health
                primary care,ehrs,recording,qof,multiple imputation,statistics,epidemiology,research methods,data analysis

                Comments

                Comment on this article