11
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      rEHR: An R package for manipulating and analysing Electronic Health Record data

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Research with structured Electronic Health Records (EHRs) is expanding as data becomes more accessible; analytic methods advance; and the scientific validity of such studies is increasingly accepted. However, data science methodology to enable the rapid searching/extraction, cleaning and analysis of these large, often complex, datasets is less well developed. In addition, commonly used software is inadequate, resulting in bottlenecks in research workflows and in obstacles to increased transparency and reproducibility of the research. Preparing a research-ready dataset from EHRs is a complex and time consuming task requiring substantial data science skills, even for simple designs. In addition, certain aspects of the workflow are computationally intensive, for example extraction of longitudinal data and matching controls to a large cohort, which may take days or even weeks to run using standard software. The rEHR package simplifies and accelerates the process of extracting ready-for-analysis datasets from EHR databases. It has a simple import function to a database backend that greatly accelerates data access times. A set of generic query functions allow users to extract data efficiently without needing detailed knowledge of SQL queries. Longitudinal data extractions can also be made in a single command, making use of parallel processing. The package also contains functions for cutting data by time-varying covariates, matching controls to cases, unit conversion and construction of clinical code lists. There are also functions to synthesise dummy EHR. The package has been tested with one for the largest primary care EHRs, the Clinical Practice Research Datalink (CPRD), but allows for a common interface to other EHRs. This simplified and accelerated work flow for EHR data extraction results in simpler, cleaner scripts that are more easily debugged, shared and reproduced.

          Related collections

          Most cited references16

          • Record: found
          • Abstract: found
          • Article: not found

          Pay-for-performance programs in family practices in the United Kingdom.

          In 2004, after a series of national initiatives associated with marked improvements in the quality of care, the National Health Service of the United Kingdom introduced a pay-for-performance contract for family practitioners. This contract increases existing income according to performance with respect to 146 quality indicators covering clinical care for 10 chronic diseases, organization of care, and patient experience. We analyzed data extracted automatically from clinical computing systems for 8105 family practices in England in the first year of the pay-for-performance program (April 2004 through March 2005), data from the U.K. Census, and data on characteristics of individual family practices. We examined the proportion of patients deemed eligible for a clinical quality indicator for whom the indicator was met (reported achievement) and the proportion of the total number of patients with a medical condition for whom a quality indicator was met (population achievement), and we used multiple regression analysis to determine the extent to which practices achieved high scores by classifying patients as ineligible for quality indicators (exception reporting). The median reported achievement in the first year of the new contract was 83.4 percent (interquartile range, 78.2 to 87.0 percent). Sociodemographic characteristics of the patients (age and socioeconomic features) and practices (size of practice, number of patients per practitioner, age of practitioner, and whether the practitioner was medically educated in the United Kingdom) had moderate but significant effects on performance. Exception reporting by practices was not extensive (median rate, 6 percent), but it was the strongest predictor of achievement: a 1 percent increase in the rate of exception reporting was associated with a 0.31 percent increase in reported achievement. Exception reporting was high in a small number of practices: 1 percent of practices excluded more than 15 percent of patients. English family practices attained high levels of achievement in the first year of the new pay-for-performance contract. A small number of practices appear to have achieved high scores by excluding large numbers of patients by exception reporting. More research is needed to determine whether these practices are excluding patients for sound clinical reasons or in order to increase income. Copyright 2006 Massachusetts Medical Society.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            ClinicalCodes: An Online Clinical Codes Repository to Improve the Validity and Reproducibility of Research Using Electronic Medical Records

            Lists of clinical codes are the foundation for research undertaken using electronic medical records (EMRs). If clinical code lists are not available, reviewers are unable to determine the validity of research, full study replication is impossible, researchers are unable to make effective comparisons between studies, and the construction of new code lists is subject to much duplication of effort. Despite this, the publication of clinical codes is rarely if ever a requirement for obtaining grants, validating protocols, or publishing research. In a representative sample of 450 EMR primary research articles indexed on PubMed, we found that only 19 (5.1%) were accompanied by a full set of published clinical codes and 32 (8.6%) stated that code lists were available on request. To help address these problems, we have built an online repository where researchers using EMRs can upload and download lists of clinical codes. The repository will enable clinical researchers to better validate EMR studies, build on previous code lists and compare disease definitions across studies. It will also assist health informaticians in replicating database studies, tracking changes in disease definitions or clinical coding practice through time and sharing clinical code information across platforms and data sources as research objects.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Observational data for comparative effectiveness research: an emulation of randomised trials of statins and primary prevention of coronary heart disease.

              This article reviews methods for comparative effectiveness research using observational data. The basic idea is using an observational study to emulate a hypothetical randomised trial by comparing initiators versus non-initiators of treatment. After adjustment for measured baseline confounders, one can then conduct the observational analogue of an intention-to-treat analysis. We also explain two approaches to conduct the analogues of per-protocol and as-treated analyses after further adjusting for measured time-varying confounding and selection bias using inverse-probability weighting. As an example, we implemented these methods to estimate the effect of statins for primary prevention of coronary heart disease (CHD) using data from electronic medical records in the UK. Despite strong confounding by indication, our approach detected a potential benefit of statin therapy. The analogue of the intention-to-treat hazard ratio (HR) of CHD was 0.89 (0.73, 1.09) for statin initiators versus non-initiators. The HR of CHD was 0.84 (0.54, 1.30) in the per-protocol analysis and 0.79 (0.41, 1.41) in the as-treated analysis for 2 years of use versus no use. In contrast, a conventional comparison of current users versus never users of statin therapy resulted in a HR of 1.31 (1.04, 1.66). We provide a flexible and annotated SAS program to implement the proposed analyses.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS One
                PLoS ONE
                plos
                plosone
                PLoS ONE
                Public Library of Science (San Francisco, CA USA )
                1932-6203
                2017
                23 February 2017
                : 12
                : 2
                : e0171784
                Affiliations
                [1 ]NIHR School for Primary Care Research, University of Manchester, Manchester, United Kingdom
                [2 ]Centre for Biostatistics, Faculty of Biology, Medicine & Health, University of Manchester, Manchester, United Kingdom
                [3 ]Centre for Pharmacoepidemiology & Drug Safety, Faculty of Biology, Medicine & Health, University of Manchester, Manchester, United Kingdom
                [4 ]Informatics Research Centre, School of Computing Mathematics and Digital Technology, Manchester Metropolitan University, Manchester, United Kingdom
                [5 ]The Farr Institute for Health Informatics Research, Faculty of Biology, Medicine & Health, University of Manchester, Manchester, United Kingdom
                Indiana University, UNITED STATES
                Author notes

                Competing Interests: The authors have declared that no competing interests exist.

                • Conceptualization: DAS EK.

                • Funding acquisition: EK DR.

                • Methodology: DAS EK.

                • Software: DAS RP.

                • Supervision: EK.

                • Validation: RP IO.

                • Visualization: DAS EK.

                • Writing – original draft: DAS.

                • Writing – review & editing: RP IO DR EK.

                [¤]

                Current address: Vaughan House, Portsmouth Street, M13 9GB, Manchester, United Kingdom

                Author information
                http://orcid.org/0000-0001-6450-5815
                Article
                PONE-D-16-40998
                10.1371/journal.pone.0171784
                5323003
                28231289
                bcb1de2a-ca89-4315-9bae-6963187c14d7
                © 2017 Springate et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 14 October 2016
                : 25 January 2017
                Page count
                Figures: 0, Tables: 3, Pages: 25
                Funding
                Funded by: NIHR School for Primary Care Research
                Award ID: 211
                Award Recipient :
                Funded by: funder-id http://dx.doi.org/10.13039/501100000265, Medical Research Council;
                Award ID: MR/K006665/1
                Award Recipient :
                This study was funded by the National Institute for Health Research (NIHR) School for Primary Care Research (SPCR), under the title ‘An analytical framework for increasing the efficiency and validity of research using primary care databases’ (Project no. 211). This paper presents independent research funded by the National Institute for Health Research (NIHR). The views expressed are those of the authors and not necessarily those of the NHS, the National Institute for Health Research or the Department of Health. In addition, MRC Health eResearch Centre Grant MR/K006665/1 supported the time and facilities of one investigator (EK).
                Categories
                Research Article
                Computer and Information Sciences
                Information Technology
                Databases
                Research and Analysis Methods
                Database and Informatics Methods
                Database Searching
                Medicine and Health Sciences
                Endocrinology
                Endocrine Disorders
                Diabetes Mellitus
                Medicine and Health Sciences
                Metabolic Disorders
                Diabetes Mellitus
                Medicine and Health Sciences
                Health Care
                Primary Care
                Research and Analysis Methods
                Database and Informatics Methods
                Medicine and Health Sciences
                Pulmonology
                Asthma
                Computer and Information Sciences
                Information Technology
                Data Processing
                Medicine and Health Sciences
                Pharmacology
                Drug Screening
                Custom metadata
                Data are available from the Comprehensive R Archive Network (CRAN) ( https://cran.r-project.org/web/packages/rEHR/index.html) and via Github ( https://github.com/rOpenHealth/rEHR).

                Uncategorized
                Uncategorized

                Comments

                Comment on this article