256
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research

      research-article

      Read this article at

      ScienceOpenPublisherPMC
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Objective

          To review the methods and dimensions of data quality assessment in the context of electronic health record (EHR) data reuse for research.

          Materials and methods

          A review of the clinical research literature discussing data quality assessment methodology for EHR data was performed. Using an iterative process, the aspects of data quality being measured were abstracted and categorized, as well as the methods of assessment used.

          Results

          Five dimensions of data quality were identified, which are completeness, correctness, concordance, plausibility, and currency, and seven broad categories of data quality assessment methods: comparison with gold standards, data element agreement, data source agreement, distribution comparison, validity checks, log review, and element presence.

          Discussion

          Examination of the methods by which clinical researchers have investigated the quality and suitability of EHR data for research shows that there are fundamental features of data quality, which may be difficult to measure, as well as proxy dimensions. Researchers interested in the reuse of EHR data for clinical research are recommended to consider the adoption of a consistent taxonomy of EHR data quality, to remain aware of the task-dependence of data quality, to integrate work on data quality assessment from other fields, and to adopt systematic, empirically driven, statistically based methods of data quality assessment.

          Conclusion

          There is currently little consistency or potential generalizability in the methods used to assess EHR data quality. If the reuse of EHR data for clinical research is to become accepted, researchers should adopt validated, systematic methods of EHR data quality assessment.

          Related collections

          Most cited references94

          • Record: found
          • Abstract: found
          • Article: not found

          Validation of information recorded on general practitioner based computerised data resource in the United Kingdom.

          To determine the extent of agreement between clinical information recorded on surgery computers of selected general practitioners and similar information in manual records of letters received from hospital consultants and kept in the general practitioners' files. Hospital consultants' letters in the manual records of selected general practitioners were photocopied and the consultants' clinical diagnoses were compared with diagnoses recorded on computer. General practices in the United Kingdom using computers provided by VAMP Health for recording clinical information. 2491 patients who received one of three non-steroidal anti-inflammatory drugs and who attended 58 practices whose computer recorded data were considered after a preliminary review to be of satisfactory quality. Among 1191 patients for whom consultants' letters were forwarded a clinical diagnosis reflecting the diagnosis noted on a consultant letter was present on the computer record for 1038 (87%). Clinical information available on the computer records of the general practitioners who participated in this study is satisfactory for many clinical studies.
            • Record: found
            • Abstract: found
            • Article: not found

            Review: electronic health records and the reliability and validity of quality measures: a review of the literature.

            Previous reviews of research on electronic health record (EHR) data quality have not focused on the needs of quality measurement. The authors reviewed empirical studies of EHR data quality, published from January 2004, with an emphasis on data attributes relevant to quality measurement. Many of the 35 studies reviewed examined multiple aspects of data quality. Sixty-six percent evaluated data accuracy, 57% data completeness, and 23% data comparability. The diversity in data element, study setting, population, health condition, and EHR system studied within this body of literature made drawing specific conclusions regarding EHR data quality challenging. Future research should focus on the quality of data from specific EHR components and important data attributes for quality measurement such as granularity, timeliness, and comparability. Finally, factors associated with poor or variability in data quality need to be better understood and effective interventions developed.
              • Record: found
              • Abstract: found
              • Article: not found

              Secondary Use of EHR: Data Quality Issues and Informatics Opportunities

              Given the large-scale deployment of Electronic Health Records (EHR), secondary use of EHR data will be increasingly needed in all kinds of health services or clinical research. This paper reports some data quality issues we encountered in a survival analysis of pancreatic cancer patients. Using the clinical data warehouse at Columbia University Medical Center in the City of New York, we mined EHR data elements collected between 1999 and 2009 for a cohort of pancreatic cancer patients. Of the 3068 patients who had ICD-9-CM diagnoses for pancreatic cancer, only 1589 had corresponding disease documentation in pathology reports. Incompleteness was the leading data quality issue; many study variables had missing values to various degrees. Inaccuracy and inconsistency were the next common problems. In this paper, we present the manifestations of these data quality issues and discuss some strategies for using emerging informatics technologies to solve these problems.

                Author and article information

                Journal
                J Am Med Inform Assoc
                J Am Med Inform Assoc
                jamia
                amiajnl
                Journal of the American Medical Informatics Association : JAMIA
                BMJ Group (BMA House, Tavistock Square, London, WC1H 9JR )
                1067-5027
                1527-974X
                Jan-Feb 2013
                1 January 2013
                : 20
                : 1
                : 144-151
                Affiliations
                Department of Biomedical Informatics, Columbia University, New York, New York, USA
                Author notes
                [Correspondence to ] Nicole Gray Weiskopf, Department of Biomedical Informatics, Columbia University, 622 W 168th Street, VC-5, New York, NY 10032, USA; nicole.weiskopf@ 123456dbmi.columbia.edu
                Article
                amiajnl-2011-000681
                10.1136/amiajnl-2011-000681
                3555312
                22733976
                964ee660-af2f-4713-872f-d5d28455a95a
                Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions

                This is an open-access article distributed under the terms of the Creative Commons Attribution Non-commercial License, which permits use, distribution, and reproduction in any medium, provided the original work is properly cited, the use is non commercial and is otherwise in compliance with the license. See: http://creativecommons.org/licenses/by-nc/3.0/ and http://creativecommons.org/licenses/by-nc/3.0/legalcode

                History
                : 3 November 2011
                : 3 May 2012
                Categories
                Focus on Data Sharing
                1506
                Custom metadata
                unlocked

                Bioinformatics & Computational biology
                clinical research,knowledge bases,knowledge representations,knowledge acquisition,electronic health records,clinical research informatics,secondary use,data quality,knowledge acquisition and knowledge management,methods for integration of information from disparate sources

                Comments

                Comment on this article

                Related Documents Log