2
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Merging Data Diversity of Clinical Medical Records to Improve Effectiveness

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Medicine is a knowledge area continuously experiencing changes. Every day, discoveries and procedures are tested with the goal of providing improved service and quality of life to patients. With the evolution of computer science, multiple areas experienced an increase in productivity with the implementation of new technical solutions. Medicine is no exception. Providing healthcare services in the future will involve the storage and manipulation of large volumes of data (big data) from medical records, requiring the integration of different data sources, for a multitude of purposes, such as prediction, prevention, personalization, participation, and becoming digital. Data integration and data sharing will be essential to achieve these goals. Our work focuses on the development of a framework process for the integration of data from different sources to increase its usability potential. We integrated data from an internal hospital database, external data, and also structured data resulting from natural language processing (NPL) applied to electronic medical records. An extract-transform and load (ETL) process was used to merge different data sources into a single one, allowing more effective use of these data and, eventually, contributing to more efficient use of the available resources.

          Related collections

          Most cited references42

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research

          Objective To review the methods and dimensions of data quality assessment in the context of electronic health record (EHR) data reuse for research. Materials and methods A review of the clinical research literature discussing data quality assessment methodology for EHR data was performed. Using an iterative process, the aspects of data quality being measured were abstracted and categorized, as well as the methods of assessment used. Results Five dimensions of data quality were identified, which are completeness, correctness, concordance, plausibility, and currency, and seven broad categories of data quality assessment methods: comparison with gold standards, data element agreement, data source agreement, distribution comparison, validity checks, log review, and element presence. Discussion Examination of the methods by which clinical researchers have investigated the quality and suitability of EHR data for research shows that there are fundamental features of data quality, which may be difficult to measure, as well as proxy dimensions. Researchers interested in the reuse of EHR data for clinical research are recommended to consider the adoption of a consistent taxonomy of EHR data quality, to remain aware of the task-dependence of data quality, to integrate work on data quality assessment from other fields, and to adopt systematic, empirically driven, statistically based methods of data quality assessment. Conclusion There is currently little consistency or potential generalizability in the methods used to assess EHR data quality. If the reuse of EHR data for clinical research is to become accepted, researchers should adopt validated, systematic methods of EHR data quality assessment.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            The Value of Unstructured Electronic Health Record Data in Geriatric Syndrome Case Identification

              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Data Processing and Text Mining Technologies on Electronic Medical Records: A Review

              Currently, medical institutes generally use EMR to record patient's condition, including diagnostic information, procedures performed, and treatment results. EMR has been recognized as a valuable resource for large-scale analysis. However, EMR has the characteristics of diversity, incompleteness, redundancy, and privacy, which make it difficult to carry out data mining and analysis directly. Therefore, it is necessary to preprocess the source data in order to improve data quality and improve the data mining results. Different types of data require different processing technologies. Most structured data commonly needs classic preprocessing technologies, including data cleansing, data integration, data transformation, and data reduction. For semistructured or unstructured data, such as medical text, containing more health information, it requires more complex and challenging processing methods. The task of information extraction for medical texts mainly includes NER (named-entity recognition) and RE (relation extraction). This paper focuses on the process of EMR processing and emphatically analyzes the key techniques. In addition, we make an in-depth study on the applications developed based on text mining together with the open challenges and research issues for future work.
                Bookmark

                Author and article information

                Journal
                Int J Environ Res Public Health
                Int J Environ Res Public Health
                ijerph
                International Journal of Environmental Research and Public Health
                MDPI
                1661-7827
                1660-4601
                03 March 2019
                March 2019
                : 16
                : 5
                : 769
                Affiliations
                [1 ]Logistics, Molde University College, Molde, NO-6410 Molde, Norway; Berit.I.Helgheim@ 123456hiMolde.no
                [2 ]DEI, Instituto Superior Técnico, 1049-001 Lisboa, Portugal; rui.maia@ 123456tecnico.ulisboa.pt
                [3 ]Instituto Universitário de Lisboa (ISCTE-IUL), ISTAR-IUL, 1649-026 Lisbon, Portugal
                [4 ]Instituto Universitário de Lisboa (ISCTE-IUL), BRU-IUL, 1649-026 Lisbon, Portugal; almartins@ 123456iscte-iul.pt
                Author notes
                [* ]Correspondence: jcafa@ 123456iscte-iul.pt ; Tel.: +35-1910969885
                Author information
                https://orcid.org/0000-0003-1600-7867
                https://orcid.org/0000-0002-6662-0806
                Article
                ijerph-16-00769
                10.3390/ijerph16050769
                6427263
                30832447
                f0632bc6-8104-40fe-a8d5-280328d404b8
                © 2019 by the authors.

                Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( http://creativecommons.org/licenses/by/4.0/).

                History
                : 30 December 2018
                : 24 February 2019
                Categories
                Article

                Public health
                big data,data,etl,framework,integration,knowledge,medical records,extract-transform and load
                Public health
                big data, data, etl, framework, integration, knowledge, medical records, extract-transform and load

                Comments

                Comment on this article