6
views
0
recommends
+1 Recommend
3 collections
    0
    shares

      Submit your digital health research with an established publisher
      - celebrating 25 years of open access

      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Automated Travel History Extraction From Clinical Notes for Informing the Detection of Emergent Infectious Disease Events: Algorithm Development and Validation

      research-article
      , MS 1 , 2 , , , DO 1 , 2 , , PhD 1 , 2 , , MS 1 , 2 , , MSPH 1 , 3 , , MT(ASCP) 4 , , PhD 1 , 2 , , MPH, PhD 4 , 5 , , MD 4 , 5 , 6 , , PharmD 7 , 8 , , MSCI, MD 1 , 2
      (Reviewer), (Reviewer), (Reviewer), (Reviewer)
      JMIR Public Health and Surveillance
      JMIR Publications
      natural language processing, machine learning, travel history, COVID-19, Zika, infectious disease surveillance, surveillance applications, biosurveillance, electronic health record

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Patient travel history can be crucial in evaluating evolving infectious disease events. Such information can be challenging to acquire in electronic health records, as it is often available only in unstructured text.

          Objective

          This study aims to assess the feasibility of annotating and automatically extracting travel history mentions from unstructured clinical documents in the Department of Veterans Affairs across disparate health care facilities and among millions of patients. Information about travel exposure augments existing surveillance applications for increased preparedness in responding quickly to public health threats.

          Methods

          Clinical documents related to arboviral disease were annotated following selection using a semiautomated bootstrapping process. Using annotated instances as training data, models were developed to extract from unstructured clinical text any mention of affirmed travel locations outside of the continental United States. Automated text processing models were evaluated, involving machine learning and neural language models for extraction accuracy.

          Results

          Among 4584 annotated instances, 2659 (58%) contained an affirmed mention of travel history, while 347 (7.6%) were negated. Interannotator agreement resulted in a document-level Cohen kappa of 0.776. Automated text processing accuracy (F1 85.6, 95% CI 82.5-87.9) and computational burden were acceptable such that the system can provide a rapid screen for public health events.

          Conclusions

          Automated extraction of patient travel history from clinical documents is feasible for enhanced passive surveillance public health systems. Without such a system, it would usually be necessary to manually review charts to identify recent travel or lack of travel, use an electronic health record that enforces travel history documentation, or ignore this potential source of information altogether. The development of this tool was initially motivated by emergent arboviral diseases. More recently, this system was used in the early phases of response to COVID-19 in the United States, although its utility was limited to a relatively brief window due to the rapid domestic spread of the virus. Such systems may aid future efforts to prevent and contain the spread of infectious diseases.

          Related collections

          Most cited references43

          • Record: found
          • Abstract: not found
          • Article: not found

          A Coefficient of Agreement for Nominal Scales

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Language models are unsupervised multitask learners

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Incorporating non-local information into information extraction systems by Gibbs sampling

                Bookmark

                Author and article information

                Contributors
                Journal
                JMIR Public Health Surveill
                JMIR Public Health Surveill
                JPH
                JMIR Public Health and Surveillance
                JMIR Publications (Toronto, Canada )
                2369-2960
                March 2021
                24 March 2021
                24 March 2021
                : 7
                : 3
                : e26719
                Affiliations
                [1 ] VA Salt Lake City Health Care System US Department of Veterans Affairs Salt Lake City, UT United States
                [2 ] Division of Epidemiology Department of Internal Medicine University of Utah Salt Lake City, UT United States
                [3 ] Department of Rocky Mountain Cancer Data Systems University of Utah Salt Lake City, UT United States
                [4 ] National Infectious Diseases Service Specialty Care Services US Department of Veterans Affairs Cincinnati, OH United States
                [5 ] Division of Infectious Diseases Department of Internal Medicine University of Cincinnati College of Medicine Cincinnati, OH United States
                [6 ] Cincinnati VA Medical Center US Department of Veterans Affairs Cincinnati, OH United States
                [7 ] Office of Biosurveillance Veterans Affairs Central Office US Department of Veterans Affairs Washington, DC United States
                [8 ] National Biosurveillance Integration Center Countering Weapons of Mass Destruction Department of Homeland Security Washington, DC United States
                Author notes
                Corresponding Author: Kelly S Peterson kelly.peterson2@ 123456va.gov
                Author information
                https://orcid.org/0000-0001-7803-6984
                https://orcid.org/0000-0003-4771-5452
                https://orcid.org/0000-0002-8717-5975
                https://orcid.org/0000-0003-3711-2682
                https://orcid.org/0000-0002-0883-6756
                https://orcid.org/0000-0001-8558-6431
                https://orcid.org/0000-0001-8933-5453
                https://orcid.org/0000-0001-7541-7976
                https://orcid.org/0000-0003-0608-7672
                https://orcid.org/0000-0001-5283-0987
                https://orcid.org/0000-0001-5580-6739
                Article
                v7i3e26719
                10.2196/26719
                7993087
                33759790
                2b8476a5-1f87-4381-9eff-1ab5f3d75f9e
                ©Kelly S Peterson, Julia Lewis, Olga V Patterson, Alec B Chapman, Daniel W Denhalter, Patricia A Lye, Vanessa W Stevens, Shantini D Gamage, Gary A Roselle, Katherine S Wallace, Makoto Jones. Originally published in JMIR Public Health and Surveillance (http://publichealth.jmir.org), 24.03.2021.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on http://publichealth.jmir.org, as well as this copyright and license information must be included.

                History
                : 23 December 2020
                : 18 January 2021
                : 5 February 2021
                : 12 February 2021
                Categories
                Original Paper
                Original Paper

                natural language processing,machine learning,travel history,covid-19,zika,infectious disease surveillance,surveillance applications,biosurveillance,electronic health record

                Comments

                Comment on this article