25
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Open access image repositories: high-quality data to enable machine learning research

      research-article

      Read this article at

      ScienceOpenPublisherPMC
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Originally motivated by the need for research reproducibility and data reuse, large-scale, open access information repositories have become key resources for training and testing of advanced machine learning applications in biomedical and clinical research. To be of value, such repositories must provide large, high-quality data sets, where quality is defined as minimising variance due to data collection protocols and data misrepresentations. Curation is the key to quality. We have constructed a large public access image repository, The Cancer Imaging Archive, dedicated to the promotion of open science to advance the global effort to diagnose and treat cancer. Drawing on this experience and our experience in applying machine learning techniques to the analysis of radiology and pathology image data, we will review the requirements placed on such information repositories by state-of-the-art machine learning applications and how these requirements can be met.

          Related collections

          Author and article information

          Journal
          1306016
          3069
          Clin Radiol
          Clin Radiol
          Clinical radiology
          0009-9260
          1365-229X
          28 April 2019
          28 April 2019
          January 2020
          01 January 2021
          : 75
          : 1
          : 7-12
          Affiliations
          [a ]Department of Biomedical Informatics, University of Arkansas for Medical Sciences, 4301 W. Markham St, Little Rock, AR 72205, USA
          [b ]National Institutes of Health, National Cancer Institute, 9609 Medical Center Drive, Bethesda, MD 20892, USA
          [c ]Department of Biomedical Informatics, Emory University, 101 Woodruff Circle, #4104, Atlanta, GA 30322, USA
          [d ]Department of Biomedical Informatics, Stoney Brook University, Health Science Center Level 3, Room 043, Stony Brook, NY 11794, USA
          [e ]Department of Radiation Oncology, University of Massachusetts Medical School, Worcester, MA 01655, USA
          Author notes

          Author Contributions

          1. guarantor of integrity of the entire study - Fred Prior

          2. study concepts and design – Fred Prior, Kirk Smith

          3. literature research - All Authors

          4. clinical studies - N/A

          5. experimental studies / data analysis - N/A

          6. statistical analysis - N/A

          7. manuscript preparation - All Authors

          8. manuscript editing - Fred Prior

          [* ]Guarantor and correspondent: Fred Prior, Department of Biomedical Informatics, University of Arkansas for Medical Sciences, 4301 W. Markham St, Little Rock, Arkansas, 72205, USA. Tel.: +1 501 686 5966, fwprior@ 123456uams.edu
          Article
          PMC6815686 PMC6815686 6815686 nihpa1528096
          10.1016/j.crad.2019.04.002
          6815686
          31040006
          58a17d23-340c-436f-a02a-14bd6701a6ae
          History
          Categories
          Article

          Comments

          Comment on this article