0
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A downsampling strategy to assess the predictive value of radiomic features

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Many studies are devoted to the design of radiomic models for a prediction task. When no effective model is found, it is often difficult to know whether the radiomic features do not include information relevant to the task or because of insufficient data. We propose a downsampling method to answer that question when considering a classification task into two groups. Using two large patient cohorts, several experimental configurations involving different numbers of patients were created. Univariate or multivariate radiomic models were designed from each configuration. Their performance as reflected by the Youden index (YI) and Area Under the receiver operating characteristic Curve (AUC) was compared to the stable performance obtained with the highest number of patients. A downsampling method is described to predict the YI and AUC achievable with a large number of patients. Using the multivariate models involving machine learning, YI and AUC increased with the number of patients while they decreased for univariate models. The downsampling method better estimated YI and AUC obtained with the largest number of patients than the YI and AUC obtained using the number of available patients and identifies the lack of information relevant to the classification task when no such information exists.

          Related collections

          Most cited references12

          • Record: found
          • Abstract: found
          • Article: not found

          LIFEx: A Freeware for Radiomic Feature Calculation in Multimodality Imaging to Accelerate Advances in the Characterization of Tumor Heterogeneity

          Textural and shape analysis is gaining considerable interest in medical imaging, particularly to identify parameters characterizing tumor heterogeneity and to feed radiomic models. Here, we present a free, multiplatform, and easy-to-use freeware called LIFEx, which enables the calculation of conventional, histogram-based, textural, and shape features from PET, SPECT, MR, CT, and US images, or from any combination of imaging modalities. The application does not require any programming skills and was developed for medical imaging professionals. The goal is that independent and multicenter evidence of the usefulness and limitations of radiomic features for characterization of tumor heterogeneity and subsequent patient management can be gathered. Many options are offered for interactive textural index calculation and for increasing the reproducibility among centers. The software already benefits from a large user community (more than 800 registered users), and interactions within that community are part of the development strategy.Significance: This study presents a user-friendly, multi-platform freeware to extract radiomic features from PET, SPECT, MR, CT, and US images, or any combination of imaging modalities. Cancer Res; 78(16); 4786-9. ©2018 AACR.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            The effects of species’ range sizes on the accuracy of distribution models: ecological phenomenon or statistical artefact?

              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Predicting sample size required for classification performance

              Background Supervised learning methods need annotated data in order to generate efficient models. Annotated data, however, is a relatively scarce resource and can be expensive to obtain. For both passive and active learning methods, there is a need to estimate the size of the annotated sample required to reach a performance target. Methods We designed and implemented a method that fits an inverse power law model to points of a given learning curve created using a small annotated training set. Fitting is carried out using nonlinear weighted least squares optimization. The fitted model is then used to predict the classifier's performance and confidence interval for larger sample sizes. For evaluation, the nonlinear weighted curve fitting method was applied to a set of learning curves generated using clinical text and waveform classification tasks with active and passive sampling methods, and predictions were validated using standard goodness of fit measures. As control we used an un-weighted fitting method. Results A total of 568 models were fitted and the model predictions were compared with the observed performances. Depending on the data set and sampling method, it took between 80 to 560 annotated samples to achieve mean average and root mean squared error below 0.01. Results also show that our weighted fitting method outperformed the baseline un-weighted method (p < 0.05). Conclusions This paper describes a simple and effective sample size prediction algorithm that conducts weighted fitting of learning curves. The algorithm outperformed an un-weighted algorithm described in previous literature. It can help researchers determine annotation sample size for supervised machine learning.
                Bookmark

                Author and article information

                Contributors
                dirandannesophie@gmail.com
                Journal
                Sci Rep
                Sci Rep
                Scientific Reports
                Nature Publishing Group UK (London )
                2045-2322
                28 November 2019
                28 November 2019
                2019
                : 9
                : 17869
                Affiliations
                ISNI 0000 0001 2171 2558, GRID grid.5842.b, Imagerie Moléculaire In Vivo, CEA-SHJF, Inserm, CNRS, , Université Paris-Sud, Université Paris-Saclay, ; Orsay, France
                Author information
                http://orcid.org/0000-0002-0348-1933
                http://orcid.org/0000-0002-7288-1543
                Article
                54190
                10.1038/s41598-019-54190-2
                6883070
                31780708
                91f233b6-54dc-48d9-a82c-9377b6130f77
                © The Author(s) 2019

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 7 June 2019
                : 23 October 2019
                Categories
                Article
                Custom metadata
                © The Author(s) 2019

                Uncategorized
                predictive markers,mathematics and computing
                Uncategorized
                predictive markers, mathematics and computing

                Comments

                Comment on this article