168
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found

      Evaluation of prostate segmentation algorithms for MRI: The PROMISE12 challenge

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Prostate MRI image segmentation has been an area of intense research due to the increased use of MRI as a modality for the clinical workup of prostate cancer. Segmentation is useful for various tasks, e.g. to accurately localize prostate boundaries for radiotherapy or to initialize multi-modal registration algorithms. In the past, it has been difficult for research groups to evaluate prostate segmentation algorithms on multi-center, multi-vendor and multi-protocol data. Especially because we are dealing with MR images, image appearance, resolution and the presence of artifacts are affected by differences in scanners and/or protocols, which in turn can have a large influence on algorithm accuracy. The Prostate MR Image Segmentation (PROMISE12) challenge was setup to allow a fair and meaningful comparison of segmentation methods on the basis of performance and robustness. In this work we will discuss the initial results of the online PROMISE12 challenge, and the results obtained in the live challenge workshop hosted by the MICCAI2012 conference. In the challenge, 100 prostate MR cases from 4 different centers were included, with differences in scanner manufacturer, field strength and protocol. A total of 11 teams from academic research groups and industry participated. Algorithms showed a wide variety in methods and implementation, including active appearance models, atlas registration and level sets. Evaluation was performed using boundary and volume based metrics which were combined into a single score relating the metrics to human expert performance. The winners of the challenge where the algorithms by teams Imorphics and ScrAutoProstate, with scores of 85.72 and 84.29 overall. Both algorithms where significantly better than all other algorithms in the challenge (p<0.05) and had an efficient implementation with a run time of 8min and 3s per case respectively. Overall, active appearance model based approaches seemed to outperform other approaches like multi-atlas registration, both on accuracy and computation time. Although average algorithm performance was good to excellent and the Imorphics algorithm outperformed the second observer on average, we showed that algorithm combination might lead to further improvement, indicating that optimal performance for prostate segmentation is not yet obtained. All results are available online at http://promise12.grand-challenge.org/. Copyright © 2013 Elsevier B.V. All rights reserved.

          Related collections

          Most cited references46

          • Record: found
          • Abstract: found
          • Article: not found

          ESUR prostate MR guidelines 2012

          The aim was to develop clinical guidelines for multi-parametric MRI of the prostate by a group of prostate MRI experts from the European Society of Urogenital Radiology (ESUR), based on literature evidence and consensus expert opinion. True evidence-based guidelines could not be formulated, but a compromise, reflected by “minimal” and “optimal” requirements has been made. The scope of these ESUR guidelines is to promulgate high quality MRI in acquisition and evaluation with the correct indications for prostate cancer across the whole of Europe and eventually outside Europe. The guidelines for the optimal technique and three protocols for “detection”, “staging” and “node and bone” are presented. The use of endorectal coil vs. pelvic phased array coil and 1.5 vs. 3 T is discussed. Clinical indications and a PI-RADS classification for structured reporting are presented. Key Points • This report provides guidelines for magnetic resonance imaging (MRI) in prostate cancer. • Clinical indications, and minimal and optimal imaging acquisition protocols are provided. • A structured reporting system (PI-RADS) is described.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Marching cubes: A high resolution 3D surface construction algorithm

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Simultaneous truth and performance level estimation (STAPLE): an algorithm for the validation of image segmentation.

              Characterizing the performance of image segmentation approaches has been a persistent challenge. Performance analysis is important since segmentation algorithms often have limited accuracy and precision. Interactive drawing of the desired segmentation by human raters has often been the only acceptable approach, and yet suffers from intra-rater and inter-rater variability. Automated algorithms have been sought in order to remove the variability introduced by raters, but such algorithms must be assessed to ensure they are suitable for the task. The performance of raters (human or algorithmic) generating segmentations of medical images has been difficult to quantify because of the difficulty of obtaining or estimating a known true segmentation for clinical data. Although physical and digital phantoms can be constructed for which ground truth is known or readily estimated, such phantoms do not fully reflect clinical images due to the difficulty of constructing phantoms which reproduce the full range of imaging characteristics and normal and pathological anatomical variability observed in clinical data. Comparison to a collection of segmentations by raters is an attractive alternative since it can be carried out directly on the relevant clinical imaging data. However, the most appropriate measure or set of measures with which to compare such segmentations has not been clarified and several measures are used in practice. We present here an expectation-maximization algorithm for simultaneous truth and performance level estimation (STAPLE). The algorithm considers a collection of segmentations and computes a probabilistic estimate of the true segmentation and a measure of the performance level represented by each segmentation. The source of each segmentation in the collection may be an appropriately trained human rater or raters, or may be an automated segmentation algorithm. The probabilistic estimate of the true segmentation is formed by estimating an optimal combination of the segmentations, weighting each segmentation depending upon the estimated performance level, and incorporating a prior model for the spatial distribution of structures being segmented as well as spatial homogeneity constraints. STAPLE is straightforward to apply to clinical imaging data, it readily enables assessment of the performance of an automated image segmentation algorithm, and enables direct comparison of human rater and algorithm performance.
                Bookmark

                Author and article information

                Journal
                Medical Image Analysis
                Medical Image Analysis
                Elsevier BV
                13618415
                February 2014
                February 2014
                : 18
                : 2
                : 359-373
                Article
                10.1016/j.media.2013.12.002
                4137968
                24418598
                8889a2f7-62f0-4f42-bc52-7f233093c36f
                © 2014

                https://www.elsevier.com/tdm/userlicense/1.0/

                History

                Comments

                Comment on this article