Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Remarkable progress has been made in image recognition, primarily due to the availability of large-scale annotated datasets and deep convolutional neural networks (CNNs). CNNs enable learning data-driven, highly representative, hierarchical image features from sufficient training data. However, obtaining datasets as comprehensively annotated as ImageNet in the medical imaging domain remains a challenge. There are currently three major techniques that successfully employ CNNs to medical image classification: training the CNN from scratch, using off-the-shelf pre-trained CNN features, and conducting unsupervised CNN pre-training with supervised fine-tuning. Another effective method is transfer learning, i.e., fine-tuning CNN models pre-trained from natural image dataset to medical image tasks. In this paper, we exploit three important, but previously understudied factors of employing deep convolutional neural networks to computer-aided detection problems. We first explore and evaluate different CNN architectures. The studied models contain 5 thousand to 160 million parameters, and vary in numbers of layers. We then evaluate the influence of dataset scale and spatial image context on performance. Finally, we examine when and why transfer learning from pre-trained ImageNet (via fine-tuning) can be useful. We study two specific computer-aided detection (CADe) problems, namely thoraco-abdominal lymph node (LN) detection and interstitial lung disease (ILD) classification. We achieve the state-of-the-art performance on the mediastinal LN detection, and report the first five-fold cross-validation classification results on predicting axial CT slices with ILD categories. Our extensive empirical evaluation, CNN model analysis and valuable insights can be extended to the design of high performance CAD systems for other medical imaging tasks.

Related collections

Most cited references 61

Record: found
Abstract: found
Article: not found

The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS).

Bjoern H. Menze, Andras Jakab, Stefan Bauer … (2016)

In this paper we report the set-up and results of the Multimodal Brain Tumor Image Segmentation Benchmark (BRATS) organized in conjunction with the MICCAI 2012 and 2013 conferences. Twenty state-of-the-art tumor segmentation algorithms were applied to a set of 65 multi-contrast MR scans of low- and high-grade glioma patients-manually annotated by up to four raters-and to 65 comparable scans generated using tumor image simulation software. Quantitative evaluations revealed considerable disagreement between the human raters in segmenting various tumor sub-regions (Dice scores in the range 74%-85%), illustrating the difficulty of this task. We found that different algorithms worked best for different sub-regions (reaching performance comparable to human inter-rater variability), but that no single algorithm ranked in the top for all sub-regions simultaneously. Fusing several good algorithms using a hierarchical majority vote yielded segmentations that consistently ranked above all individual algorithms, indicating remaining opportunities for further methodological improvements. The BRATS image data and manual annotations continue to be publicly available through an online evaluation system as an ongoing benchmarking resource.

0 comments Cited 974 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

The Pascal Visual Object Classes Challenge: A Retrospective

Mark Everingham, S. M. Ali Eslami, Luc Van Gool … (2015)

0 comments Cited 579 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Learning hierarchical features for scene labeling.

Clément Farabet, Camille Couprie, Laurent Najman … (2013)

Scene labeling consists of labeling each pixel in an image with the category of the object it belongs to. We propose a method that uses a multiscale convolutional network trained from raw pixels to extract dense feature vectors that encode regions of multiple sizes centered on each pixel. The method alleviates the need for engineered features, and produces a powerful representation that captures texture, shape, and contextual information. We report results using multiple postprocessing methods to produce the final labeling. Among those, we propose a technique to automatically retrieve, from a pool of segmentation components, an optimal set of components that best explain the scene; these components are arbitrary, for example, they can be taken from a segmentation tree or from any family of oversegmentations. The system yields record accuracies on the SIFT Flow dataset (33 classes) and the Barcelona dataset (170 classes) and near-record accuracy on Stanford background dataset (eight classes), while being an order of magnitude faster than competing approaches, producing a $(320\times 240)$ image labeling in less than a second, including feature extraction.

0 comments Cited 499 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Shin Hoo-Chang Shin Hoo-Chang

Roth Holger R. Roth Holger R.

Gao Mingchen Gao Mingchen

Lu Le Lu Le

Xu Ziyue Xu Ziyue

Nogues Isabella Nogues Isabella

Yao Jianhua Yao Jianhua

Mollura Daniel Mollura Daniel

Summers Ronald M. Summers Ronald M.

Journal

Journal ID (nlm-ta): IEEE Trans Med Imaging

Journal ID (iso-abbrev): IEEE Trans Med Imaging

Journal ID (publisher-id): 0048700

Journal ID (coden): ITMID4

Journal ID (publisher-id): TMI

Title: Ieee Transactions on Medical Imaging

Publisher: IEEE

ISSN (Print): 0278-0062

ISSN (Electronic): 1558-254X

Publication date Collection: May 2016

Publication date (Electronic): 11 February 2016

Volume: 35

Issue: 5

Pages: 1285-1298

Affiliations

[1 ] divisionImaging Biomarkers and Computer-Aided Diagnosis Laboratory;

[2 ] institutionCenter for Infectious Disease Imaging;

[3 ] institutionNational Institutes of Health Clinical Center, divisionClinical Image Processing Service, departmentRadiology and Imaging Sciences Department; Bethesda MD 20892-1182 USA

Article

DOI: 10.1109/TMI.2016.2528162

PMC ID: 4890616

PubMed ID: 26886976

SO-VID: 92be66ae-1249-4359-a84b-8b68e185afe8

License:

This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/

History

Date received : 08 January 2016

Date revision received : 04 February 2016

Date accepted : 05 February 2016

Date: 29 April 2016

Page count

Figures: 13, Tables: 8, Equations: 0, References: 73, Pages: 14

Funding

Funded by: Intramural Research Program of the National Institutes of Health Clinical Center;

Funded by: KRIBB Research Initiative Program (Korean Biomedical Scientist Fellowship Program), Korea Research Institute of Bioscience and Biotechnology, Republic of Korea;

This work was supported in part by the Intramural Research Program of the National Institutes of Health Clinical Center, and in part by a grant from the KRIBB Research Initiative Program (Korean Biomedical Scientist Fellowship Program), Korea Research Institute of Bioscience and Biotechnology, Republic of Korea.

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Read this article at

Abstract

Related collections

IEEE: COVID-19 related research

Most cited references 61

The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS).

The Pascal Visual Object Classes Challenge: A Retrospective

Learning hierarchical features for scene labeling.

Author and article information

Contributors

Journal

Affiliations

Article

History

Page count

Funding

Categories

Comments

Comment on this article

Similar content 162

Cited by 970