2
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      ISP4ML: Understanding the Role of Image Signal Processing in Efficient Deep Learning Vision Systems

      Preprint

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Convolutional neural networks (CNNs) are now predominant components in a variety of computer vision (CV) systems. These systems typically include an image signal processor (ISP), even though the ISP is traditionally designed to produce images that look appealing to humans. In CV systems, it is not clear what the role of the ISP is, or if it is even required at all for accurate prediction. In this work, we investigate the efficacy of the ISP in CNN classification tasks, and outline the system-level trade-offs between prediction accuracy and computational cost. To do so, we build software models of a configurable ISP and an imaging sensor in order to train CNNs on ImageNet with a range of different ISP settings and functionality. Results on ImageNet show that an ISP improves accuracy by 4.6%-12.2% on MobileNet architectures of different widths. Results using ResNets demonstrate that these trends also generalize to deeper networks. An ablation study of the various processing stages in a typical ISP reveals that the tone mapper is the most significant stage when operating on high dynamic range (HDR) images, by providing 5.8% average accuracy improvement alone. Overall, the ISP benefits system efficiency because the memory and computational costs of the ISP is minimal compared to the cost of using a larger CNN to achieve the same accuracy.

          Related collections

          Most cited references7

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Distillation as a Defense to Adversarial Perturbations Against Deep Neural Networks

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Eyeriss v2: A Flexible Accelerator for Emerging Deep Neural Networks on Mobile Devices

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              RedEye: Analog ConvNet Image Sensor Architecture for Continuous Mobile Vision

                Bookmark

                Author and article information

                Journal
                18 November 2019
                Article
                1911.07954
                8e5a036f-66b1-4859-ad91-586be6e27404

                http://creativecommons.org/publicdomain/zero/1.0/

                History
                Custom metadata
                13 pages, 11 figures
                eess.IV cs.CV cs.LG

                Computer vision & Pattern recognition,Artificial intelligence,Electrical engineering

                Comments

                Comment on this article