33
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A 0.3-2.6 TOPS/W Precision-Scalable Processor for Real-Time Large-Scale ConvNets

      Preprint
      ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          A low-power precision-scalable processor for ConvNets or convolutional neural networks (CNN) is implemented in a 40nm technology. Its 256 parallel processing units achieve a peak 102GOPS running at 204MHz. To minimize energy consumption while maintaining throughput, this works is the first to both exploit the sparsity of convolutions and to implement dynamic precision-scalability enabling supply- and energy scaling. The processor is fully C-programmable, consumes 25-288mW at 204 MHz and scales efficiency from 0.3-2.6 real TOPS/W. This system hereby outperforms the state-of-the-art up to 3.9x in energy efficiency.

          Related collections

          Author and article information

          Journal
          2016-06-16
          Article
          1606.05094
          4969e583-1d60-40af-8b0f-19ed2311cdff

          http://arxiv.org/licenses/nonexclusive-distrib/1.0/

          History
          Custom metadata
          paper C17p1
          Published at the Symposium on VLSI Circuits, 2016, Honolulu, HI, US
          cs.AR

          Hardware architecture
          Hardware architecture

          Comments

          Comment on this article