1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Resource Constrained Neural Network Architecture Search

      Preprint
      , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The design of neural network architectures is frequently either based on human expertise using trial/error and empirical feedback or tackled via large scale reinforcement learning strategies run over distinct discrete architecture choices. In the latter case, the optimization task is non-differentiable and also not very amenable to derivative-free optimization methods. Most methods in use today require exorbitant computational resources. And if we want networks that additionally satisfy resource constraints, the above challenges are exacerbated because the search procedure must now balance accuracy with certain budget constraints on resources. We formulate this problem as the optimization of a set function - we find that the empirical behavior of this set function often (but not always) satisfies marginal gain and monotonicity principles - properties central to the idea of submodularity. Based on this observation, we adapt algorithms that are well-known within discrete optimization to obtain heuristic schemes for neural network architecture search, with resource constraints on the architecture. This simple scheme when applied on CIFAR-100 and ImageNet, identifies resource-constrained architectures with quantifiably better performance than current state-of-the-art models designed for mobile devices. Specifically, we find high-performing architectures with fewer parameters and computations by a search method that is much faster.

          Related collections

          Most cited references2

          • Record: found
          • Abstract: not found
          • Book Chapter: not found

          What Energy Functions Can Be Minimized via Graph Cuts?

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Shuffle Net: an application of generalized perfect shuffles to multihop lightwave networks

              Bookmark

              Author and article information

              Journal
              07 April 2019
              Article
              1904.03786
              573f17a1-75b5-4eee-9f63-20cc939dccff

              http://arxiv.org/licenses/nonexclusive-distrib/1.0/

              History
              Custom metadata
              cs.CV

              Computer vision & Pattern recognition
              Computer vision & Pattern recognition

              Comments

              Comment on this article