1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Depth with Nonlinearity Creates No Bad Local Minima in ResNets

      Preprint
      ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          In this paper, we prove that depth with nonlinearity creates no bad local minima in a type of arbitrarily deep ResNets studied in previous work, in the sense that the values of all local minima are no worse than the global minima values of corresponding shallow linear predictors with arbitrary fixed features, and are guaranteed to further improve via residual representations. As a result, this paper provides an affirmative answer to an open question stated in a paper in the conference on Neural Information Processing Systems (NIPS) 2018. We note that even though our paper advances the theoretical foundation of deep learning and non-convex optimization, there is still a gap between theory and many practical deep learning applications.

          Related collections

          Most cited references6

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Aggregated Residual Transformations for Deep Neural Networks

            Bookmark
            • Record: found
            • Abstract: not found
            • Book Chapter: not found

            Identity Mappings in Deep Residual Networks

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Universal approximation bounds for superpositions of a sigmoidal function

                Bookmark

                Author and article information

                Journal
                21 October 2018
                Article
                1810.09038
                b356477d-d2da-44a1-8d95-aef68870915c

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                stat.ML cs.AI cs.LG math.OC

                Numerical methods,Machine learning,Artificial intelligence
                Numerical methods, Machine learning, Artificial intelligence

                Comments

                Comment on this article