6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Hardness of Learning Halfspaces with Massart Noise

      Preprint
      ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          We study the complexity of PAC learning halfspaces in the presence of Massart (bounded) noise. Specifically, given labeled examples \((x, y)\) from a distribution \(D\) on \(\mathbb{R}^{n} \times \{ \pm 1\}\) such that the marginal distribution on \(x\) is arbitrary and the labels are generated by an unknown halfspace corrupted with Massart noise at rate \(\eta<1/2\), we want to compute a hypothesis with small misclassification error. Characterizing the efficient learnability of halfspaces in the Massart model has remained a longstanding open problem in learning theory. Recent work gave a polynomial-time learning algorithm for this problem with error \(\eta+\epsilon\). This error upper bound can be far from the information-theoretically optimal bound of \(\mathrm{OPT}+\epsilon\). More recent work showed that {\em exact learning}, i.e., achieving error \(\mathrm{OPT}+\epsilon\), is hard in the Statistical Query (SQ) model. In this work, we show that there is an exponential gap between the information-theoretically optimal error and the best error that can be achieved by a polynomial-time SQ algorithm. In particular, our lower bound implies that no efficient SQ algorithm can approximate the optimal error within any polynomial factor.

          Related collections

          Author and article information

          Journal
          17 December 2020
          Article
          2012.09720
          84a9d04e-6d85-45ed-892d-8362ffd423cc

          http://arxiv.org/licenses/nonexclusive-distrib/1.0/

          History
          Custom metadata
          cs.LG cs.CC math.ST stat.ML stat.TH

          Theoretical computer science,Machine learning,Artificial intelligence,Statistics theory

          Comments

          Comment on this article