2
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Introduction to Multi-Armed Bandits

      Preprint

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Multi-armed bandits a simple but very powerful framework for algorithms that make decisions over time under uncertainty. An enormous body of work has accumulated over the years, covered in several books and surveys. This book provides a more introductory, textbook-like treatment of the subject. Each chapter tackles a particular line of work, providing a self-contained, teachable technical introduction and a review of the more advanced results. The chapters are as follows: Stochastic bandits; Lower bounds; Bayesian Bandits and Thompson Sampling; Lipschitz Bandits; Full Feedback and Adversarial Costs; Adversarial Bandits; Linear Costs and Semi-bandits; Contextual Bandits; Bandits and Zero-Sum Games; Bandits with Knapsacks; Incentivized Exploration and Connections to Mechanism Design. Status of the manuscript: essentially complete (modulo some polishing), except for last chapter, which the author plans to add over the next few months.

          Related collections

          Most cited references19

          • Record: found
          • Abstract: not found
          • Article: not found

          The Nonstochastic Multiarmed Bandit Problem

            Bookmark
            • Record: found
            • Abstract: not found
            • Book Chapter: not found

            Bandit Based Monte-Carlo Planning

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Subjectivity and correlation in randomized strategies

                Bookmark

                Author and article information

                Journal
                15 April 2019
                Article
                1904.07272
                931549f6-df2d-4d30-a047-3ab269f035c3

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                cs.LG cs.AI cs.DS stat.ML

                Data structures & Algorithms,Machine learning,Artificial intelligence
                Data structures & Algorithms, Machine learning, Artificial intelligence

                Comments

                Comment on this article