13
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Bayesian Inference in Monte-Carlo Tree Search

      Preprint
      , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Monte-Carlo Tree Search (MCTS) methods are drawing great interest after yielding breakthrough results in computer Go. This paper proposes a Bayesian approach to MCTS that is inspired by distributionfree approaches such as UCT [13], yet significantly differs in important respects. The Bayesian framework allows potentially much more accurate (Bayes-optimal) estimation of node values and node uncertainties from a limited number of simulation trials. We further propose propagating inference in the tree via fast analytic Gaussian approximation methods: this can make the overhead of Bayesian inference manageable in domains such as Go, while preserving high accuracy of expected-value estimates. We find substantial empirical outperformance of UCT in an idealized bandit-tree test environment, where we can obtain valuable insights by comparing with known ground truth. Additionally we rigorously prove on-policy and off-policy convergence of the proposed methods.

          Related collections

          Most cited references5

          • Record: found
          • Abstract: not found
          • Book Chapter: not found

          Bandit Based Monte-Carlo Planning

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            The Greatest of a Finite Set of Random Variables

              Bookmark
              • Record: found
              • Abstract: not found
              • Book Chapter: not found

              Single-Player Monte-Carlo Tree Search

                Bookmark

                Author and article information

                Journal
                15 March 2012
                Article
                1203.3519
                75ef56bf-2da2-4e3a-9f36-82c2a560bfaa

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                UAI-P-2010-PG-580-588
                Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)
                cs.LG cs.AI stat.ML
                auai

                Comments

                Comment on this article