16
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Bayesian Inference in Monte-Carlo Tree Search

      Preprint

      Read this article at

          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Monte-Carlo Tree Search (MCTS) methods are drawing great interest after yielding breakthrough results in computer Go. This paper proposes a Bayesian approach to MCTS that is inspired by distributionfree approaches such as UCT [13], yet significantly differs in important respects. The Bayesian framework allows potentially much more accurate (Bayes-optimal) estimation of node values and node uncertainties from a limited number of simulation trials. We further propose propagating inference in the tree via fast analytic Gaussian approximation methods: this can make the overhead of Bayesian inference manageable in domains such as Go, while preserving high accuracy of expected-value estimates. We find substantial empirical outperformance of UCT in an idealized bandit-tree test environment, where we can obtain valuable insights by comparing with known ground truth. Additionally we rigorously prove on-policy and off-policy convergence of the proposed methods.

          Related collections

          Most cited references5

          • Record: found
          • Abstract: not found
          • Book Chapter: not found

          Bandit Based Monte-Carlo Planning

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            The Greatest of a Finite Set of Random Variables

              Bookmark
              • Record: found
              • Abstract: not found
              • Book Chapter: not found

              Single-Player Monte-Carlo Tree Search

                Bookmark

                Author and article information

                Journal
                1203.3519

                Machine learning,Artificial intelligence
                Machine learning, Artificial intelligence

                Comments

                Comment on this article