81
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Gradient Estimation Using Stochastic Computation Graphs

      Preprint
      , , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          In a variety of problems originating in supervised, unsupervised, and reinforcement learning, the loss function is defined by an expectation over a collection of random variables, which might be part of a probabilistic model or the external world. Estimating the gradient of this loss function, using samples, lies at the core of gradient-based learning algorithms for these problems. We introduce the formalism of stochastic computation graphs---directed acyclic graphs that include both deterministic functions and conditional probability distributions---and describe how to easily and automatically derive an unbiased estimator of the loss function's gradient. The resulting algorithm for computing the gradient estimator is a simple modification of the standard backpropagation algorithm. The generic scheme we propose unifies estimators derived in variety of prior work, along with variance-reduction techniques therein. It could assist researchers in developing intricate models involving a combination of stochastic and deterministic operations, enabling, for example, attention, memory, and control actions.

          Related collections

          Author and article information

          Journal
          2015-06-17
          2016-01-05
          Article
          1506.05254
          156dab0e-75d8-4129-ba61-9a77c7d5d0dd

          http://arxiv.org/licenses/nonexclusive-distrib/1.0/

          History
          Custom metadata
          Advances in Neural Information Processing Systems 28 (NIPS 2015)
          cs.LG

          Artificial intelligence
          Artificial intelligence

          Comments

          Comment on this article