21
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Cerebellar climbing fibers encode expected reward size

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Climbing fiber inputs to the cerebellum encode error signals that instruct learning. Recently, evidence has accumulated to suggest that the cerebellum is also involved in the processing of reward. To study how rewarding events are encoded, we recorded the activity of climbing fibers when monkeys were engaged in an eye movement task. At the beginning of each trial, the monkeys were cued to the size of the reward that would be delivered upon successful completion of the trial. Climbing fiber activity increased when the monkeys were presented with a cue indicating a large reward, but not a small reward. Reward size did not modulate activity at reward delivery or during eye movements. Comparison between climbing fiber and simple spike activity indicated different interactions for coding of movement and reward. These results indicate that climbing fibers encode the expected reward size and suggest a general role of the cerebellum in associative learning beyond error correction.

          Related collections

          Most cited references37

          • Record: found
          • Abstract: found
          • Article: not found

          A theory of cerebellar cortex.

          D. Marr (1969)
          1. A detailed theory of cerebellar cortex is proposed whose consequence is that the cerebellum learns to perform motor skills. Two forms of input-output relation are described, both consistent with the cortical theory. One is suitable for learning movements (actions), and the other for learning to maintain posture and balance (maintenance reflexes).2. It is known that the cells of the inferior olive and the cerebellar Purkinje cells have a special one-to-one relationship induced by the climbing fibre input. For learning actions, it is assumed that:(a) each olivary cell responds to a cerebral instruction for an elemental movement. Any action has a defining representation in terms of elemental movements, and this representation has a neural expression as a sequence of firing patterns in the inferior olive; and(b) in the correct state of the nervous system, a Purkinje cell can initiate the elemental movement to which its corresponding olivary cell responds.3. Whenever an olivary cell fires, it sends an impulse (via the climbing fibre input) to its corresponding Purkinje cell. This Purkinje cell is also exposed (via the mossy fibre input) to information about the context in which its olivary cell fired; and it is shown how, during rehearsal of an action, each Purkinje cell can learn to recognize such contexts. Later, when the action has been learnt, occurrence of the context alone is enough to fire the Purkinje cell, which then causes the next elemental movement. The action thus progresses as it did during rehearsal.4. It is shown that an interpretation of cerebellar cortex as a structure which allows each Purkinje cell to learn a number of contexts is consistent both with the distributions of the various types of cell, and with their known excitatory or inhibitory natures. It is demonstrated that the mossy fibre-granule cell arrangement provides the required pattern discrimination capability.5. The following predictions are made.(a) The synapses from parallel fibres to Purkinje cells are facilitated by the conjunction of presynaptic and climbing fibre (or post-synaptic) activity.(b) No other cerebellar synapses are modifiable.(c) Golgi cells are driven by the greater of the inputs from their upper and lower dendritic fields.6. For learning maintenance reflexes, 2(a) and 2(b) are replaced by2'. Each olivary cell is stimulated by one or more receptors, all of whose activities are usually reduced by the results of stimulating the corresponding Purkinje cell.7. It is shown that if (2') is satisfied, the circuit receptor --> olivary cell --> Purkinje cell --> effector may be regarded as a stabilizing reflex circuit which is activated by learned mossy fibre inputs. This type of reflex has been called a learned conditional reflex, and it is shown how such reflexes can solve problems of maintaining posture and balance.8. 5(a), and either (2) or (2') are essential to the theory: 5(b) and 5(c) are not absolutely essential, and parts of the theory could survive the disproof of either.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            A theory of cerebellar function

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Relative reward preference in primate orbitofrontal cortex.

              The orbital part of prefrontal cortex appears to be crucially involved in the motivational control of goal-directed behaviour. Patients with lesions of orbitofrontal cortex show impairments in making decisions about the expected outcome of actions. Monkeys with orbitofrontal lesions respond abnormally to changes in reward expectations and show altered reward preferences. As rewards constitute basic goals of behaviour, we investigated here how neurons in the orbitofrontal cortex of monkeys process information about liquid and food rewards in a typical frontal task, spatial delayed responding. The activity of orbitofrontal neurons increases in response to reward-predicting signals, during the expectation of rewards, and after the receipt of rewards. Neurons discriminate between different rewards, mainly irrespective of the spatial and visual features of reward-predicting stimuli and behavioural reactions. Most reward discriminations reflect the animals' relative preference among the available rewards, as expressed by their choice behaviour, rather than physical reward properties. Thus, neurons in the orbitofrontal cortex appear to process the motivational value of rewarding outcomes of voluntary action.
                Bookmark

                Author and article information

                Contributors
                Role: Reviewing Editor
                Role: Senior Editor
                Journal
                eLife
                Elife
                eLife
                eLife
                eLife Sciences Publications, Ltd
                2050-084X
                29 October 2019
                2019
                : 8
                : e46870
                Affiliations
                [1]deptEdmond and Lily Safra Center for Brain Sciences The Hebrew University of Jerusalem JerusalemIsrael
                Stanford University School of Medicine United States
                Emory University United States
                Stanford University School of Medicine United States
                University of Cambridge United Kingdom
                Author notes
                [†]

                These authors contributed equally to this work.

                Author information
                https://orcid.org/0000-0001-8750-2182
                https://orcid.org/0000-0003-2602-3334
                Article
                46870
                10.7554/eLife.46870
                6844644
                31661073
                9185c5b2-1224-4448-ba72-bb60c0962eca
                © 2019, Larry et al

                This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

                History
                : 14 March 2019
                : 24 October 2019
                Funding
                Funded by: FundRef http://dx.doi.org/10.13039/100010663, H2020 European Research Council;
                Award ID: imove 755745
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/501100000854, Human Frontier Science Program;
                Award ID: CDA 00056
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/501100003977, Israel Science Foundation;
                Award ID: 38017
                Award Recipient :
                The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
                Categories
                Research Article
                Neuroscience
                Custom metadata
                Electrophysiological recordings in monkeys reveal that cerebellar complex spikes encode future reward size when reward information is first made available, but not during reward delivery or smooth pursuit eye movement.

                Life sciences
                complex spikes,cerebellum,smooth pursuit,reinforcement learning,reward prediction,other
                Life sciences
                complex spikes, cerebellum, smooth pursuit, reinforcement learning, reward prediction, other

                Comments

                Comment on this article