Cerebellar climbing fibers encode expected reward size

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Climbing fiber inputs to the cerebellum encode error signals that instruct learning. Recently, evidence has accumulated to suggest that the cerebellum is also involved in the processing of reward. To study how rewarding events are encoded, we recorded the activity of climbing fibers when monkeys were engaged in an eye movement task. At the beginning of each trial, the monkeys were cued to the size of the reward that would be delivered upon successful completion of the trial. Climbing fiber activity increased when the monkeys were presented with a cue indicating a large reward, but not a small reward. Reward size did not modulate activity at reward delivery or during eye movements. Comparison between climbing fiber and simple spike activity indicated different interactions for coding of movement and reward. These results indicate that climbing fibers encode the expected reward size and suggest a general role of the cerebellum in associative learning beyond error correction.

Related collections

Most cited references 37

Record: found
Abstract: found
Article: not found

A theory of cerebellar cortex.

D. Marr (1969)

1. A detailed theory of cerebellar cortex is proposed whose consequence is that the cerebellum learns to perform motor skills. Two forms of input-output relation are described, both consistent with the cortical theory. One is suitable for learning movements (actions), and the other for learning to maintain posture and balance (maintenance reflexes).2. It is known that the cells of the inferior olive and the cerebellar Purkinje cells have a special one-to-one relationship induced by the climbing fibre input. For learning actions, it is assumed that:(a) each olivary cell responds to a cerebral instruction for an elemental movement. Any action has a defining representation in terms of elemental movements, and this representation has a neural expression as a sequence of firing patterns in the inferior olive; and(b) in the correct state of the nervous system, a Purkinje cell can initiate the elemental movement to which its corresponding olivary cell responds.3. Whenever an olivary cell fires, it sends an impulse (via the climbing fibre input) to its corresponding Purkinje cell. This Purkinje cell is also exposed (via the mossy fibre input) to information about the context in which its olivary cell fired; and it is shown how, during rehearsal of an action, each Purkinje cell can learn to recognize such contexts. Later, when the action has been learnt, occurrence of the context alone is enough to fire the Purkinje cell, which then causes the next elemental movement. The action thus progresses as it did during rehearsal.4. It is shown that an interpretation of cerebellar cortex as a structure which allows each Purkinje cell to learn a number of contexts is consistent both with the distributions of the various types of cell, and with their known excitatory or inhibitory natures. It is demonstrated that the mossy fibre-granule cell arrangement provides the required pattern discrimination capability.5. The following predictions are made.(a) The synapses from parallel fibres to Purkinje cells are facilitated by the conjunction of presynaptic and climbing fibre (or post-synaptic) activity.(b) No other cerebellar synapses are modifiable.(c) Golgi cells are driven by the greater of the inputs from their upper and lower dendritic fields.6. For learning maintenance reflexes, 2(a) and 2(b) are replaced by2'. Each olivary cell is stimulated by one or more receptors, all of whose activities are usually reduced by the results of stimulating the corresponding Purkinje cell.7. It is shown that if (2') is satisfied, the circuit receptor --> olivary cell --> Purkinje cell --> effector may be regarded as a stabilizing reflex circuit which is activated by learned mossy fibre inputs. This type of reflex has been called a learned conditional reflex, and it is shown how such reflexes can solve problems of maintaining posture and balance.8. 5(a), and either (2) or (2') are essential to the theory: 5(b) and 5(c) are not absolutely essential, and parts of the theory could survive the disproof of either.

0 comments Cited 507 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

A theory of cerebellar function

James S Albus (1971)

0 comments Cited 354 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Relative reward preference in primate orbitofrontal cortex.

L. Tremblay, W. Schultz (1999)

The orbital part of prefrontal cortex appears to be crucially involved in the motivational control of goal-directed behaviour. Patients with lesions of orbitofrontal cortex show impairments in making decisions about the expected outcome of actions. Monkeys with orbitofrontal lesions respond abnormally to changes in reward expectations and show altered reward preferences. As rewards constitute basic goals of behaviour, we investigated here how neurons in the orbitofrontal cortex of monkeys process information about liquid and food rewards in a typical frontal task, spatial delayed responding. The activity of orbitofrontal neurons increases in response to reward-predicting signals, during the expectation of rewards, and after the receipt of rewards. Neurons discriminate between different rewards, mainly irrespective of the spatial and visual features of reward-predicting stimuli and behavioural reactions. Most reward discriminations reflect the animals' relative preference among the available rewards, as expressed by their choice behaviour, rather than physical reward properties. Thus, neurons in the orbitofrontal cortex appear to process the motivational value of rewarding outcomes of voluntary action.

0 comments Cited 226 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Noga Larry:

ORCID: https://orcid.org/0000-0001-8750-2182

Jennifer L Raymond: Role: Reviewing Editor

Ronald L Calabrese: Role: Senior Editor

Journal

Journal ID (nlm-ta): eLife

Journal ID (iso-abbrev): Elife

Journal ID (publisher-id): eLife

Title: eLife

Publisher: eLife Sciences Publications, Ltd

ISSN (Electronic): 2050-084X

Publication date (Electronic, pub): 29 October 2019

Publication date Collection: 2019

Volume: 8

Electronic Location Identifier: e46870

Affiliations

[1]deptEdmond and Lily Safra Center for Brain Sciences The Hebrew University of Jerusalem JerusalemIsrael

Stanford University School of Medicine United States

Emory University United States

Stanford University School of Medicine United States

University of Cambridge United Kingdom

Author notes

[†]

These authors contributed equally to this work.

Author information

Noga Larry https://orcid.org/0000-0001-8750-2182

Mati Joshua https://orcid.org/0000-0003-2602-3334

Article

Publisher ID: 46870

DOI: 10.7554/eLife.46870

PMC ID: 6844644

PubMed ID: 31661073

SO-VID: 9185c5b2-1224-4448-ba72-bb60c0962eca

License:

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

History

Date received : 14 March 2019

Date accepted : 24 October 2019

Funding

Funded by: FundRef http://dx.doi.org/10.13039/100010663, H2020 European Research Council;

Award ID: imove 755745

Award Recipient : Mati Joshua

Funded by: FundRef http://dx.doi.org/10.13039/501100000854, Human Frontier Science Program;

Award ID: CDA 00056

Award Recipient : Mati Joshua

Funded by: FundRef http://dx.doi.org/10.13039/501100003977, Israel Science Foundation;

Award ID: 38017

Award Recipient : Mati Joshua

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Custom metadata

Author impact statement Electrophysiological recordings in monkeys reveal that cerebellar complex spikes encode future reward size when reward information is first made available, but not during reward delivery or smooth pursuit eye movement.

ScienceOpen disciplines: Life sciences

Keywords: complex spikes,cerebellum,smooth pursuit,reinforcement learning,reward prediction,other

Data availability:

ScienceOpen disciplines: Life sciences

Keywords: complex spikes, cerebellum, smooth pursuit, reinforcement learning, reward prediction, other

Cerebellar climbing fibers encode expected reward size

Read this article at

Abstract

Related collections

The Dynamic Brain

Most cited references 37

A theory of cerebellar cortex.

A theory of cerebellar function

Relative reward preference in primate orbitofrontal cortex.

Author and article information

Contributors

Journal

Affiliations

Author notes

Author information

Article

History

Funding

Categories

Custom metadata

Comments

Comment on this article

Similar content 39

Cited by 27

Most referenced authors 209