Corticostriatal dynamics encode the refinement of specific behavioral variability during skill learning

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Learning to perform a complex motor task requires the optimization of specific behavioral features to cope with task constraints. We show that when mice learn a novel motor paradigm they differentially refine specific behavioral features. Animals trained to perform progressively faster sequences of lever presses to obtain reinforcement reduced variability in sequence frequency, but increased variability in an orthogonal feature (sequence duration). Trial-to-trial variability of the activity of motor cortex and striatal projection neurons was higher early in training and subsequently decreased with learning, without changes in average firing rate. As training progressed, variability in corticostriatal activity became progressively more correlated with behavioral variability, but specifically with variability in frequency. Corticostriatal plasticity was required for the reduction in frequency variability, but not for variability in sequence duration. These data suggest that during motor learning corticostriatal dynamics encode the refinement of specific behavioral features that change the probability of obtaining outcomes.

DOI: http://dx.doi.org/10.7554/eLife.09423.001

eLife digest

Learning a new motor skill typically involves a degree of trial and error. Movements that achieve the desired outcome—from catching a ball to playing scales—are repeated and refined until they can be produced on demand. This process is made more difficult as the activity of individual neurons and muscle fibers can vary at random, and this reduces the ability to reproduce a given movement precisely and reliably.

It has been suggested that the motor system overcomes this problem by identifying those parts of a task that are essential for achieving the end goal, and then focusing resources on reducing the variability in the performance of those parts alone. Santos et al. now provide direct evidence in support of this proposal by recording the activity of neurons in motor regions of the mouse brain as the animals learn a lever pressing task.

By giving mice a food reward each time they pressed the lever four times in a row, Santos et al. trained the animals to press the lever in bouts. The experiment was then slightly modified, so that the mice had to perform the four lever presses more rapidly in order to earn their reward. Consistent with predictions, the average speed of lever pressing initially varied greatly, but this variability decreased as the animals learned the task. By contrast, the total duration of individual bouts of lever pressing—which depends largely on the number of times the mice press the lever—was just as variable after training as before.

A similar pattern emerged for the activity of individual motor neurons in the mouse brain. Whereas their activity initially varied greatly, this variability decreased over training. Moreover, it became increasingly linked to the variability in the speed of lever pressing, but not with the variability in the duration of individual bouts.

The work of Santos et al. has thus shown in real time how the motor system focuses its efforts on reducing variability in those specific parts of a task that are essential for achieving a goal. Without a process called corticostriatal plasticity, by which the motor system adapts, mice could not refine this variability.

DOI: http://dx.doi.org/10.7554/eLife.09423.002

Related collections

Most cited references 21

Record: found
Abstract: not found
Article: not found

Optimal feedback control and the neural basis of volitional motor control.

Stephen Scott (2004)

0 comments Cited 255 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Temporal structure of motor variability is dynamically regulated and predicts motor learning ability.

Howard G Wu, Yohsuke Miyamoto, Luis Castro … (2014)

Individual differences in motor learning ability are widely acknowledged, yet little is known about the factors that underlie them. Here we explore whether movement-to-movement variability in motor output, a ubiquitous if often unwanted characteristic of motor performance, predicts motor learning ability. Surprisingly, we found that higher levels of task-relevant motor variability predicted faster learning both across individuals and across tasks in two different paradigms, one relying on reward-based learning to shape specific arm movement trajectories and the other relying on error-based learning to adapt movements in novel physical environments. We proceeded to show that training can reshape the temporal structure of motor variability, aligning it with the trained task to improve learning. These results provide experimental support for the importance of action exploration, a key idea from reinforcement learning theory, showing that motor variability facilitates motor learning in humans and that our nervous systems actively regulate it to improve learning.

0 comments Cited 207 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Activity of striatal neurons reflects dynamic encoding and recoding of procedural memories.

Rebecca Barnes, Ann Graybiel, Yasuo Kubota … (2005)

Learning to perform a behavioural procedure as a well-ingrained habit requires extensive repetition of the behavioural sequence, and learning not to perform such behaviours is notoriously difficult. Yet regaining a habit can occur quickly, with even one or a few exposures to cues previously triggering the behaviour. To identify neural mechanisms that might underlie such learning dynamics, we made long-term recordings from multiple neurons in the sensorimotor striatum, a basal ganglia structure implicated in habit formation, in rats successively trained on a reward-based procedural task, given extinction training and then given reacquisition training. The spike activity of striatal output neurons, nodal points in cortico-basal ganglia circuits, changed markedly across multiple dimensions during each of these phases of learning. First, new patterns of task-related ensemble firing successively formed, reversed and then re-emerged. Second, task-irrelevant firing was suppressed, then rebounded, and then was suppressed again. These changing spike activity patterns were highly correlated with changes in behavioural performance. We propose that these changes in task representation in cortico-basal ganglia circuits represent neural equivalents of the explore-exploit behaviour characteristic of habit learning.

0 comments Cited 159 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Ole Kiehn: Role: Reviewing editor

Journal

Journal ID (nlm-ta): eLife

Journal ID (hwp): eLife

Journal ID (publisher-id): eLife

Title: eLife

Publisher: eLife Sciences Publications, Ltd

ISSN (Print): 2050-084X

ISSN (Electronic): 2050-084X

Publication date (Electronic, pub): 29 September 2015

Publication date Collection: 2015

Volume: 4

Electronic Location Identifier: e09423

Affiliations

[1 ]deptChampalimaud Neuroscience Programme , Fundação Champalimaud , Lisbon, Portugal

[2 ]deptMolecular Neurobiology Laboratory , Salk Institute for Biological Studies , La Jolla, United States

Karolinska Institutet , Sweden

Author notes

[* ]For correspondence: rui.costa@ 123456neuro.fchampalimaud.org

Article

Publisher ID: 09423

DOI: 10.7554/eLife.09423

PMC ID: 4616249

PubMed ID: 26417950

SO-VID: aa034946-67ee-4006-9a65-9c7a7897c212

License:

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

History

Date received : 18 June 2015

Date accepted : 28 September 2015

Funding

Funded by: Howard Hughes Medical Institute (HHMI);

Award ID: International Early Career Scientist Grant IEC 55007415

Award Recipient : Rui M Costa

Funded by: European Research Council (ERC);

Award ID: Consolidator Grants, ERC CoG 617142

Award Recipient : Rui M Costa

Funded by: ERA NET;

Award ID: NEURON

Award Recipient : Rui M Costa

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Custom metadata

elife-xml-version 2.3

Author impact statement Recordings from the mouse brain as animals learn a lever pressing task reveal how the motor system optimizes skill learning by reducing variability in those aspects of task performance that are essential for achieving a goal.

ScienceOpen disciplines: Life sciences

Keywords: striatum,cortex,action,skill,motor,plasticity,mouse

Data availability:

ScienceOpen disciplines: Life sciences

Keywords: striatum, cortex, action, skill, motor, plasticity, mouse

Comments

Comment on this article

scite_

Cited by 24

See all cited by

Most referenced authors 151

See all reference authors

Corticostriatal dynamics encode the refinement of specific behavioral variability during skill learning

Read this article at

Abstract

eLife digest

Related collections

Behavioral phenotypization of a mouse model for Alzheimer's: PDAPP = Tg(APPV717F)109Ili

Most cited references 21

Optimal feedback control and the neural basis of volitional motor control.

Temporal structure of motor variability is dynamically regulated and predicts motor learning ability.

Activity of striatal neurons reflects dynamic encoding and recoding of procedural memories.

Author and article information

Contributors

Journal

Affiliations

Author notes

Article

History

Funding

Categories

Custom metadata

Comments

Comment on this article

Similar content 198

Cited by 24

Most referenced authors 151