
      A Dual Role Hypothesis of the Cortico-Basal-Ganglia Pathways: Opponency and Temporal Difference Through Dopamine and Adenosine

      research-article


          Abstract

          The hypothesis that the basal-ganglia direct and indirect pathways represent goodness (or benefit) and badness (or cost) of options, respectively, explains a wide range of phenomena. However, this hypothesis, named the Opponent Actor Learning (OpAL) model, still has limitations. Structurally, the OpAL model does not differentiate the two types of cortical inputs that the basal-ganglia pathways receive from intratelencephalic (IT) and pyramidal-tract (PT) neurons. Functionally, the OpAL model does not describe the temporal-difference (TD)-type reward-prediction-error (RPE), nor does it explain how RPE is calculated in the circuitry connecting to the dopamine (DA) neurons. In fact, there is a different hypothesis on the basal-ganglia pathways and DA, named the Cortico-Striatal-Temporal-Difference (CS-TD) model. The CS-TD model differentiates the IT and PT inputs, describes the TD-type RPE, and explains how TD-RPE is calculated. However, a critical difficulty in this model lies in its assumption that DA induces the same direction of plasticity in both direct and indirect pathways, which apparently contradicts the experimentally observed opposite effects of DA on these pathways. Here, we propose a new hypothesis that integrates the OpAL and CS-TD models. Specifically, we propose that the IT-basal-ganglia pathways represent goodness/badness of current options while the PT-indirect pathway represents the overall value of the previously chosen option, and that both influence the DA neurons through the basal-ganglia output, so that a variant of TD-RPE is calculated. A key assumption is that phasic activation of DA neurons induces opposite directions of plasticity in the IT-indirect pathway and the PT-indirect pathway because of the different profiles of IT and PT inputs. Specifically, at PT→indirect-pathway-medium-spiny-neuron (iMSN) synapses, sustained glutamatergic inputs generate abundant adenosine, which allosterically prevents DA-D2 receptor signaling and instead favors adenosine-A2A receptor signaling. Then, phasic DA-induced phasic adenosine, which reflects TD-RPE, causes long-term synaptic potentiation. In contrast, at IT→iMSN synapses where adenosine is scarce, phasic DA causes long-term synaptic depression via D2 receptor signaling. This new Opponency and Temporal-Difference (OTD) model provides unique predictions, some of which are potentially in line with recently reported activity patterns of neurons in the globus pallidus externus on the indirect pathway.
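
          As a rough illustration of the two ingredients named in the abstract, the sketch below computes a textbook TD-type RPE and applies a toy sign rule for cortex→iMSN plasticity. It is not the authors' model: the abstract does not give the exact form of its TD-RPE variant, so the standard expression (reward plus discounted next value minus previous value) is assumed here, and the function names, the discount factor, the learning rate, and the restriction to positive RPE (phasic DA activation) are all hypothetical choices made for illustration.

```python
def td_rpe(reward, value_next, value_prev, gamma=0.9):
    """Textbook temporal-difference reward prediction error.

    Assumption: standard form r + gamma * V(next) - V(prev); the article's
    own "variant of TD-RPE" is not specified in the abstract.
    """
    return reward + gamma * value_next - value_prev


def imsn_weight_change(rpe, adenosine_rich, learning_rate=0.1):
    """Toy sign rule for a cortex->iMSN synapse after phasic DA activation.

    Qualitative reading of the abstract:
      - PT->iMSN (adenosine-rich): A2A signaling dominates -> potentiation.
      - IT->iMSN (adenosine-scarce): D2 signaling dominates -> depression.
    Assumption: only positive RPE (phasic DA activation) drives a change,
    and the change scales linearly with the RPE.
    """
    phasic = max(rpe, 0.0)
    sign = 1.0 if adenosine_rich else -1.0
    return sign * learning_rate * phasic


if __name__ == "__main__":
    # An unexpected reward after a state valued at 0.5, with no future value.
    delta = td_rpe(reward=1.0, value_next=0.0, value_prev=0.5)
    print("TD-RPE:", delta)
    print("PT->iMSN change:", imsn_weight_change(delta, adenosine_rich=True))
    print("IT->iMSN change:", imsn_weight_change(delta, adenosine_rich=False))
```

          With the same positive RPE, the two synapse types change in opposite directions, which is the qualitative point the hypothesis relies on; the magnitudes carry no biological meaning.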

                Author and article information

                Contributors
                Journal
                Front Neural Circuits
                Front. Neural Circuits
                Frontiers in Neural Circuits
                Frontiers Media S.A.
                1662-5110
                07 January 2019
                2018; 12: 111
                Affiliations
                [1] Physical and Health Education, Graduate School of Education, The University of Tokyo, Tokyo, Japan
                [2] International Research Center for Neurointelligence (WPI-IRCN), The University of Tokyo Institutes for Advanced Study, Tokyo, Japan
                [3] Division of Cerebral Circuitry, National Institute for Physiological Sciences, Okazaki, Japan
                [4] Department of Physiological Sciences, Graduate University for Advanced Studies, Okazaki, Japan
                Author notes

                Edited by: Anita Disney, Vanderbilt University, United States

                Reviewed by: Veronica Ghiglieri, University of Perugia, Italy; Jeanette Hellgren Kotaleski, Karolinska Institutet (KI), Sweden

                *Correspondence: Kenji Morita, morita@p.u-tokyo.ac.jp
                Article
                DOI: 10.3389/fncir.2018.00111
                PMCID: PMC6338031
                PMID: 30687019
                Copyright © 2019 Morita and Kawaguchi.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                Received: 31 August 2018
                Accepted: 29 November 2018
                Page count
                Figures: 3, Tables: 0, Equations: 0, References: 80, Pages: 11, Words: 0
                Categories
                Neuroscience
                Hypothesis and Theory

                Neurosciences
                reinforcement learning, reward prediction error, cost, basal ganglia, dopamine, adenosine
