11
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Computational Characteristics of the Striatal Dopamine System Described by Reinforcement Learning With Fast Generalization

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Generalization is the ability to apply past experience to similar but non-identical situations. It not only affects stimulus-outcome relationships, as observed in conditioning experiments, but may also be essential for adaptive behaviors, which involve the interaction between individuals and their environment. Computational modeling could potentially clarify the effect of generalization on adaptive behaviors and how this effect emerges from the underlying computation. Recent neurobiological observation indicated that the striatal dopamine system achieves generalization and subsequent discrimination by updating the corticostriatal synaptic connections in differential response to reward and punishment. In this study, we analyzed how computational characteristics in this neurobiological system affects adaptive behaviors. We proposed a novel reinforcement learning model with multilayer neural networks in which the synaptic weights of only the last layer are updated according to the prediction error. We set fixed connections between the input and hidden layers to maintain the similarity of inputs in the hidden-layer representation. This network enabled fast generalization of reward and punishment learning, and thereby facilitated safe and efficient exploration of spatial navigation tasks. Notably, it demonstrated a quick reward approach and efficient punishment aversion in the early learning phase, compared to algorithms that do not show generalization. However, disturbance of the network that causes noisy generalization and impaired discrimination induced maladaptive valuation. These results suggested the advantage and potential drawback of computation by the striatal dopamine system with regard to adaptive behaviors.

          Related collections

          Most cited references38

          • Record: found
          • Abstract: not found
          • Article: not found

          Extreme learning machine: Theory and applications

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Real-time computing without stable states: a new framework for neural computation based on perturbations.

            A key challenge for neural modeling is to explain how a continuous stream of multimodal input from a rapidly changing environment can be processed by stereotypical recurrent circuits of integrate-and-fire neurons in real time. We propose a new computational model for real-time computing on time-varying input that provides an alternative to paradigms based on Turing machines or attractor neural networks. It does not require a task-dependent construction of neural circuits. Instead, it is based on principles of high-dimensional dynamical systems in combination with statistical learning theory and can be implemented on generic evolved or found recurrent circuitry. It is shown that the inherent transient dynamics of the high-dimensional dynamical system formed by a sufficiently large and heterogeneous neural circuit may serve as universal analog fading memory. Readout neurons can learn to extract in real time from the current state of such recurrent neural circuit information about current and past inputs that may be needed for diverse tasks. Stable internal states are not required for giving a stable output, since transient internal states can be transformed by readout neurons into stable target outputs due to the high dimensionality of the dynamical system. Our approach is based on a rigorous computational model, the liquid state machine, that, unlike Turing machines, does not require sequential transitions between well-defined discrete internal states. It is supported, as the Turing machine is, by rigorous mathematical results that predict universal computational power under idealized conditions, but for the biologically more realistic scenario of real-time processing of time-varying inputs. Our approach provides new perspectives for the interpretation of neural coding, the design of experiments and data analysis in neurophysiology, and the solution of problems in robotics and neurotechnology.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Psychosis as a state of aberrant salience: a framework linking biology, phenomenology, and pharmacology in schizophrenia.

              The clinical hallmark of schizophrenia is psychosis. The objective of this overview is to link the neurobiology (brain), the phenomenological experience (mind), and pharmacological aspects of psychosis-in-schizophrenia into a unitary framework. Current ideas regarding the neurobiology and phenomenology of psychosis and schizophrenia, the role of dopamine, and the mechanism of action of antipsychotic medication were integrated to develop this framework. A central role of dopamine is to mediate the "salience" of environmental events and internal representations. It is proposed that a dysregulated, hyperdopaminergic state, at a "brain" level of description and analysis, leads to an aberrant assignment of salience to the elements of one's experience, at a "mind" level. Delusions are a cognitive effort by the patient to make sense of these aberrantly salient experiences, whereas hallucinations reflect a direct experience of the aberrant salience of internal representations. Antipsychotics "dampen the salience" of these abnormal experiences and by doing so permit the resolution of symptoms. The antipsychotics do not erase the symptoms but provide the platform for a process of psychological resolution. However, if antipsychotic treatment is stopped, the dysregulated neurochemistry returns, the dormant ideas and experiences become reinvested with aberrant salience, and a relapse occurs. The article provides a heuristic framework for linking the psychological and biological in psychosis. Predictions of this hypothesis, particularly regarding the possibility of synergy between psychological and pharmacological therapies, are presented. The author describes how the hypothesis is complementary to other ideas about psychosis and also discusses its limitations.
                Bookmark

                Author and article information

                Contributors
                Journal
                Front Comput Neurosci
                Front Comput Neurosci
                Front. Comput. Neurosci.
                Frontiers in Computational Neuroscience
                Frontiers Media S.A.
                1662-5188
                22 July 2020
                2020
                : 14
                : 66
                Affiliations
                [1] 1Integrated Systems Biology Laboratory, Department of Systems Science, Graduate School of Informatics, Kyoto University , Kyoto, Japan
                [2] 2Laboratory of Structural Physiology, Center for Disease Biology and Integrative Medicine, Faculty of Medicine, The University of Tokyo , Tokyo, Japan
                [3] 3International Research Center for Neurointelligence, The University of Tokyo Institutes for Advanced Study, The University of Tokyo , Tokyo, Japan
                [4] 4Neural Information Processing Laboratories, Advanced Telecommunications Research Institute International (ATR) , Kyoto, Japan
                Author notes

                Edited by: Uma Shanker Tiwary, Indian Institute of Information Technology, Allahabad, India

                Reviewed by: Adam Ponzi, Okinawa Institute of Science and Technology Graduate University, Japan; Alekhya Mandali, University of Cambridge, United Kingdom

                *Correspondence: Yoshihisa Fujita fujita.yoshihisa.4s@ 123456kyoto-u.ac.jp
                Article
                10.3389/fncom.2020.00066
                7388898
                32774245
                15b9889f-cbcf-4424-a293-541535502d07
                Copyright © 2020 Fujita, Yagishita, Kasai and Ishii.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 11 March 2020
                : 08 June 2020
                Page count
                Figures: 7, Tables: 0, Equations: 27, References: 53, Pages: 15, Words: 9665
                Funding
                Funded by: Japan Society for the Promotion of Science 10.13039/501100001691
                Award ID: 19H04180
                Funded by: Ministry of Education, Culture, Sports, Science and Technology 10.13039/501100001700
                Award ID: 17H06310
                Funded by: Core Research for Evolutional Science and Technology 10.13039/501100003382
                Award ID: JPMJCR1652
                Categories
                Neuroscience
                Original Research

                Neurosciences
                generalization,adaptive behaviors,reward learning,striatum,dopamine-dependent plasticity,reinforcement learning,artificial neural networks

                Comments

                Comment on this article