0
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Learning with reinforcement prediction errors in a model of the Drosophila mushroom body

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Effective decision making in a changing environment demands that accurate predictions are learned about decision outcomes. In Drosophila, such learning is orchestrated in part by the mushroom body, where dopamine neurons signal reinforcing stimuli to modulate plasticity presynaptic to mushroom body output neurons. Building on previous mushroom body models, in which dopamine neurons signal absolute reinforcement, we propose instead that dopamine neurons signal reinforcement prediction errors by utilising feedback reinforcement predictions from output neurons. We formulate plasticity rules that minimise prediction errors, verify that output neurons learn accurate reinforcement predictions in simulations, and postulate connectivity that explains more physiological observations than an experimentally constrained model. The constrained and augmented models reproduce a broad range of conditioning and blocking experiments, and we demonstrate that the absence of blocking does not imply the absence of prediction error dependent learning. Our results provide five predictions that can be tested using established experimental methods.

          Abstract

          Dopamine neurons in the mushroom body help Drosophila learn to approach rewards and avoid punishments. Here, the authors propose a model in which dopaminergic learning signals encode reinforcement prediction errors by utilising feedback reinforcement predictions from mushroom body output neurons.

          Related collections

          Most cited references67

          • Record: found
          • Abstract: not found
          • Article: not found

          A Neural Substrate of Prediction and Reward

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Learning to predict by the methods of temporal differences

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Some aspects of the sequential design of experiments

                Bookmark

                Author and article information

                Contributors
                james.bennett@sussex.ac.uk
                Journal
                Nat Commun
                Nat Commun
                Nature Communications
                Nature Publishing Group UK (London )
                2041-1723
                7 May 2021
                7 May 2021
                2021
                : 12
                : 2569
                Affiliations
                GRID grid.12082.39, ISNI 0000 0004 1936 7590, Department of Informatics, , University of Sussex, ; Brighton, UK
                Author information
                http://orcid.org/0000-0002-9474-4726
                http://orcid.org/0000-0001-5503-0467
                http://orcid.org/0000-0002-4451-915X
                Article
                22592
                10.1038/s41467-021-22592-4
                8105414
                33963189
                86c45816-4d7a-434e-82aa-64fef0c719a0
                © The Author(s) 2021

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 12 September 2019
                : 16 March 2021
                Funding
                Funded by: FundRef https://doi.org/10.13039/501100000266, RCUK | Engineering and Physical Sciences Research Council (EPSRC);
                Award ID: EP/P006094/1
                Award ID: EP/P006094/1
                Award Recipient :
                Categories
                Article
                Custom metadata
                © The Author(s) 2021

                Uncategorized
                learning algorithms,classical conditioning,neural circuits,reward,synaptic plasticity

                Comments

                Comment on this article