5
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Intrinsic Rewards for Maintenance, Approach, Avoidance, and Achievement Goal Types

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          In reinforcement learning, reward is used to guide the learning process. The reward is often designed to be task-dependent, and it may require significant domain knowledge to design a good reward function. This paper proposes general reward functions for maintenance, approach, avoidance, and achievement goal types. These reward functions exploit the inherent property of each type of goal and are thus task-independent. We also propose metrics to measure an agent's performance for learning each type of goal. We evaluate the intrinsic reward functions in a framework that can autonomously generate goals and learn solutions to those goals using a standard reinforcement learning algorithm. We show empirically how the proposed reward functions lead to learning in a mobile robot application. Finally, using the proposed reward functions as building blocks, we demonstrate how compound reward functions, reward functions to generate sequences of tasks, can be created that allow the mobile robot to learn more complex behaviors.

          Related collections

          Most cited references32

          • Record: found
          • Abstract: found
          • Article: not found

          What is Intrinsic Motivation? A Typology of Computational Approaches

          Intrinsic motivation, centrally involved in spontaneous exploration and curiosity, is a crucial concept in developmental psychology. It has been argued to be a crucial mechanism for open-ended cognitive development in humans, and as such has gathered a growing interest from developmental roboticists in the recent years. The goal of this paper is threefold. First, it provides a synthesis of the different approaches of intrinsic motivation in psychology. Second, by interpreting these approaches in a computational reinforcement learning framework, we argue that they are not operational and even sometimes inconsistent. Third, we set the ground for a systematic operational study of intrinsic motivation by presenting a formal typology of possible computational approaches. This typology is partly based on existing computational models, but also presents new ways of conceptualizing intrinsic motivation. We argue that this kind of computational typology might be useful for opening new avenues for research both in psychology and developmental robotics.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Novelty or Surprise?

            Novelty and surprise play significant roles in animal behavior and in attempts to understand the neural mechanisms underlying it. They also play important roles in technology, where detecting observations that are novel or surprising is central to many applications, such as medical diagnosis, text processing, surveillance, and security. Theories of motivation, particularly of intrinsic motivation, place novelty and surprise among the primary factors that arouse interest, motivate exploratory or avoidance behavior, and drive learning. In many of these studies, novelty and surprise are not distinguished from one another: the words are used more-or-less interchangeably. However, while undeniably closely related, novelty and surprise are very different. The purpose of this article is first to highlight the differences between novelty and surprise and to discuss how they are related by presenting an extensive review of mathematical and computational proposals related to them, and then to explore the implications of this for understanding behavioral and neuroscience data. We argue that opportunities for improved understanding of behavior and its neural basis are likely being missed by failing to distinguish between novelty and surprise.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Lifelong robot learning

                Bookmark

                Author and article information

                Contributors
                Journal
                Front Neurorobot
                Front Neurorobot
                Front. Neurorobot.
                Frontiers in Neurorobotics
                Frontiers Media S.A.
                1662-5218
                09 October 2018
                2018
                : 12
                : 63
                Affiliations
                [1] 1Intelligent Systems Research Centre, Ulster University , Derry, United Kingdom
                [2] 2School of Engineering and Information Technology, University of New South Wales , Canberra, ACT, Australia
                [3] 3Embodied Systems for Robotics and Learning, The Mærsk Mc Kinney Møller Institute, University of Southern Denmark , Odense, Denmark
                Author notes

                Edited by: Vieri Giuliano Santucci, Istituto di Scienze e Tecnologie della Cognizione (ISTC), Italy

                Reviewed by: Gianluca Baldassarre, Consiglio Nazionale Delle Ricerche (CNR), Italy; Eiji Uchibe, Advanced Telecommunications Research Institute International (ATR), Japan; Patricia Shaw, Aberystwyth University, United Kingdom

                *Correspondence: Paresh Dhakan dhakan-p@ 123456ulster.ac.uk
                Article
                10.3389/fnbot.2018.00063
                6189580
                9933aebb-aeb8-403f-b03d-42346972c4e5
                Copyright © 2018 Dhakan, Merrick, Rañó and Siddique.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 30 April 2018
                : 11 September 2018
                Page count
                Figures: 8, Tables: 8, Equations: 19, References: 39, Pages: 16, Words: 12607
                Categories
                Neuroscience
                Original Research

                Robotics
                intrinsic reward function,goal types,open-ended learning,autonomous goal generation,reinforcement learning

                Comments

                Comment on this article