18
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Literal or Pedagogic Human? Analyzing Human Model Misspecification in Objective Learning

      Preprint
      ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          It is incredibly easy for a system designer to misspecify the objective for an autonomous system ("robot''), thus motivating the desire to have the robot learn the objective from human behavior instead. Recent work has suggested that people have an interest in the robot performing well, and will thus behave pedagogically, choosing actions that are informative to the robot. In turn, robots benefit from interpreting the behavior by accounting for this pedagogy. In this work, we focus on misspecification: we argue that robots might not know whether people are being pedagogic or literal and that it is important to ask which assumption is safer to make. We cast objective learning into the more general form of a common-payoff game between the robot and human, and prove that in any such game literal interpretation is more robust to misspecification. Experiments with human data support our theoretical results and point to the sensitivity of the pedagogic assumption.

          Related collections

          Most cited references4

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Apprenticeship learning via inverse reinforcement learning

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Legibility and predictability of robot motion

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Learning preferences for manipulation tasks from online coactive feedback

                Bookmark

                Author and article information

                Journal
                09 March 2019
                2019-06-28
                Article
                1903.03877
                e81ba9d0-5631-4cf4-9afb-7870d4d80aa8

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                Published at UAI 2019
                cs.AI

                Artificial intelligence
                Artificial intelligence

                Comments

                Comment on this article