ScienceOpen: research and publishing network

For Researchers

Search
Advanced search

18

views

    

0

recommends

0

shares

Record: found
Abstract: found
Article: found

Is Open Access

Literal or Pedagogic Human? Analyzing Human Model Misspecification in Objective Learning

Preprint

Author(s): Smitha Milli , Anca D. Dragan

Publication date Created: 09 March 2019

Read this article at

ScienceOpen ArXiv

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

It is incredibly easy for a system designer to misspecify the objective for an autonomous system ("robot''), thus motivating the desire to have the robot learn the objective from human behavior instead. Recent work has suggested that people have an interest in the robot performing well, and will thus behave pedagogically, choosing actions that are informative to the robot. In turn, robots benefit from interpreting the behavior by accounting for this pedagogy. In this work, we focus on misspecification: we argue that robots might not know whether people are being pedagogic or literal and that it is important to ask which assumption is safer to make. We cast objective learning into the more general form of a common-payoff game between the robot and human, and prove that in any such game literal interpretation is more robust to misspecification. Experiments with human data support our theoretical results and point to the sensitivity of the pedagogic assumption.

Related collections

Most cited references 4

Record: found
Abstract: not found
Conference Proceedings: not found

Apprenticeship learning via inverse reinforcement learning

Pieter Abbeel, Andrew Y. Ng (2004)

0 comments Cited 343 times – based on 0 reviews

Record: found
Abstract: not found
Conference Proceedings: not found

Legibility and predictability of robot motion

Siddhartha Srinivasa, Kenton C.T. Lee, Anca Dragan (2013)

0 comments Cited 57 times – based on 0 reviews

Record: found
Abstract: not found
Article: not found

Learning preferences for manipulation tasks from online coactive feedback

Ashutosh Saxena, Shikhar Sharma, Ashesh Jain … (2015)

0 comments Cited 15 times – based on 0 reviews      Review now

Author and article information

Journal

Publication date Created: 09 March 2019

Publication date Updated: 2019-06-28

Article

ArXiV ID: 1903.03877

SO-VID: e81ba9d0-5631-4cf4-9afb-7870d4d80aa8

License:

http://arxiv.org/licenses/nonexclusive-distrib/1.0/

History

Custom metadata

Comments Published at UAI 2019

Categories cs.AI

ScienceOpen disciplines: Artificial intelligence

Data availability:

ScienceOpen disciplines: Artificial intelligence

Comments

Comment on this article