22
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Efficient Bimanual Manipulation Using Learned Task Schemas

      Preprint
      , , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          We address the problem of effectively composing skills to solve sparse-reward tasks in the real world. Given a set of parameterized skills (such as exerting a force or doing a top grasp at a location), our goal is to learn policies that invoke these skills to efficiently solve such tasks. Our insight is that for many tasks, the learning process can be decomposed into learning a state-independent task schema (a sequence of skills to execute) and a policy to choose the parameterizations of the skills in a state-dependent manner. For such tasks, we show that explicitly modeling the schema's state-independence can yield significant improvements in sample efficiency for model-free reinforcement learning algorithms. Furthermore, these schemas can be transferred to solve related tasks, by simply re-learning the parameterizations with which the skills are invoked. We find that doing so enables learning to solve sparse-reward tasks on real-world robotic systems very efficiently. We validate our approach experimentally over a suite of robotic bimanual manipulation tasks, both in simulation and on real hardware. See videos at http://tinyurl.com/chitnis-schema .

          Related collections

          Most cited references6

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          MuJoCo: A physics engine for model-based control

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Domain randomization for transferring deep neural networks from simulation to the real world

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Ensemble-CIO: Full-body dynamic motion planning that transfers to physical humanoids

                Bookmark

                Author and article information

                Journal
                30 September 2019
                Article
                1909.13874
                99965995-a421-4f81-8f57-3cb3fa462401

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                cs.RO cs.CV cs.LG

                Computer vision & Pattern recognition,Robotics,Artificial intelligence
                Computer vision & Pattern recognition, Robotics, Artificial intelligence

                Comments

                Comment on this article