21
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Gym-Ignition: Reproducible Robotic Simulations for Reinforcement Learning

      Preprint
      , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          In this paper we present Gym-Ignition, a new framework to create reproducible robotic environments for reinforcement learning research. It interfaces with the new generation of Gazebo, part of the Ignition Robotics suite. The new Ignition Gazebo simulator mainly provides three improvements for reinforcement learning applications compared to the alternatives: 1) the modular architecture enables using the simulator as a C++ library, simplifying the interconnection with external software; 2) multiple physics and rendering engines are supported as plugins, and they can be switched during runtime; 3) the new distributed simulation capability permits simulating complex scenarios while sharing the load on multiple workers and machines. The core of Gym-Ignition is a component that contains the Ignition Gazebo simulator, and it simplifies its configuration and usage. We provide a Python package that permits developers to create robotic environments simulated in Ignition Gazebo. Environments expose the common OpenAI Gym interface, making them compatible out-of-the-box with third-party frameworks containing reinforcement learning algorithms. Simulations can be executed in both headless and GUI mode, and the physics engine can run in accelerated mode and instances can be parallelized. Furthermore, the Gym-Ignition software architecture provides abstraction of the Robot and the Task, making environments agnostic from the specific runtime. This allows their execution also in a real-time setting on actual robotic platforms.

          Related collections

          Most cited references6

          • Record: found
          • Abstract: found
          • Article: not found

          Learning agile and dynamic motor skills for legged robots

          Legged robots pose one of the greatest challenges in robotics. Dynamic and agile maneuvers of animals cannot be imitated by existing methods that are crafted by humans. A compelling alternative is reinforcement learning, which requires minimal craftsmanship and promotes the natural evolution of a control policy. However, so far, reinforcement learning research for legged robots is mainly limited to simulation, and only few and comparably simple examples have been deployed on real systems. The primary reason is that training with real robots, particularly with dynamically balancing systems, is complicated and expensive. In the present work, we introduce a method for training a neural network policy in simulation and transferring it to a state-of-the-art legged system, thereby leveraging fast, automated, and cost-effective data generation schemes. The approach is applied to the ANYmal robot, a sophisticated medium-dog–sized quadrupedal system. Using policies trained in simulation, the quadrupedal machine achieves locomotion skills that go beyond what had been achieved with prior methods: ANYmal is capable of precisely and energy-efficiently following high-level body velocity commands, running faster than before, and recovering from falling even in complex configurations.
            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Sim-to-Real Transfer of Robotic Control with Dynamics Randomization

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Gibson Env: Real-World Perception for Embodied Agents

                Bookmark

                Author and article information

                Journal
                05 November 2019
                Article
                1911.01715
                ab0a26b7-400b-4af8-b92e-ff04ad1b553b

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                Accepted in SII2020
                cs.RO cs.DC cs.LG cs.SY eess.SY

                Performance, Systems & Control,Robotics,Artificial intelligence,Networking & Internet architecture

                Comments

                Comment on this article