0
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Safe exploration of nonlinear dynamical systems: A predictive safety filter for reinforcement learning

      Preprint
      ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Despite fast progress in Reinforcement Learning (RL), the transfer into real-world applications is challenged by safety requirements in the presence of physical limitations. This is often due to the fact, that most RL methods do not support explicit consideration of state and input constraints. In this paper, we address this problem for nonlinear systems by introducing a predictive safety filter, which turns a constrained dynamical system into an unconstrained safe system, to which any RL algorithm can be applied `out-of-the-box'. The predictive safety filter receives the proposed learning input and decides, based on the current system state, if it can be safely applied to the real system, or if it has to be modified otherwise. Safety is thereby established by a continuously updated safety policy, which is computed according to a data-driven system model, supporting state and input dependent uncertainties in the prediction.

          Related collections

          Most cited references20

          • Record: found
          • Abstract: not found
          • Article: not found

          On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Model predictive control: Recent developments and future promise

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              LQR-trees: Feedback Motion Planning via Sums-of-Squares Verification

                Bookmark

                Author and article information

                Journal
                13 December 2018
                Article
                1812.05506
                aaa0477a-a476-4ca7-8b01-5a80fba71d60

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                cs.SY cs.LG

                Performance, Systems & Control,Artificial intelligence
                Performance, Systems & Control, Artificial intelligence

                Comments

                Comment on this article