scispace - formally typeset
Open AccessJournal ArticleDOI

A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems

Reads0
Chats0
TLDR
A general safety framework based on Hamilton–Jacobi reachability methods that can work in conjunction with an arbitrary learning algorithm is proposed, which proves theoretical safety guarantees combining probabilistic and worst-case analysis and demonstrates the proposed framework experimentally on a quadrotor vehicle.
Abstract
The proven efficacy of learning-based control schemes strongly motivates their application to robotic systems operating in the physical world. However, guaranteeing correct operation during the learning process is currently an unresolved issue, which is of vital importance in safety-critical systems. We propose a general safety framework based on Hamilton–Jacobi reachability methods that can work in conjunction with an arbitrary learning algorithm. The method exploits approximate knowledge of the system dynamics to guarantee constraint satisfaction while minimally interfering with the learning process. We further introduce a Bayesian mechanism that refines the safety analysis as the system acquires new evidence, reducing initial conservativeness when appropriate while strengthening guarantees through real-time validation. The result is a least-restrictive, safety-preserving control law that intervenes only when the computed safety guarantees require it, or confidence in the computed guarantees decays in light of new observations. We prove theoretical safety guarantees combining probabilistic and worst-case analysis and demonstrate the proposed framework experimentally on a quadrotor vehicle. Even though safety analysis is based on a simple point-mass model, the quadrotor successfully arrives at a suitable controller by policy-gradient reinforcement learning without ever crashing, and safely retracts away from a strong external disturbance introduced during flight.

read more

Citations
More filters
Proceedings ArticleDOI

Hamilton-Jacobi reachability: A brief overview and recent advances

TL;DR: In this paper, the authors present an overview of basic HJ reachability theory and provide instructions for using the most recent numerical tools, including an efficient GPU-parallelized implementation of a Level Set Toolbox for computing reachable sets.
Posted Content

Towards Verified Artificial Intelligence

TL;DR: Five challenges for achieving Verified AI are described, and five corresponding principles for addressing these challenges are described.
Proceedings ArticleDOI

Adaptive Safety with Control Barrier Functions

TL;DR: In this paper, adaptive control Lyapunov functions (aCLFs) and adaptive Control Barrier Functions (aCBFs) are combined into a single control methodology for systems with uncertain parameters in the context of a Quadratic Program (QP) based framework.
Proceedings ArticleDOI

Linear Model Predictive Safety Certification for Learning-Based Control

TL;DR: In this article, a model predictive safety certification (MPSC) scheme for linear systems with additive disturbances is proposed, which verifies safety of a proposed learning-based input and modifies it as little as necessary in order to keep the system within a given set of constraints.
Proceedings Article

Natural policy gradient primal-dual method for constrained Markov decision processes

TL;DR: This work is the first to establish non-asymptotic convergence guarantees of policybased primal-dual methods for solving infinite-horizon discounted CMDPs, and it is shown that two samplebased NPG-PD algorithms inherit such non- ATM convergence properties and provide finite-sample complexity guarantees.
References
More filters
Journal ArticleDOI

Human-level control through deep reinforcement learning

TL;DR: This work bridges the divide between high-dimensional sensory inputs and actions, resulting in the first artificial agent that is capable of learning to excel at a diverse array of challenging tasks.
Book

Gaussian Processes for Machine Learning

TL;DR: The treatment is comprehensive and self-contained, targeted at researchers and students in machine learning and applied statistics, and deals with the supervised learning problem for both regression and classification.
Book

Theory of Ordinary Differential Equations

TL;DR: The prerequisite for the study of this book is a knowledge of matrices and the essentials of functions of a complex variable as discussed by the authors, which is a useful text in the application of differential equations as well as for the pure mathematician.
Proceedings Article

Trust Region Policy Optimization

TL;DR: A method for optimizing control policies, with guaranteed monotonic improvement, by making several approximations to the theoretically-justified scheme, called Trust Region Policy Optimization (TRPO).
Journal Article

A comprehensive survey on safe reinforcement learning

TL;DR: This work categorize and analyze two approaches of Safe Reinforcement Learning, based on the modification of the optimality criterion, the classic discounted finite/infinite horizon, with a safety factor and the incorporation of external knowledge or the guidance of a risk metric.
Related Papers (5)