Safe learning of regions of attraction for uncertain, nonlinear systems with Gaussian processes

doi:10.1109/CDC.2016.7798979

Proceedings Article (Open Access)

pp. 4661-4666

TLDR

This paper considers an approach that learns the ROA from experiments on a real system, without ever leaving the true ROA and, thus, without risking safety-critical failures.

Abstract:

Control theory can provide useful insights into the properties of controlled, dynamic systems. One important property of nonlinear systems is the region of attraction (ROA), a safe subset of the state space in which a given controller renders an equilibrium point asymptotically stable. The ROA is typically estimated based on a model of the system. However, since models are only an approximation of the real world, the resulting estimated safe region can contain states outside the ROA of the real system. This is not acceptable in safety-critical applications. In this paper, we consider an approach that learns the ROA from experiments on a real system, without ever leaving the true ROA and, thus, without risking safety-critical failures. Based on regularity assumptions on the model errors in terms of a Gaussian process prior, we use an underlying Lyapunov function in order to determine a region in which an equilibrium point is asymptotically stable with high probability. Moreover, we provide an algorithm to actively and safely explore the state space in order to expand the ROA estimate. We demonstrate the effectiveness of this method in simulation.
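
The idea in the abstract, combining a Lyapunov function with Gaussian process confidence bounds on the model error, can be illustrated with a minimal sketch. This is not the authors' algorithm or code. The 1-D system, the Lyapunov function V(x) = x^2/2, the kernel hyperparameters, the confidence scaling `beta`, the grid, and the training data are all assumptions chosen for illustration, and the names `rbf_kernel`, `gp_posterior`, and `certified_level` are hypothetical.

```python
import numpy as np

def rbf_kernel(a, b, lengthscale=0.5):
    """Unit-variance squared-exponential kernel between two 1-D arrays (assumed prior)."""
    diff = a[:, None] - b[None, :]
    return np.exp(-0.5 * (diff / lengthscale) ** 2)

def gp_posterior(x_train, y_train, x_test, noise_std=0.05):
    """Posterior mean and standard deviation of a zero-mean GP at x_test."""
    K = rbf_kernel(x_train, x_train) + noise_std**2 * np.eye(len(x_train))
    K_s = rbf_kernel(x_train, x_test)  # shape (n_train, n_test)
    mean = K_s.T @ np.linalg.solve(K, y_train)
    var = 1.0 - np.sum(K_s * np.linalg.solve(K, K_s), axis=0)
    return mean, np.sqrt(np.clip(var, 0.0, None))

def certified_level(x_grid, prior_model, x_train, y_train, beta=2.0, origin_tol=0.05):
    """Largest level c such that every grid state with V(x) < c, outside a small
    neighborhood of the origin, has a negative upper bound on dV/dt."""
    mean, std = gp_posterior(x_train, y_train, x_grid)
    v = 0.5 * x_grid**2  # assumed Lyapunov function V(x) = x^2 / 2
    # dV/dt = x * (prior_model(x) + error(x)); bound the unknown error by mean + beta * std.
    vdot_upper = x_grid * (prior_model(x_grid) + mean) + beta * np.abs(x_grid) * std
    uncertified = (vdot_upper >= 0.0) & (np.abs(x_grid) > origin_tol)
    return v[uncertified].min() if uncertified.any() else v.max()

# Hypothetical setup: the prior model is dx/dt = -x and the unknown model error is x^3.
prior_model = lambda x: -x
x_grid = np.linspace(-2.0, 2.0, 401)
x_train = np.linspace(-0.6, 0.6, 5)
y_train = x_train**3  # noise-free measurements of the model error (assumption)

c_level = certified_level(x_grid, prior_model, x_train, y_train)
print(f"certified level set: V(x) < {c_level:.4f}")
```

A larger `beta` makes the confidence bound more conservative and can only shrink the certified level set. The sketch checks grid points only, so it says nothing about the states between them, and it does not implement the safe exploration step described in the abstract.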

Citations

Journal Article

A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems

Jaime F. Fisac, +5 more

- 01 Jul 2019 -

IEEE Transactions on Automatic Control

TL;DR: A general safety framework based on Hamilton–Jacobi reachability methods that can work in conjunction with an arbitrary learning algorithm is proposed, which proves theoretical safety guarantees combining probabilistic and worst-case analysis and demonstrates the proposed framework experimentally on a quadrotor vehicle.

Proceedings Article

Learning-Based Model Predictive Control for Safe Exploration

Torsten Koller, +3 more

TL;DR: This paper presents a learning-based model predictive control scheme that can provide provable high-probability safety guarantees and exploits regularity assumptions on the dynamics in terms of a Gaussian process prior to construct provably accurate confidence intervals on predicted trajectories.

Posted Content

Learning-based Model Predictive Control for Safe Exploration and Reinforcement Learning

Torsten Koller, +4 more

TL;DR: This paper presents a learning-based model predictive control scheme that provides high-probability safety guarantees throughout the learning process.

Posted Content

Neural Lyapunov Control

Ya-Chien Chang, +2 more

- 01 May 2020 -

arXiv: Learning

TL;DR: In this paper, the authors propose a method for learning control policies and neural network Lyapunov functions for nonlinear control problems, with a provable guarantee of stability, using a falsifier that finds counterexamples to guide the learner towards solutions.

Posted Content

Learning for Safety-Critical Control with Control Barrier Functions.

Andrew J. Taylor, +3 more

TL;DR: This paper presents a machine learning framework that uses Control Barrier Functions (CBFs) to reduce model uncertainty as it impacts the safe behavior of a system, ultimately achieving safe behavior.

References

Book

Gaussian Processes for Machine Learning

Carl Edward Rasmussen, +1 more

TL;DR: The treatment is comprehensive and self-contained, targeted at researchers and students in machine learning and applied statistics, and deals with the supervised learning problem for both regression and classification.

Book

Essentials of Robust Control

Kemin Zhou, +1 more

TL;DR: This book covers linear systems, H∞ spaces, algebraic Riccati equations, balanced model reduction, H∞ loop shaping, and controller reduction.

Proceedings Article

Gaussian Process Optimization in the Bandit Setting: No Regret and Experimental Design

Niranjan Srinivas, +3 more

TL;DR: This work analyzes GP-UCB, an intuitive upper-confidence based algorithm, and bounds its cumulative regret in terms of maximal information gain, establishing a novel connection between GP optimization and experimental design and obtaining explicit sublinear regret bounds for many commonly used covariance functions.

Journal Article

Convex Computation of the Region of Attraction of Polynomial Control Systems

Didier Henrion, +1 more

- 01 Feb 2014 -

IEEE Transactions on Automatic Control

TL;DR: The ROA can be computed by solving a convex linear programming (LP) problem over the space of measures and this problem can be solved approximately via a classical converging hierarchy of convex finite-dimensional linear matrix inequalities (LMIs).

Proceedings Article

Reachability-based safe learning with Gaussian processes

Anayo K. Akametalu, +5 more

TL;DR: This work proposes a novel method that uses a principled approach to learn the system's unknown dynamics based on a Gaussian process model and iteratively approximates the maximal safe set and further incorporates safety into the reinforcement learning performance metric, allowing a better integration of safety and learning.

Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting

Niranjan Srinivas, +3 more

- 01 May 2012 -

IEEE Transactions on Information Theory

Related Papers (5)

Gaussian Processes for Machine Learning

Safe Model-based Reinforcement Learning with Stability Guarantees

Control Barrier Function Based Quadratic Programs for Safety Critical Systems

Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting

A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems