Safe exploration in model-based reinforcement learning using control barrier functions
From MaRDI portal
Abstract: This paper develops a model-based reinforcement learning (MBRL) framework for learning online the value function of an infinite-horizon optimal control problem while obeying safety constraints expressed as control barrier functions (CBFs). Our approach is facilitated by the development of a novel class of CBFs, termed Lyapunov-like CBFs (LCBFs), that retain the beneficial properties of CBFs for developing minimally-invasive safe control policies while also possessing desirable Lyapunov-like qualities such as positive semi-definiteness. We show how these LCBFs can be used to augment a learning-based control policy to guarantee safety and then leverage this approach to develop a safe exploration framework in a MBRL setting. We demonstrate that our approach can handle more general safety constraints than comparative methods via numerical examples.
Recommendations
- Safe control of nonlinear systems in LPV framework using model-based reinforcement learning
- Temporal logic guided safe model-based reinforcement learning: a hybrid systems approach
- Safe reinforcement learning for continuous spaces through Lyapunov-constrained behavior
- Safe Exploration of State and Action Spaces in Reinforcement Learning
- A comprehensive survey on safe reinforcement learning
Cites work
- A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems
- Adaptive Control Tutorial
- Adaptive nonlinear control without overparametrization
- Approximate optimal influence over an agent through an uncertain interaction dynamic
- Barrier Lyapunov functions for the control of output-constrained nonlinear systems
- Barrier function based model predictive control
- Control Barrier Function Based Quadratic Programs for Safety Critical Systems
- Data-Driven Economic NMPC Using Reinforcement Learning
- Data-based reinforcement learning approximate optimal control for an uncertain nonlinear system with control effectiveness faults
- Distributed Coordination Control for Multi-Robot Networks Using Lyapunov-Like Barrier Functions
- Efficient model-based reinforcement learning for approximate online optimal control
- Integral concurrent learning: adaptive control with parameter convergence using finite excitation
- Model-based reinforcement learning for approximate optimal regulation
- Nonlinear systems.
- Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
- Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers
- Reinforcement learning for optimal feedback control. A Lyapunov-based approach
- Robust control barrier functions for constrained stabilization of nonlinear systems
- Safe reinforcement learning for dynamical games
- Safe reinforcement learning: A control barrier function optimization approach
- Set invariance in control
- Switching in systems and control
Cited in
(22)- Safe reinforcement learning for continuous spaces through Lyapunov-constrained behavior
- Adaptive critic learning for approximate optimal event-triggered tracking control of nonlinear systems with prescribed performances
- An iterative scheme of safe reinforcement learning for nonlinear systems via barrier certificate generation
- Safe Exploration of State and Action Spaces in Reinforcement Learning
- Safety reinforcement learning control via transfer learning
- Safe adaptive output-feedback optimal control of a class of linear systems
- A predictive safety filter for learning-based control of constrained nonlinear dynamical systems
- Reinforcement learning control of constrained dynamic systems with uniformly ultimate boundedness stability guarantee
- A comprehensive survey on safe reinforcement learning
- Temporal logic guided safe model-based reinforcement learning: a hybrid systems approach
- Nonconvex policy search using variational inequalities
- 10.1162/jmlr.2003.3.4-5.803
- Safe robust multi-agent reinforcement learning with neural control barrier functions and safety attention mechanism
- Assured learning-enabled autonomy: a metacognitive reinforcement learning framework
- Learning safe neural network controllers with barrier certificates
- Safety-aware apprenticeship learning
- Safe reinforcement learning: A control barrier function optimization approach
- Safe control of nonlinear systems in LPV framework using model-based reinforcement learning
- Explicit explore, exploit, or escape \((E^4)\): near-optimal safety-constrained reinforcement learning in polynomial time
- Off‐policy model‐based end‐to‐end safe reinforcement learning
- Verifiably Safe Off-Model Reinforcement Learning
- Probabilistic counterexample guidance for safer reinforcement learning
This page was built for publication: Safe exploration in model-based reinforcement learning using control barrier functions
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2103658)