Safe exploration in model-based reinforcement learning using control barrier functions

DOI10.1016/J.AUTOMATICA.2022.110684MaRDI QIDQ2103658zbMATH OpenOpenAlexFDO

Publication date 9 December 2022

Published in Automatica (Search for Journal in Brave)

Full work available at URL https://arxiv.org/abs/2104.08171

adaptive control reinforcement learning control barrier functions

Learning and adaptive systems in artificial intelligence (68T05) Dynamic programming in optimal control and differential games (49L20) Nonlinear systems in control theory (93C10) Adaptive control/observation systems (93C40)

Abstract: This paper develops a model-based reinforcement learning (MBRL) framework for learning online the value function of an infinite-horizon optimal control problem while obeying safety constraints expressed as control barrier functions (CBFs). Our approach is facilitated by the development of a novel class of CBFs, termed Lyapunov-like CBFs (LCBFs), that retain the beneficial properties of CBFs for developing minimally-invasive safe control policies while also possessing desirable Lyapunov-like qualities such as positive semi-definiteness. We show how these LCBFs can be used to augment a learning-based control policy to guarantee safety and then leverage this approach to develop a safe exploration framework in a MBRL setting. We demonstrate that our approach can handle more general safety constraints than comparative methods via numerical examples.

Recommendations

Cites work

Cited in

(22)

This page was built for publication: Safe exploration in model-based reinforcement learning using control barrier functions

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2103658)