Model-based reinforcement learning for approximate optimal control with temporal logic specifications

From MaRDI portal
Publication:6201592

DOI: 10.1145/3447928.3456639
arXiv: 2101.07156
Wikidata: Q130844063
Scholia: Q130844063
MaRDI QID: Q6201592
FDO: Q6201592


Authors: M. Cohen, Calin Belta


Publication date: 21 February 2024

Published in: Proceedings of the 24th International Conference on Hybrid Systems: Computation and Control

Abstract: In this paper we study the problem of synthesizing optimal control policies for uncertain continuous-time nonlinear systems from syntactically co-safe linear temporal logic (scLTL) formulas. We formulate this problem as a sequence of reach-avoid optimal control sub-problems. We show that the resulting hybrid optimal control policy guarantees the satisfaction of a given scLTL formula by constructing a barrier certificate. Since solving each optimal control problem may be computationally intractable, we take a learning-based approach to approximately solve this sequence of optimal control problems online without requiring full knowledge of the system dynamics. Using Lyapunov-based tools, we develop sufficient conditions under which our approximate solution maintains correctness. Finally, we demonstrate the efficacy of the developed method with a numerical example.
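The abstract describes decomposing an scLTL task into an ordered sequence of reach-avoid optimal control sub-problems. The following is a hypothetical minimal sketch of that decomposition idea, not the authors' implementation: the region labels, 1D workspace, and greedy stand-in controller are all illustrative assumptions (the paper uses a learned approximate optimal controller with barrier and Lyapunov certificates).

```python
# Hypothetical sketch (assumed regions and dynamics, not the paper's code):
# an scLTL task read off as an ordered list of reach-avoid sub-problems,
# e.g. "avoid C until reaching A, then avoid C until reaching B".

from dataclasses import dataclass

@dataclass
class ReachAvoid:
    reach: str   # label of the set that must be reached
    avoid: str   # label of the set that must be avoided throughout

# Sequence of sub-problems obtained from an accepting path of the
# specification automaton (here hard-coded for the toy spec above).
subproblems = [ReachAvoid(reach="A", avoid="C"),
               ReachAvoid(reach="B", avoid="C")]

# Toy 1D workspace: labeled intervals (assumed for illustration).
regions = {"A": (4.0, 5.0), "B": (9.0, 10.0), "C": (6.5, 7.0)}

def in_region(x, name):
    lo, hi = regions[name]
    return lo <= x <= hi

def solve_reach_avoid(x, sub, step=0.1, max_steps=500):
    """Greedy stand-in for the paper's learned optimal controller:
    step toward the reach set, detouring past the avoid set."""
    target = sum(regions[sub.reach]) / 2.0
    traj = [x]
    for _ in range(max_steps):
        if in_region(x, sub.reach):
            return x, traj
        direction = 1.0 if target > x else -1.0
        x_next = x + direction * step
        if in_region(x_next, sub.avoid):      # skip past the unsafe set
            lo, hi = regions[sub.avoid]
            x_next = hi + step if direction > 0 else lo - step
        x = x_next
        traj.append(x)
    raise RuntimeError("reach-avoid sub-problem not solved")

# Solving the sub-problems in order satisfies the overall toy task.
x = 0.0
for sub in subproblems:
    x, traj = solve_reach_avoid(x, sub)
    assert not any(in_region(p, sub.avoid) for p in traj)
print(f"final state {x:.2f}: task satisfied")
```

In the paper, each such sub-problem is solved approximately online via model-based reinforcement learning, with Lyapunov-based conditions guaranteeing that the approximate solutions still satisfy the scLTL formula.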


Full work available at URL: https://arxiv.org/abs/2101.07156







Cited In (1)





This page was built for publication: Model-based reinforcement learning for approximate optimal control with temporal logic specifications
