On-the-fly control of unknown nonlinear systems with sublinear regret
From MaRDI portal
Publication:6402847
DOI10.1109/TAC.2022.3186425arXiv2206.11103MaRDI QIDQ6402847FDOQ6402847
Authors: A. P. Vinod, Arie Israel, Ufuk Topcu
Publication date: 22 June 2022
Abstract: We study the problem of data-driven, constrained control of unknown nonlinear dynamics from a single ongoing and finite-horizon trajectory. We consider a one-step optimal control problem with a smooth, black-box objective, typically a composition of a known cost function and the unknown dynamics. We investigate an on-the-fly control paradigm, i.e., at each time step, the evolution of the dynamics and the first-order information of the cost are provided only for the executed control action. We propose an optimization-based control algorithm that iteratively minimizes a data-driven surrogate function for the unknown objective. We prove that the proposed approach incurs sublinear cumulative regret (step-wise suboptimality with respect to an optimal one-step controller) and is worst-case optimal among a broad class of data-driven control algorithms. We also present tractable reformulations of the approach that can leverage off-the-shelf solvers for efficient implementations.
This page was built for publication: On-the-fly control of unknown nonlinear systems with sublinear regret
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6402847)