An information based approach to stochastic control problems
From MaRDI portal
Publication:2019688
Abstract: An information based method for solving stochastic control problems with partial observation is proposed. First, information-theoretic lower bounds on the cost function are analysed. It is shown, under rather weak assumptions, that the reduction of the expected cost achieved by closed-loop control, compared to the best open-loop strategy, is upper bounded by a non-decreasing function of the mutual information between the control variables and the state trajectory. On the basis of this result, an Information Based Control (IBC) method is developed. The main idea of the IBC is to replace the original control task with a sequence of control problems that are relatively easy to solve and in which information about the state of the system is actively generated. Two examples of the operation of the IBC are given. It is shown that, at least in these examples, the IBC is able to find the optimal solution without using dynamic programming. Hence the computational complexity of the IBC is substantially smaller than that of dynamic programming, which is the main advantage of the proposed method.
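The bound described in the abstract can be sketched in symbols as follows. The notation is an assumption introduced here for illustration and is not taken from the publication: J*_OL denotes the expected cost of the best open-loop strategy, J_CL the expected cost of a closed-loop strategy, U the control variables, X the state trajectory, I(U; X) their mutual information, and \varphi some non-decreasing function.

\[
  J^{*}_{\mathrm{OL}} - J_{\mathrm{CL}} \;\le\; \varphi\bigl(I(U; X)\bigr),
  \qquad \varphi \ \text{non-decreasing}.
\]

Read this way, a closed-loop strategy can improve on the best open-loop one only to the extent that the controls carry information about the state trajectory. The specific form of \varphi and the assumptions under which the bound holds are given in the publication itself and are not reproduced here.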
Recommendations
- Example for equivalence of dual and information-based optimal control
- Control and stabilization with insufficient information
- A forward method for optimal stochastic nonlinear and adaptive control
- Incremental value of information for discrete-time partially observed stochastic systems
- The information path functional approach to solution of a controllable stochastic problem
Cites work
- scientific article; zbMATH DE number 5562427
- scientific article; zbMATH DE number 2063875
- scientific article; zbMATH DE number 3224816
- A stochastic quasi-Newton method for large-scale optimization
- Adaptive dual control. Theory and applications.
- An active exploration method for data efficient reinforcement learning
- Bayesian filtering and smoothing
- Control Under Communication Constraints
- Discrete-time entropy formulation of optimal and adaptive control problems
- Elements of Information Theory
- Entropy formulation of optimal and adaptive control
- Estimation of entropy and other functionals of a multivariate density
- Incremental value of information for discrete-time partially observed stochastic systems
- Information and entropy flow in the Kalman-Bucy filter
- Nonlinear Bayesian estimation using Gaussian sum approximations
- Optimal Measurement Methods for Distributed Parameter System Identification
- Partial synchronization in stochastic dynamical networks with switching communication channels
- Point-based value iteration for continuous POMDPs
- Role of mutual information in entropy production under information exchanges
Cited in (11)
- Solution to the variation problem for information path functional of a controlled random process
- scientific article; zbMATH DE number 6148138
- Asymmetric information control for stochastic systems with different intermittent observations
- On the choice of the cost function for nonlinear model predictive control: a multi-criteria evaluation
- Information-theoretic lower bounds of the quadratic cost in stochastic control with partial observation
- Information Relaxation and Dual Formulation of Controlled Markov Diffusions
- scientific article; zbMATH DE number 836586
- The information path functional approach to solution of a controllable stochastic problem
- Mean field approach to stochastic control with partial information
- Optimal Control for Stochastic Systems With Multiple Controllers of Different Information Structures
- Reduction of future information required for optimal control of dynamic systems: a pseudostochastic model