Dynamic programming for ergodic control with partial observations. (Q2574544)

From MaRDI portal

scientific article

Language	Label	Description	Also known as
English	Dynamic programming for ergodic control with partial observations.	scientific article

Statements

scholarly article

Dynamic programming for ergodic control with partial observations. (English)

Vivek S. Borkar

Stochastic Processes and their Applications

publication date

29 November 2005

The paper derives a dynamic programming principle for the optimal control of a partially observed Markov process taking values in a Euclidean space. The functional to be minimized is the average (ergodic) cost over an infinite horizon, and the control space is compact. The problem is addressed by approximating the ergodic cost functional by a family of discounted cost functionals with discount factors converging to unity. The dynamic programming inequalities are first derived in discrete time, and the result is then carried over to partially observed Markov semimartingales in continuous time. The construction of optimal controls proceeds in the following steps: (1) restating the problem by means of a separation principle, which makes the control process adapted to the observation process; (2) changing the probability measure in order to eliminate variability in the marginal distribution of the observation process; (3) introducing a stability assumption for the state process in Lyapunov function form; (4) embedding the state process into another one with a "doubled" range of values, for which an accessible atom exists. The argument draws on earlier results of the same author concerning optimal ergodic control of partially observed finite Markov chains.
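
The approximation of the ergodic cost by discounted costs mentioned in the review can be illustrated on a toy model. The sketch below is not the paper's construction: it assumes a fully observed chain with two states and two actions, and the transition probabilities `P`, the costs `COST`, and all function names are hypothetical values chosen for illustration. It computes the discounted value function by value iteration and compares the discounted value, scaled by one minus the discount factor, with the optimal long-run average cost found by brute force over stationary deterministic policies.

```python
from itertools import product

# Hypothetical fully observed 2-state, 2-action model (illustration only).
# P[a][i][j]: probability of moving from state i to state j under action a.
P = [
    [[0.9, 0.1], [0.2, 0.8]],
    [[0.5, 0.5], [0.6, 0.4]],
]
# COST[i][a]: one-step cost of taking action a in state i.
COST = [[1.0, 2.0], [3.0, 0.5]]
STATES = range(2)
ACTIONS = range(2)


def discounted_value(beta, tol=1e-9, max_iter=1_000_000):
    """Value iteration for the beta-discounted problem, with 0 < beta < 1."""
    v = [0.0, 0.0]
    for _ in range(max_iter):
        v_new = [
            min(
                COST[i][a] + beta * sum(P[a][i][j] * v[j] for j in STATES)
                for a in ACTIONS
            )
            for i in STATES
        ]
        if max(abs(x - y) for x, y in zip(v_new, v)) < tol:
            return v_new
        v = v_new
    return v


def ergodic_cost(policy):
    """Long-run average cost of a stationary policy (policy[i] = action in state i).

    For a two-state chain with p = P(0 -> 1) and q = P(1 -> 0), both positive,
    the stationary distribution is (q / (p + q), p / (p + q)).
    """
    p = P[policy[0]][0][1]
    q = P[policy[1]][1][0]
    pi = (q / (p + q), p / (p + q))
    return sum(pi[i] * COST[i][policy[i]] for i in STATES)


# Optimal average cost by enumerating the four stationary deterministic policies.
optimal_ergodic_cost = min(ergodic_cost(pol) for pol in product(ACTIONS, repeat=2))

if __name__ == "__main__":
    print(f"optimal ergodic cost: {optimal_ergodic_cost:.6f}")
    for beta in (0.9, 0.99, 0.999):
        scaled = [(1.0 - beta) * x for x in discounted_value(beta)]
        print(beta, [round(s, 6) for s in scaled])
```

As the discount factor approaches one, the scaled discounted values in both states should approach the optimal average cost. In the partially observed setting of the paper the state of the dynamic program is instead a conditional distribution, which this toy example does not attempt to model.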

zbMATH Keywords

Markov process

ergodic cost

MaRDI profile type

MaRDI publication profile

Bounds for the fundamental solution of a parabolic equation

A New Approach to the Limit Theory of Recurrent Markov Chains

Occupation measures for controlled Markov processes: Characterization and optimality

A remark on the attainable distributions of controlled diffusions

White-Noise Representations in Stochastic Realization Theory

The value function in ergodic control of diffusion processes with partial observations

Average Cost Dynamic Programming Equations For Controlled Markov Chains With Partial Observations

The value function in ergodic control of diffusion processes with partial observations II

Dynamic Programming Conditions for Partially Observable Stochastic Systems

Optimal Control for Partially Observed Diffusions

Mimicking the one-dimensional marginal distributions of processes having an Ito differential

Markov chains and stochastic stability

A splitting technique for Harris recurrent Markov chains

Necessary and Sufficient Dynamic Programming Conditions for Continuous Time Stochastic Optimal Control

Martingale conditions for the optimal control of continuous time stochastic systems

Survey of Measurable Selection Theorems

Identifiers

zbMATH Open document ID

DOI

10.1016/S0304-4149(02)00190-4

Mathematics Subject Classification ID

zbMATH DE Number
