Dynamic programming for ergodic control with partial observations. (Q2574544): Difference between revisions

The paper derives a dynamic programming principle for optimal control of a partially observed Markov process taking values in a Euclidean space. The minimized functional is that of average (ergodic) costs over infinite horizon. The control space is compact. The problem is addressed by approximating the original ergodic cost functional by a family of discounted cost functionals with discount factors converging to unity. The dynamic programming principle inequalities are first derived in discrete time and the result is then carried over to partially observed Markov semimartingales in continuous time. The construction of optimal controls proceeds in the following steps: 1. restating the problem by means of a separation principle which makes the control process adapted to the process of observations, 2. changing the probability measure in order to eliminate variability in the marginal distribution of the observation process, 3. introducing a stability assumption for the state process in a Lyapunov function form, 4. embedding the state process into another one with a ``doubled'' range of values, for which an accessible atom exists. The argument draws on earlier results of the same author concerning optimal ergodic control of partially observed finite Markov chains.

0 references

zbMATH Keywords

Markov process

0 references

ergodic cost

0 references

reviewed by

Alexis Derviz

0 references

MaRDI profile type

MaRDI publication profile

0 references

cites work

Bounds for the fundamental solution of a parabolic equation

0 references

A New Approach to the Limit Theory of Recurrent Markov Chains

0 references

Occupation measures for controlled Markov processes: Characterization and optimality

0 references

Q5560061

0 references

A remark on the attainable distributions of controlled diffusions

0 references

Q3995082

0 references

White-Noise Representations in Stochastic Realization Theory

0 references

Q4858374

0 references

The value function in ergodic control of diffusion processes with partial observations

0 references

Average Cost Dynamic Programming Equations For Controlled Markov Chains With Partial Observations

0 references

The value function in ergodic control of diffusion processes with partial observations II

0 references

Dynamic Programming Conditions for Partially Observable Stochastic Systems

0 references

Optimal Control for Partially Observed Diffusions

0 references

Mimicking the one-dimensional marginal distributions of processes having an Ito differential

0 references

Q4255598

0 references

Q3959169

0 references

Q5562267

0 references

Markov chains and stochastic stability

0 references

A splitting technique for Harris recurrent Markov chains

0 references

Necessary and Sufficient Dynamic Programming Conditions for Continuous Time Stochastic Optimal Control

0 references

Martingale conditions for the optimal control of continuous time stochastic systems

0 references

Survey of Measurable Selection Theorems

0 references

Identifiers

zbMATH Open document ID

1075.60522

0 references

DOI

10.1016/S0304-4149(02)00190-4

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:2574544

@@ Property / cites work @@
+Bounds for the fundamental solution of a parabolic equation
+Normal rank
@@ Property / cites work @@
+A New Approach to the Limit Theory of Recurrent Markov Chains
+Normal rank
@@ Property / cites work @@
+Occupation measures for controlled Markov processes: Characterization and optimality
+Normal rank
@@ Property / cites work @@
+Q5560061
@@ Property / cites work: Q5560061 / rank @@
+Normal rank
@@ Property / cites work @@
+A remark on the attainable distributions of controlled diffusions
+Normal rank
@@ Property / cites work @@
+Q3995082
@@ Property / cites work: Q3995082 / rank @@
+Normal rank
@@ Property / cites work @@
+White-Noise Representations in Stochastic Realization Theory
+Normal rank
@@ Property / cites work @@
+Q4858374
@@ Property / cites work: Q4858374 / rank @@
+Normal rank
@@ Property / cites work @@
+The value function in ergodic control of diffusion processes with partial observations
+Normal rank
@@ Property / cites work @@
+Average Cost Dynamic Programming Equations For Controlled Markov Chains With Partial Observations
+Normal rank
@@ Property / cites work @@
+The value function in ergodic control of diffusion processes with partial observations II
+Normal rank
@@ Property / cites work @@
+Dynamic Programming Conditions for Partially Observable Stochastic Systems
+Normal rank
@@ Property / cites work @@
+Optimal Control for Partially Observed Diffusions
@@ Property / cites work: Optimal Control for Partially Observed Diffusions / rank @@
+Normal rank
@@ Property / cites work @@
+Mimicking the one-dimensional marginal distributions of processes having an Ito differential
+Normal rank
@@ Property / cites work @@
+Q4255598
@@ Property / cites work: Q4255598 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3959169
@@ Property / cites work: Q3959169 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5562267
@@ Property / cites work: Q5562267 / rank @@
+Normal rank
@@ Property / cites work @@
+Markov chains and stochastic stability
@@ Property / cites work: Markov chains and stochastic stability / rank @@
+Normal rank
@@ Property / cites work @@
+A splitting technique for Harris recurrent Markov chains
+Normal rank
@@ Property / cites work @@
+Necessary and Sufficient Dynamic Programming Conditions for Continuous Time Stochastic Optimal Control
+Normal rank
@@ Property / cites work @@
+Martingale conditions for the optimal control of continuous time stochastic systems
+Normal rank
@@ Property / cites work @@
+Survey of Measurable Selection Theorems
@@ Property / cites work: Survey of Measurable Selection Theorems / rank @@
+Normal rank