Long-term average cost control problems for continuous time Markov processes: A survey (Q788690): Difference between revisions

This paper is a survey of long term average cost control problems, namely the cost is defined by \[ J(v)=\underline{\lim}_{T\to \infty}\frac{1}{T}E\int^{T}_{0}f(y(s,v),v(s))ds, \] where $y(\cdot,v)$ is the response for control v. The object is to minimize J(v). The following simple example illustrate the kind of phenomena treated in the paper. Example: Let V be the totality of Lipschitz continuous functions. For $v\in V$, $y(t,v)$ is the solution of SDE, \[ dy_ x(t)=v(y_ x(t))dt+\sqrt{2}dw(t),\quad y_ x(0)=x(\in R^ 1). \] The discounted cost is given by $J_ x^{\alpha}(v)=E\int^{\infty}_{0}e^{-\alpha t}(y_ x(t)^ 2+v^ 2(y_ x(t)))dt$. The optimal cost function $u_{\alpha}(x)=\inf_{v\in V}J_ x^{\alpha}(v)$ satisfies the Hamilton-Jacobi-Bellman (H-J-B) equation $u_{\alpha}''(x)-\inf_{v\in R^ 1}(vu'_{\alpha}(x)+v^ 2)+\alpha u_{\alpha}=x^ 2$. Actually $u_{\alpha}(x)=(\sqrt{\alpha^ 2+4}-\alpha)(\frac{x^ 2}{2}+\frac{1}{\alpha})$. Hence, $\lim_{\alpha \to 0}\alpha u_{\alpha}(x)=2$ and $W(x)=\lim_{\alpha \to 0}(u_{\alpha}(x)- u_{\alpha}(0))=x^ 2$ satisfies \[ -W''(x)-\inf_{v\in R^ 1}(vW'(x)+v^ 2)+\lambda =x^ 2. \] Moreover, putting $V_ 0=\{v\in V;\lim_{T\to \infty}\frac{1}{T}EW(y_ x(T,v))=0\}$, \[ \lambda =\inf_{v\in V_ 0}(\lim_{T\to \infty}\frac{1}{T}E\int^{T}_{0}y^ 2_ x(t)+v^ 2(y_ x(t))dt) \] and an optimal control $\hat v(x)=-x$ is a minimum selection of $(vW'(x)+v^ 2)$. In many cases, the problem relates to a solution $(\lambda,W)$ of the H-J-B equation $\inf_{v\in K}(A^ vW+f(x,v))- \lambda =0,$ and using the discounted problem a solution is obtained. The author treats three problems: continuous control, stopping problem and impulse control, and discusses some open problems.

0 references

zbMATH Keywords

long-term average cost

0 references

undiscounted stochastic control Markov processes

0 references

continuous control, stopping problem

0 references

impulse control

0 references

reviewed by

Makiko Nisio

0 references

MaRDI profile type

MaRDI publication profile

0 references

0 references

0 references

0 references

0 references

0 references

Temps d'arrÊt optimal, théorie générale des processus et processus de Markov

0 references

Discrete Dynamic Programming

0 references

Continuous time control of Markov processes on an arbitrary state space: average return criterion

0 references

Q5342182

0 references

Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion

0 references

Q4086303

0 references

Q4771778

0 references

Q3266141

0 references

Nondiscounted Continuous Time Markovian Decision Process with Countable State Space

0 references

Q3318666

0 references

On Optimal Control of a Non-Terminating Diffusion Process with Reflection

0 references

Étude asymptotique des systèmes markoviens à commande

0 references

Maximal Average-Reward Policies for Semi-Markov Decision Processes With Arbitrary State and Action Space

0 references

Q3658833

0 references

Q5570526

0 references

On a degenerate variational inequality with Neumann boundary conditions

0 references

On the Optimal Stopping Time Problem for Degenerate Diffusions

0 references

Countable-state average-cost regenerative stopping problems

0 references

Q4772533

0 references

Optimal control of diffusion processes with reflection

0 references

Q4182645

0 references

On Some Impulse Control Problems with Long Run Average Cost

0 references

Asymptotics in quasi-variational inequalities and ergodic control problems

0 references

Optimal control of service in tandem queues

0 references

On the Nonexistence of $|varepsilon$-Optimal Randomized Stationary Policies in Average Cost Markov Decision Models

0 references

Average cost semi-markov decision processes

0 references

Q5615108

0 references

Diffusion processes with boundary conditions

0 references

Q4749716

0 references

Q4172681

0 references

Stationary Markovian Decision Problems and Perturbation Theory of Quasi-Compact Linear Operators

0 references

On Semi-Markov Controlled Models with an Average Reward Criterion

0 references

Q3875811

0 references

Identifiers

zbMATH Open document ID

0531.93068

0 references

DOI

10.1007/BF00046603

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:788690

@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / cites work @@
+Q3929443
@@ Property / cites work: Q3929443 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4199298
@@ Property / cites work: Q4199298 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3953613
@@ Property / cites work: Q3953613 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5683339
@@ Property / cites work: Q5683339 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3878285
@@ Property / cites work: Q3878285 / rank @@
+Normal rank
@@ Property / cites work @@
+Temps d'arrÊt optimal, théorie générale des processus et processus de Markov
+Normal rank
@@ Property / cites work @@
+Discrete Dynamic Programming
@@ Property / cites work: Discrete Dynamic Programming / rank @@
+Normal rank
@@ Property / cites work @@
+Continuous time control of Markov processes on an arbitrary state space: average return criterion
+Normal rank
@@ Property / cites work @@
+Q5342182
@@ Property / cites work: Q5342182 / rank @@
+Normal rank
@@ Property / cites work @@
+Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion
+Normal rank
@@ Property / cites work @@
+Q4086303
@@ Property / cites work: Q4086303 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4771778
@@ Property / cites work: Q4771778 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3266141
@@ Property / cites work: Q3266141 / rank @@
+Normal rank
@@ Property / cites work @@
+Nondiscounted Continuous Time Markovian Decision Process with Countable State Space
+Normal rank
@@ Property / cites work @@
+Q3318666
@@ Property / cites work: Q3318666 / rank @@
+Normal rank
@@ Property / cites work @@
+On Optimal Control of a Non-Terminating Diffusion Process with Reflection
+Normal rank
@@ Property / cites work @@
+Étude asymptotique des systèmes markoviens à commande
+Normal rank
@@ Property / cites work @@
+Maximal Average-Reward Policies for Semi-Markov Decision Processes With Arbitrary State and Action Space
+Normal rank
@@ Property / cites work @@
+Q3658833
@@ Property / cites work: Q3658833 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5570526
@@ Property / cites work: Q5570526 / rank @@
+Normal rank
@@ Property / cites work @@
+On a degenerate variational inequality with Neumann boundary conditions
+Normal rank
@@ Property / cites work @@
+On the Optimal Stopping Time Problem for Degenerate Diffusions
+Normal rank
@@ Property / cites work @@
+Countable-state average-cost regenerative stopping problems
+Normal rank
@@ Property / cites work @@
+Q4772533
@@ Property / cites work: Q4772533 / rank @@
+Normal rank
@@ Property / cites work @@
+Optimal control of diffusion processes with reflection
+Normal rank
@@ Property / cites work @@
+Q4182645
@@ Property / cites work: Q4182645 / rank @@
+Normal rank
@@ Property / cites work @@
+On Some Impulse Control Problems with Long Run Average Cost
+Normal rank
@@ Property / cites work @@
+Asymptotics in quasi-variational inequalities and ergodic control problems
+Normal rank
@@ Property / cites work @@
+Optimal control of service in tandem queues
@@ Property / cites work: Optimal control of service in tandem queues / rank @@
+Normal rank
@@ Property / cites work @@
+On the Nonexistence of $|varepsilon$-Optimal Randomized Stationary Policies in Average Cost Markov Decision Models
+Normal rank
@@ Property / cites work @@
+Average cost semi-markov decision processes
@@ Property / cites work: Average cost semi-markov decision processes / rank @@
+Normal rank
@@ Property / cites work @@
+Q5615108
@@ Property / cites work: Q5615108 / rank @@
+Normal rank
@@ Property / cites work @@
+Diffusion processes with boundary conditions
@@ Property / cites work: Diffusion processes with boundary conditions / rank @@
+Normal rank
@@ Property / cites work @@
+Q4749716
@@ Property / cites work: Q4749716 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4172681
@@ Property / cites work: Q4172681 / rank @@
+Normal rank
@@ Property / cites work @@
+Stationary Markovian Decision Problems and Perturbation Theory of Quasi-Compact Linear Operators
+Normal rank
@@ Property / cites work @@
+On Semi-Markov Controlled Models with an Average Reward Criterion
+Normal rank
@@ Property / cites work @@
+Q3875811
@@ Property / cites work: Q3875811 / rank @@
+Normal rank