Long-term average cost control problems for continuous time Markov processes: A survey (Q788690)

From MaRDI portal

Jump to:navigation, search

scientific article

Language	Label	Description	Also known as
English	Long-term average cost control problems for continuous time Markov processes: A survey	scientific article

Statements

scholarly article

0 references

Long-term average cost control problems for continuous time Markov processes: A survey (English)

0 references

0 references

Acta Applicandae Mathematicae

0 references

publication date

1983

0 references

This paper is a survey of long term average cost control problems, namely the cost is defined by \[ J(v)=\underline{\lim}_{T\to \infty}\frac{1}{T}E\int^{T}_{0}f(y(s,v),v(s))ds, \] where \(y(\cdot,v)\) is the response for control v. The object is to minimize J(v). The following simple example illustrate the kind of phenomena treated in the paper. Example: Let V be the totality of Lipschitz continuous functions. For \(v\in V\), \(y(t,v)\) is the solution of SDE, \[ dy_ x(t)=v(y_ x(t))dt+\sqrt{2}dw(t),\quad y_ x(0)=x(\in R^ 1). \] The discounted cost is given by \(J_ x^{\alpha}(v)=E\int^{\infty}_{0}e^{-\alpha t}(y_ x(t)^ 2+v^ 2(y_ x(t)))dt\). The optimal cost function \(u_{\alpha}(x)=\inf_{v\in V}J_ x^{\alpha}(v)\) satisfies the Hamilton-Jacobi-Bellman (H-J-B) equation \(u_{\alpha}''(x)-\inf_{v\in R^ 1}(vu'_{\alpha}(x)+v^ 2)+\alpha u_{\alpha}=x^ 2\). Actually \(u_{\alpha}(x)=(\sqrt{\alpha^ 2+4}-\alpha)(\frac{x^ 2}{2}+\frac{1}{\alpha})\). Hence, \(\lim_{\alpha \to 0}\alpha u_{\alpha}(x)=2\) and \(W(x)=\lim_{\alpha \to 0}(u_{\alpha}(x)- u_{\alpha}(0))=x^ 2\) satisfies \[ -W''(x)-\inf_{v\in R^ 1}(vW'(x)+v^ 2)+\lambda =x^ 2. \] Moreover, putting \(V_ 0=\{v\in V;\lim_{T\to \infty}\frac{1}{T}EW(y_ x(T,v))=0\}\), \[ \lambda =\inf_{v\in V_ 0}(\lim_{T\to \infty}\frac{1}{T}E\int^{T}_{0}y^ 2_ x(t)+v^ 2(y_ x(t))dt) \] and an optimal control \(\hat v(x)=-x\) is a minimum selection of \((vW'(x)+v^ 2)\). In many cases, the problem relates to a solution \((\lambda,W)\) of the H-J-B equation \(\inf_{v\in K}(A^ vW+f(x,v))- \lambda =0,\) and using the discounted problem a solution is obtained. The author treats three problems: continuous control, stopping problem and impulse control, and discusses some open problems.

0 references

zbMATH Keywords

long-term average cost

0 references

undiscounted stochastic control Markov processes

0 references

continuous control, stopping problem

0 references

impulse control

0 references

0 references

MaRDI profile type

MaRDI publication profile

0 references

0 references

0 references

0 references

0 references

0 references

Temps d'arrÊt optimal, théorie générale des processus et processus de Markov

0 references

Discrete Dynamic Programming

0 references

Continuous time control of Markov processes on an arbitrary state space: average return criterion

0 references

0 references

Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion

0 references

0 references

0 references

0 references

Nondiscounted Continuous Time Markovian Decision Process with Countable State Space

0 references

0 references

On Optimal Control of a Non-Terminating Diffusion Process with Reflection

0 references

Étude asymptotique des systèmes markoviens à commande

0 references

Maximal Average-Reward Policies for Semi-Markov Decision Processes With Arbitrary State and Action Space

0 references

0 references

0 references

On a degenerate variational inequality with Neumann boundary conditions

0 references

On the Optimal Stopping Time Problem for Degenerate Diffusions

0 references

Countable-state average-cost regenerative stopping problems

0 references

0 references

Optimal control of diffusion processes with reflection

0 references

0 references

On Some Impulse Control Problems with Long Run Average Cost

0 references

Asymptotics in quasi-variational inequalities and ergodic control problems

0 references

Optimal control of service in tandem queues

0 references

On the Nonexistence of $|varepsilon$-Optimal Randomized Stationary Policies in Average Cost Markov Decision Models

0 references

Average cost semi-markov decision processes

0 references

0 references

Diffusion processes with boundary conditions

0 references

0 references

0 references

Stationary Markovian Decision Problems and Perturbation Theory of Quasi-Compact Linear Operators

0 references

On Semi-Markov Controlled Models with an Average Reward Criterion

0 references

0 references

Identifiers

zbMATH Open document ID

0 references

10.1007/BF00046603

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

zbMATH DE Number

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:788690

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q788690&oldid=34693933"