Long-term average cost control problems for continuous time Markov processes: A survey (Q788690): Difference between revisions

From MaRDI portal
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3929443 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4199298 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3953613 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5683339 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3878285 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Temps d'arrêt optimal, théorie générale des processus et processus de Markov / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discrete Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Continuous time control of Markov processes on an arbitrary state space: average return criterion / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5342182 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4086303 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4771778 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3266141 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Nondiscounted Continuous Time Markovian Decision Process with Countable State Space / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3318666 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Optimal Control of a Non-Terminating Diffusion Process with Reflection / rank
 
Normal rank
Property / cites work
 
Property / cites work: Étude asymptotique des systèmes markoviens à commande / rank
 
Normal rank
Property / cites work
 
Property / cites work: Maximal Average-Reward Policies for Semi-Markov Decision Processes With Arbitrary State and Action Space / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3658833 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5570526 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On a degenerate variational inequality with Neumann boundary conditions / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Optimal Stopping Time Problem for Degenerate Diffusions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Countable-state average-cost regenerative stopping problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4772533 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal control of diffusion processes with reflection / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4182645 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Some Impulse Control Problems with Long Run Average Cost / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotics in quasi-variational inequalities and ergodic control problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal control of service in tandem queues / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Nonexistence of $\varepsilon$-Optimal Randomized Stationary Policies in Average Cost Markov Decision Models / rank
 
Normal rank
Property / cites work
 
Property / cites work: Average cost semi-markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5615108 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Diffusion processes with boundary conditions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4749716 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4172681 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stationary Markovian Decision Problems and Perturbation Theory of Quasi-Compact Linear Operators / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Semi-Markov Controlled Models with an Average Reward Criterion / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3875811 / rank
 
Normal rank

Latest revision as of 10:50, 14 June 2024

scientific article
Language: English
Label: Long-term average cost control problems for continuous time Markov processes: A survey
Description: scientific article

    Statements

    Long-term average cost control problems for continuous time Markov processes: A survey (English)
    1983
    This paper is a survey of long-term average cost control problems, in which the cost is defined by \[ J(v)=\underline{\lim}_{T\to \infty}\frac{1}{T}E\int^{T}_{0}f(y(s,v),v(s))\,ds, \] where \(y(\cdot,v)\) is the response to the control \(v\). The objective is to minimize \(J(v)\). The following simple example illustrates the kind of phenomena treated in the paper. Example: Let \(V\) be the set of all Lipschitz continuous functions. For \(v\in V\), \(y_x(t)=y(t,v)\) is the solution of the SDE \[ dy_x(t)=v(y_x(t))\,dt+\sqrt{2}\,dw(t),\quad y_x(0)=x\ (\in R^1). \] The discounted cost is given by \(J_x^{\alpha}(v)=E\int^{\infty}_{0}e^{-\alpha t}(y_x(t)^2+v^2(y_x(t)))\,dt\). The optimal cost function \(u_{\alpha}(x)=\inf_{v\in V}J_x^{\alpha}(v)\) satisfies the Hamilton-Jacobi-Bellman (H-J-B) equation \(-u_{\alpha}''(x)-\inf_{v\in R^1}(vu'_{\alpha}(x)+v^2)+\alpha u_{\alpha}(x)=x^2\). In fact, \(u_{\alpha}(x)=(\sqrt{\alpha^2+4}-\alpha)(\frac{x^2}{2}+\frac{1}{\alpha})\). Hence \(\lambda=\lim_{\alpha \to 0}\alpha u_{\alpha}(x)=2\), and \(W(x)=\lim_{\alpha \to 0}(u_{\alpha}(x)-u_{\alpha}(0))=x^2\) satisfies \[ -W''(x)-\inf_{v\in R^1}(vW'(x)+v^2)+\lambda =x^2. \] Moreover, putting \(V_0=\{v\in V;\ \lim_{T\to \infty}\frac{1}{T}EW(y_x(T,v))=0\}\), \[ \lambda =\inf_{v\in V_0}\Big(\lim_{T\to \infty}\frac{1}{T}E\int^{T}_{0}\big(y^2_x(t)+v^2(y_x(t))\big)\,dt\Big), \] and an optimal control \(\hat v(x)=-x\) is a minimizing selection of \(vW'(x)+v^2\). In many cases, the problem reduces to finding a solution \((\lambda,W)\) of the H-J-B equation \(\inf_{v\in K}(A^vW+f(x,v))-\lambda =0\), and a solution is obtained from the discounted problem by letting the discount rate tend to zero, as in the example. The author treats three problems (continuous control, the stopping problem, and impulse control) and discusses some open problems.
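
    The example in the review can be checked numerically. The following is a minimal Python sketch, not taken from the paper: it plugs the quoted closed form for \(u_{\alpha}\) into the discounted H-J-B equation written with generator \(u''+vu'\) (which follows from the SDE above), checks the limits \(\alpha u_{\alpha}\to 2\) and \(u_{\alpha}(x)-u_{\alpha}(0)\to x^2\), and estimates the long-term average cost under \(\hat v(x)=-x\) by an Euler-Maruyama simulation. All function names, the step size, the horizon and the seed are illustrative assumptions.

```python
import math
import random


def coeff(alpha):
    """Coefficient sqrt(alpha^2 + 4) - alpha in the closed-form u_alpha."""
    return math.sqrt(alpha * alpha + 4.0) - alpha


def u(alpha, x):
    """Closed-form discounted optimal cost u_alpha(x) quoted in the review."""
    return coeff(alpha) * (x * x / 2.0 + 1.0 / alpha)


def discounted_residual(alpha, x):
    """Residual of -u'' - inf_v(v u' + v^2) + alpha u - x^2 (should be ~0)."""
    du = coeff(alpha) * x          # u'(x), differentiated by hand
    d2u = coeff(alpha)             # u''(x)
    inf_term = -du * du / 4.0      # the infimum over v is attained at v = -u'/2
    return -d2u - inf_term + alpha * u(alpha, x) - x * x


def ergodic_residual(x, lam=2.0):
    """Residual of -W'' - inf_v(v W' + v^2) + lambda - x^2 for W(x) = x^2."""
    dw = 2.0 * x
    d2w = 2.0
    return -d2w - (-dw * dw / 4.0) + lam - x * x


def simulated_average_cost(horizon=2000.0, dt=0.01, x0=0.0, seed=0):
    """Euler-Maruyama estimate of (1/T) * integral of (y^2 + v^2) with v(y) = -y.

    horizon, dt and seed are illustrative choices, not values from the paper.
    """
    rng = random.Random(seed)
    y = x0
    total = 0.0
    steps = int(horizon / dt)
    for _ in range(steps):
        v = -y                                   # candidate optimal feedback
        total += (y * y + v * v) * dt            # running cost y^2 + v^2
        y += v * dt + math.sqrt(2.0 * dt) * rng.gauss(0.0, 1.0)
    return total / horizon


if __name__ == "__main__":
    print("discounted residual:", discounted_residual(0.5, 1.3))
    print("ergodic residual:   ", ergodic_residual(-2.0))
    print("alpha * u_alpha(1): ", 1e-6 * u(1e-6, 1.0))
    print("simulated avg cost: ", simulated_average_cost())
```

    Both residuals should vanish up to rounding error, and the simulated average should lie near \(\lambda=2\), up to Monte Carlo and discretization error. Under \(\hat v(x)=-x\) the state is an Ornstein-Uhlenbeck process with stationary variance 1, so the stationary mean of \(y^2+v^2=2y^2\) is 2, which agrees with the value of \(\lambda\) in the review.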
    long-term average cost
    undiscounted stochastic control Markov processes
    continuous control, stopping problem
    impulse control