Control of Markov Chains with Long-Run Average Cost Criterion: The Dynamic Programming Equations

From MaRDI portal
Revision as of 15:29, 5 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:3821376

DOI10.1137/0327034zbMath0668.60059OpenAlexW1976732220MaRDI QIDQ3821376

Vivek S. Borkar

Publication date: 1989

Published in: SIAM Journal on Control and Optimization (Search for Journal in Brave)

Full work available at URL: https://semanticscholar.org/paper/7f987f133d136e6a10e8e2dc36b0ae8ea896d2ce




Related Items (21)

On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processesA note on the existence of optimal stationary policies for average Markov decision processes with countable statesMarkov decision processes with multiple costsControlled diffusions with constraintsRecent results on conditions for the existence of average optimal stationary policiesThe average cost of Markov chains subject to total variation distance uncertaintyErgodic and adaptive control of nearest-neighbor motionsAverage optimality in dynamic programming on Borel spaces -- unbounded costs and controlsOn strong average optimality of Markov decision processes with unbounded costsComparing recent assumptions for the existence of average optimal stationary policiesEconomic design of memory-type control charts: the fallacy of the formula proposed by Lorenzen and Vance (1986)Unnamed ItemControlled Markov chains with constraints.Optimal Distributed Uplink Channel Allocation: A Constrained MDP FormulationA survey of Markov decision models for control of networks of queuesRemarks on the existence of solutions to the average cost optimality equation in Markov decision processesInfinite Horizon Average Cost Dynamic Programming Subject to Total Variation Distance AmbiguityThe Kumar-Becker-Lin scheme revisitedAverage optimality for Markov decision processes in borel spaces: a new condition and approachThe convergence of value iteration in average cost Markov decision chainsDenumerable state stochastic games with limiting average payoff






This page was built for publication: Control of Markov Chains with Long-Run Average Cost Criterion: The Dynamic Programming Equations