Average, Sensitive and Blackwell Optimal Policies in Denumerable Markov Decision Chains with Unbounded Rewards
Publication:3798497
DOI: 10.1287/moor.13.3.395
zbMath: 0652.90099
MaRDI QID: Q3798497
Publication date: 1988
Published in: Mathematics of Operations Research
Full work available at URL: http://repub.eur.nl/pub/2251
Blackwell optimality; compact action sets; denumerable state space; average optimality; discount optimality; discrete-time Markov decision chain
90C40: Markov and semi-Markov decision processes
Related Items
Simultaneous recurrent conditions on countable state Markov chains
Taylor series expansions for stationary Markov chains
Characterization and sufficient conditions for normed ergodicity of Markov chains
Approximation solution and suboptimality for discounted semi-Markov decision problems with countable state space
Blackwell optimal policies in a Markov decision process with a Borel state space
Asymptotic properties of constrained Markov Decision Processes
Strong bounds on perturbations
A survey of recent results on continuous-time Markov decision processes (with comments and rejoinder)
Strong 1-optimal stationary policies in denumerable Markov decision processes
Conditions for existence of average and Blackwell optimal stationary policies in denumerable Markov decision processes
Denumerable semi-Markov decision chains with small interest rates
Super-overtaking optimal policies for Markov control processes
Are limits of \(\alpha\)-discounted optimal policies Blackwell optimal? A counterexample
Zero-sum Markov games and worst-case optimal control of queueing systems
Blackwell optimality in the class of Markov policies for continuous-time controlled Markov chains
The existence of sensitive optimal policies in two multi-dimensional queueing models
Average optimality for Markov decision processes in Borel spaces: a new condition and approach
Geometric Ergodicity of the ALOHA-system and a Coupled Processors Model
Optimality equations and sensitive optimality in bounded Markov decision processes