scientific article; zbMATH DE number 3628710
From MaRDI portal
Publication:4190426
zbMath0404.90051MaRDI QIDQ4190426
Publication date: 1976
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Countable State SpaceUnbounded RewardsContracting Markov Decision ProcessesEpsilon-Optimal Stationary Policies
Minimax problems in mathematical programming (90C47) Stopping times; optimal stopping problems; gambling theory (60G40) Markov renewal processes, semi-Markov processes (60K15) Research exposition (monographs, survey articles) pertaining to operations research and mathematical programming (90-02)
Related Items (13)
A class of procedures to compute the optimal value f unction in a Markovian decision problem ⋮ Some basic concepts of numerical treatment of Markov decision models ⋮ Survey of linear programming for standard and nonstandard Markovian control problems. Part I: Theory ⋮ (Approximate) iterated successive approximations algorithm for sequential decision processes ⋮ On theory and algorithms for Markov decision problems with the total reward criterion ⋮ The method of value oriented successive approximations for the average reward Markov decision process ⋮ Solving linear systems by methods based on a probabilistic interpretation ⋮ Denumerable semi-Markov decision chains with small interest rates ⋮ Discounted Markov games: Generalized policy iteration method ⋮ A \(K\)-step look-ahead analysis of value iteration algorithms for Markov decision processes ⋮ On the convergence of successive approximations in dynamic programming with non-zero terminal reward ⋮ Infinite horizon Markov decision processes with unknown or variable discount factors ⋮ Truncated policy iteration methods
This page was built for publication: