scientific article
From MaRDI portal
Publication:3585656
zbMath1195.93148MaRDI QIDQ3585656
Publication date: 20 August 2010
Full work available at URL: https://eudml.org/doc/196935
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
finite state Markov decision processesdiscounted and average costselimination of suboptimal policies
Markov chains (discrete-time Markov processes on discrete state spaces) (60J10) Optimal stochastic control (93E20) Markov and semi-Markov decision processes (90C40)
Cites Work
- Unnamed Item
- Unnamed Item
- Dynamic programming, Markov chains, and the method of successive approximations
- Uniform convergence of value iteration policies for discounted Markov decision processes
- A modified dynamic programming method for Markovian decision problems
- Action Elimination Procedures for Modified Policy Iteration Algorithms
- Erratum—Tests for Suboptimal Actions in Discounted Markov Programming
- Note—A Test for Nonoptimal Actions in Undiscounted Finite Markov Decision Chains
- Modified Policy Iteration Algorithms for Discounted Markov Decision Problems
- Letter to the Editor—A Test for Suboptimal Actions in Markovian Decision Problems
- On Finding the Maximal Gain for Markov Decision Processes
- Technical Note—Bounds on the Gain of a Markov Decision Process
- Technical Note—Elimination of Suboptimal Actions in Markov Decision Problems
This page was built for publication: