Comparing Policies in Markov Decision Processes: Mandl's Lemma Revisited
From MaRDI portal
Publication:3200907
Recommendations
- scientific article; zbMATH DE number 970511
- Adaptive control of discounted Markov decision chains
- Optimal Adaptive Policies for Markov Decision Processes
- Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs
- Value iteration and approximately optimal stationary policies in finite-state average Markov decision chains
Cited in
(3)
This page was built for publication: Comparing Policies in Markov Decision Processes: Mandl's Lemma Revisited
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3200907)