On the Optimality of Structured Policies in Countable Stage Decision Processes. II: Positive and Negative Problems
From MaRDI portal
Publication:4132298
Cited in (4)
- On convergence of value iteration for a class of total cost Markov decision processes
- A mixed value and policy iteration method for stochastic control with universally measurable policies
- Conditions for characterizing the structure of optimal strategies in infinite-horizon dynamic programs
- On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes