Pages that link to "Item:Q2887630"
From MaRDI portal
The following pages link to A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications (Q2887630):
Displayed 6 items.
- Potential-based least-squares policy iteration for a parameterized feedback control system (Q289143) (← links)
- New approximate dynamic programming algorithms for large-scale undiscounted Markov decision processes and their application to optimize a production and distribution system (Q320866) (← links)
- A perturbation approach to a class of discounted approximate value iteration algorithms with Borel spaces (Q330284) (← links)
- Perspectives of approximate dynamic programming (Q333093) (← links)
- Temporal difference-based policy iteration for optimal control of stochastic systems (Q467477) (← links)
- A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs (Q5227201) (← links)