Pages that link to "Item:Q2887630"
From MaRDI portal
The following pages link to A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications (Q2887630):
Displaying 9 items.
- Potential-based least-squares policy iteration for a parameterized feedback control system (Q289143) (← links)
- New approximate dynamic programming algorithms for large-scale undiscounted Markov decision processes and their application to optimize a production and distribution system (Q320866) (← links)
- A perturbation approach to a class of discounted approximate value iteration algorithms with Borel spaces (Q330284) (← links)
- Perspectives of approximate dynamic programming (Q333093) (← links)
- Temporal difference-based policy iteration for optimal control of stochastic systems (Q467477) (← links)
- Convergence of deep fictitious play for stochastic differential games (Q2170300) (← links)
- A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs (Q5227201) (← links)
- Least squares policy iteration with instrumental variables vs. direct policy search: comparison against optimal benchmarks using energy storage (Q5882386) (← links)
- Off-line approximate dynamic programming for the vehicle routing problem with a highly variable customer basis and stochastic demands (Q6047879) (← links)