Pages that link to "Item:Q1586803"
From MaRDI portal
The following pages link to On the existence of fixed points for approximate value iteration and temporal-difference learning (Q1586803):
Displaying 8 items.
- A perturbation approach to a class of discounted approximate value iteration algorithms with Borel spaces (Q330284) (← links)
- A generalized Kalman filter for fixed point approximation and efficient temporal-difference learning (Q859737) (← links)
- Shape constraints in economics and operations research (Q1730901) (← links)
- Application of interval iterations to the entrainment problem in respiratory physiology (Q3579050) (← links)
- (Q4999027) (← links)
- A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs (Q5227201) (← links)
- Analyzing Approximate Value Iteration Algorithms (Q5868951) (← links)
- Least squares policy iteration with instrumental variables vs. direct policy search: comparison against optimal benchmarks using energy storage (Q5882386) (← links)