Pages that link to "Item:Q859737"
From MaRDI portal
The following pages link to A generalized Kalman filter for fixed point approximation and efficient temporal-difference learning (Q859737):
Displaying 6 items.
- Q-learning and policy iteration algorithms for stochastic shortest path problems (Q378731) (← links)
- On regression-based stopping times (Q708889) (← links)
- A new learning algorithm for optimal stopping (Q839001) (← links)
- Projected equation methods for approximate solution of large linear systems (Q1012492) (← links)
- Fundamental design principles for reinforcement learning algorithms (Q2094028) (← links)
- Approximate policy iteration: a survey and some new methods (Q2887629) (← links)