Pages that link to "Item:Q2485935"
From MaRDI portal
The following pages link to Basis function adaptation in temporal difference reinforcement learning (Q2485935):
Displayed 10 items.
- Model selection in reinforcement learning (Q415618) (← links)
- Approximate dynamic programming via direct search in the space of value function approximations (Q713118) (← links)
- Projected equation methods for approximate solution of large linear systems (Q1012492) (← links)
- Reinforcement learning for a biped robot based on a CPG-actor-critic method (Q2383520) (← links)
- Restricted gradient-descent algorithm for value-function approximation in reinforcement learning (Q2389624) (← links)
- A tutorial on the cross-entropy method (Q2485925) (← links)
- Approximate policy iteration: a survey and some new methods (Q2887629) (← links)
- Learning Tetris Using the Noisy Cross-Entropy Method (Q3421374) (← links)
- Approximate dynamic programming via iterated Bellman inequalities (Q5256802) (← links)
- Actor-Critic Algorithms with Online Feature Adaptation (Q5270681) (← links)