Pages that link to "Item:Q2485935"

From MaRDI portal

← Basis function adaptation in temporal difference reinforcement learning (Q2485935)

Jump to:navigation, search

The following pages link to Basis function adaptation in temporal difference reinforcement learning (Q2485935):

Displayed 10 items.

Model selection in reinforcement learning (Q415618) ‎ (← links)
Approximate dynamic programming via direct search in the space of value function approximations (Q713118) ‎ (← links)
Projected equation methods for approximate solution of large linear systems (Q1012492) ‎ (← links)
Reinforcement learning for a biped robot based on a CPG-actor-critic method (Q2383520) ‎ (← links)
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning (Q2389624) ‎ (← links)
A tutorial on the cross-entropy method (Q2485925) ‎ (← links)
Approximate policy iteration: a survey and some new methods (Q2887629) ‎ (← links)
Learning Tetris Using the Noisy Cross-Entropy Method (Q3421374) ‎ (← links)
Approximate dynamic programming via iterated Bellman inequalities (Q5256802) ‎ (← links)
Actor-Critic Algorithms with Online Feature Adaptation (Q5270681) ‎ (← links)

Retrieved from "https://portal.mardi4nfdi.de/wiki/Special:WhatLinksHere/Item:Q2485935"