scientific article; zbMATH DE number 2087264
From MaRDI portal
Publication:4737965
zbMath: 1065.68608, MaRDI QID: Q4737965
Ronald Parr, Michail G. Lagoudakis, Michael L. Littman
Publication date: 11 August 2004
Full work available at URL: http://link.springer.de/link/service/series/0558/bibs/2308/23080249.htm
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items (5)
Reinforcement learning for a biped robot based on a CPG-actor-critic method
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning
Dynamic portfolio choice: a simulation-and-regression approach
Hybrid least-squares algorithms for approximate policy evaluation
Bayesian Exploration for Approximate Dynamic Programming