scientific article; zbMATH DE number 2087264
From MaRDI portal
Publication:4737965
zbMath1065.68608MaRDI QIDQ4737965
Ronald Parr, Michail G. Lagoudakis, Michael L. Littman
Publication date: 11 August 2004
Full work available at URL: http://link.springer.de/link/service/series/0558/bibs/2308/23080249.htm
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items
Reinforcement learning for a biped robot based on a CPG-actor-critic method, Restricted gradient-descent algorithm for value-function approximation in reinforcement learning, Dynamic portfolio choice: a simulation-and-regression approach, Hybrid least-squares algorithms for approximate policy evaluation, Bayesian Exploration for Approximate Dynamic Programming