scientific article
From MaRDI portal
Publication:3148830
zbMath0992.68097MaRDI QIDQ3148830
Publication date: 22 September 2002
Full work available at URL: http://link.springer.de/link/service/series/0558/bibs/2111/21110589
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items (3)
Model-free reinforcement learning for branching Markov decision processes ⋮ Approximate Q Learning for Controlled Diffusion Processes and Its Near Optimality ⋮ A concentration bound for contractive stochastic approximation
This page was built for publication: