scientific article
Publication:2810787
zbMath: 1360.90280; MaRDI QID: Q2810787
Alessandro Lazaric, Rémi Munos, Mohammad Ghavamzadeh
Publication date: 6 June 2016
Full work available at URL: http://jmlr.csail.mit.edu/papers/v17/10-364.html
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
reinforcement learning, policy iteration, finite-sample analysis, classification-based approach to policy iteration
Classification and discrimination; cluster analysis (statistical aspects) (62H30) ⋮ Markov and semi-Markov decision processes (90C40)
Related Items (3)
Unnamed Item ⋮ Unnamed Item ⋮ Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm