scientific article; zbMATH DE number 6542806
From MaRDI portal
Publication:5744816
zbMath1351.90162MaRDI QIDQ5744816
Matthieu Geist, Boris Lesner, Bruno Scherrer, Mohammad Ghavamzadeh, Victor Gabillon
Publication date: 19 February 2016
Full work available at URL: http://jmlr.csail.mit.edu/papers/v16/scherrer15a.html
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Learning and adaptive systems in artificial intelligence (68T05) Dynamic programming (90C39) Markov and semi-Markov decision processes (90C40) Experimental studies (91A90)
Related Items (6)
Unnamed Item ⋮ How fast can we play Tetris greedily with rectangular pieces? ⋮ Simulation-based search ⋮ Refinement of the four-dimensional GLV method on elliptic curves ⋮ Proximal algorithms and temporal difference methods for solving fixed point problems ⋮ Multiply Accelerated Value Iteration for NonSymmetric Affine Fixed Point Problems and Application to Markov Decision Processes
Uses Software
This page was built for publication: