Policy Iteration Based on Stochastic Factorization
From MaRDI portal
Publication:2878742
DOI10.1613/jair.4301zbMath1366.90211OpenAlexW223326216WikidataQ113424363 ScholiaQ113424363MaRDI QIDQ2878742
Doina Precup, André M. S. Barreto, Joelle Pineau
Publication date: 5 September 2014
Published in: Journal of Artificial Intelligence Research (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1613/jair.4301
Approximation methods and heuristics in mathematical programming (90C59) Dynamic programming (90C39) Markov and semi-Markov decision processes (90C40)
Related Items (2)
An incremental off-policy search in a model-free Markov decision process using a single sample path ⋮ A numerical study of Markov decision process algorithms for multi-component replacement problems
This page was built for publication: Policy Iteration Based on Stochastic Factorization