Non-deterministic policies in Markovian decision processes
DOI: 10.1613/JAIR.3175; zbMATH Open: 1210.68080; arXiv: 1401.3871; OpenAlex: W2159272820; Wikidata: Q113424367; Scholia: Q113424367; MaRDI QID: Q3068940; FDO: Q3068940
Authors: Mahdi Milani Fard, Joelle Pineau
Publication date: 21 January 2011
Published in: Journal of Artificial Intelligence Research
Full work available at URL: https://arxiv.org/abs/1401.3871
Mathematics Subject Classification: Learning and adaptive systems in artificial intelligence (68T05); Management decision making, including multiple objectives (90B50); Markov and semi-Markov decision processes (90C40)
Cited In (6)
- Title not available
- Title not available
- Nonmyopic Strategic Behavior in the MDP Planning Procedure
- Delayed Nondeterminism in Continuous-Time Markov Decision Processes
- Non-randomized strategies in stochastic decision processes
- Optimal Treatment Regimes: A Review and Empirical Comparison