On functional equations for Kth best policies in Markov decision processes
From MaRDI portal
Publication:1939643
DOI10.1016/J.AUTOMATICA.2012.09.016zbMATH Open1257.93111OpenAlexW2156658315MaRDI QIDQ1939643FDOQ1939643
Authors: Hyeong Soo Chang
Publication date: 4 March 2013
Published in: Automatica (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.automatica.2012.09.016
Recommendations
Cited In (5)
- A functional equation arising in multi-agent statistical decision theory
- Relevant states and memory in Markov chain bootstrapping and simulation
- Finding Best k Policies
- Ordinary differential equation methods for Markov decision processes and application to Kullback-Leibler control cost
- Neuro-optimal tracking control for a class of discrete-time nonlinear systems via generalized value iteration adaptive dynamic programming approach
This page was built for publication: On functional equations for \(K\)th best policies in Markov decision processes
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1939643)