Learning Policies for Markov Decision Processes From Data
DOI10.1109/TAC.2018.2866455zbMATH Open1482.93721arXiv1701.05954OpenAlexW2581566809WikidataQ129352002 ScholiaQ129352002MaRDI QIDQ5223736FDOQ5223736
Authors: Manjesh Kumar Hanawal, Hao Liu, Henghui Zhu, Ioannis Ch. Paschalidis
Publication date: 18 July 2019
Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1701.05954
Markov processes: estimation; hidden Markov models (62M05) Markov and semi-Markov decision processes (90C40) Stochastic learning and adaptive control (93E35)
Cited In (6)
- \(L^\ast\)-based learning of Markov decision processes (extended version)
- Learning algorithms for Markov decision processes
- Learning Variable-Length Markov Models of Behavior
- Bagging strategies for learning planning policies
- On a probabilistic approach to synthesize control policies from example datasets
- Learning parametric policies and transition probability models of Markov decision processes from data
This page was built for publication: Learning Policies for Markov Decision Processes From Data
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5223736)