Reinforcement Learning in Robust Markov Decision Processes
From MaRDI portal
Publication:2833106
DOI: 10.1287/moor.2016.0779
zbMath: 1348.68197
OpenAlex: W2522885171
MaRDI QID: Q2833106
Shiau Hong Lim, Shie Mannor, Huan Xu
Publication date: 16 November 2016
Published in: Mathematics of Operations Research
Full work available at URL: https://doi.org/10.1287/moor.2016.0779
Learning and adaptive systems in artificial intelligence (68T05); Markov and semi-Markov decision processes (90C40)
Related Items
- Lipschitzness is all you need to tame off-policy generative adversarial imitation learning
- A survey of decision making and optimization under uncertainty
- Continuous-Time Robust Dynamic Programming
Cites Work
- Distributionally Robust Markov Decision Processes
- 10.1162/153244303765208377
- Bias and Variance Approximation in Value Function Estimates
- Online Markov Decision Processes
- Markov Decision Processes with Arbitrary Reward Processes
- 10.1162/1532443041827880
- Robust Control of Markov Decision Processes with Uncertain Transition Matrices
- Bounded Parameter Markov Decision Processes with Average Reward Criterion
- Robust Dynamic Programming