Reinforcement Learning in Robust Markov Decision Processes

From MaRDI portal

Publication:2833106

Jump to:navigation, search

DOI10.1287/moor.2016.0779zbMath1348.68197OpenAlexW2522885171MaRDI QIDQ2833106

Shiau Hong Lim, Shie Mannor, Huan Xu

Publication date: 16 November 2016

Published in: Mathematics of Operations Research (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1287/moor.2016.0779

zbMATH Keywords

reinforcement learning robust MDP

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Markov and semi-Markov decision processes (90C40)

Related Items

Lipschitzness is all you need to tame off-policy generative adversarial imitation learning, A survey of decision making and optimization under uncertainty, Continuous-Time Robust Dynamic Programming

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2833106&oldid=15756933"