Sleeping experts and bandits approach to constrained Markov decision processes

From MaRDI portal
Publication:901196

DOI10.1016/j.automatica.2015.10.015zbMath1329.93154arXiv1412.4898OpenAlexW2132036095MaRDI QIDQ901196

Hyeong Soo Chang

Publication date: 23 December 2015

Published in: Automatica (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1412.4898





Cites Work