Online regret bounds for Markov decision processes with deterministic transitions

Publication:982638

DOI: 10.1016/j.tcs.2010.04.005; zbMath: 1198.90388; OpenAlex: W2150011303; Wikidata: Q29307615; Scholia: Q29307615; MaRDI QID: Q982638

Ronald Ortner

Publication date: 7 July 2010

Published in: Theoretical Computer Science

Full work available at URL: https://doi.org/10.1016/j.tcs.2010.04.005





