Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes
From MaRDI portal
Publication:5092299
DOI10.1109/TAC.2021.3108121OpenAlexW3198564127WikidataQ114147847 ScholiaQ114147847MaRDI QIDQ5092299FDOQ5092299
Authors: Arghyadip Roy, Vivek Borkar, Abhay Karandikar, Prasanna Chaporkar
Publication date: 28 July 2022
Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1912.10325
Cited In (3)
This page was built for publication: Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5092299)