
Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes

From MaRDI portal
Publication:5092299

DOI: 10.1109/TAC.2021.3108121; OpenAlex: W3198564127; Wikidata: Q114147847; Scholia: Q114147847; MaRDI QID: Q5092299; FDO: Q5092299


Authors: Arghyadip Roy, Vivek Borkar, Abhay Karandikar, Prasanna Chaporkar


Publication date: 28 July 2022

Published in: IEEE Transactions on Automatic Control

Full work available at URL: https://arxiv.org/abs/1912.10325





Mathematics Subject Classification ID

Systems theory; control (93-XX)



Cited In (3)

  • A Small Gain Analysis of Single Timescale Actor Critic
  • Online Bootstrap Inference For Policy Evaluation In Reinforcement Learning
  • An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions







Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:5092299&oldid=19599566"
This page was last edited on 8 February 2024, at 12:47.