An online actor-critic algorithm with function approximation for constrained Markov decision processes
DOI: 10.1007/s10957-012-9989-5 · zbMATH Open: 1262.90189 · OpenAlex: W2073314543 · MaRDI QID: Q438776
Authors: Shalabh Bhatnagar, K. Lakshmanan
Publication date: 31 July 2012
Published in: Journal of Optimization Theory and Applications
Full work available at URL: https://doi.org/10.1007/s10957-012-9989-5
Recommendations
- An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
- An actor-critic algorithm for constrained Markov decision processes
- A constrained optimization perspective on actor-critic algorithms and application to network routing
- Actor-critic algorithms with online feature adaptation
- Learning algorithms for finite horizon constrained Markov decision processes
Keywords: function approximation; actor-critic algorithm; constrained Markov decision process; long-run average cost criterion
MSC: Markov chains (discrete-time Markov processes on discrete state spaces) (60J10); Markov and semi-Markov decision processes (90C40)
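The keywords above describe a constrained MDP solved by an actor-critic method with function approximation under the long-run average cost criterion. As a rough illustration of what such a method looks like (this is a generic Lagrangian actor-critic sketch on a toy problem, not the algorithm from the paper; the feature dimensions, step-size schedules, and cost matrices are all illustrative assumptions):

```python
import numpy as np

# Hedged sketch: a Lagrangian actor-critic step with linear function
# approximation on a toy 2-state MDP with uniform random transitions.
# Illustrative only -- not the paper's algorithm or its convergence setup.

rng = np.random.default_rng(0)
n_states, n_actions, n_features = 2, 2, 3

phi = rng.standard_normal((n_states, n_features))  # critic features: V(s) ~ phi[s] @ v
theta = np.zeros((n_states, n_actions))            # actor: softmax policy parameters
v = np.zeros(n_features)                           # critic weights
rho = 0.0                                          # running average-cost estimate
lam = 0.0                                          # Lagrange multiplier for the constraint

cost = rng.uniform(0, 1, (n_states, n_actions))             # single-stage cost c(s, a)
constraint_cost = rng.uniform(0, 1, (n_states, n_actions))  # g(s, a), want long-run avg <= alpha
alpha = 0.5

def policy(s):
    p = np.exp(theta[s] - theta[s].max())
    return p / p.sum()

s = 0
for t in range(1, 5001):
    p = policy(s)
    a = rng.choice(n_actions, p=p)
    s_next = rng.integers(n_states)  # toy dynamics: uniform transitions

    # Lagrangian single-stage cost: c(s,a) + lam * (g(s,a) - alpha)
    c = cost[s, a] + lam * (constraint_cost[s, a] - alpha)

    # TD error for the average-cost criterion
    delta = c - rho + phi[s_next] @ v - phi[s] @ v

    # multi-timescale step sizes: critic fastest, actor slower, multiplier slowest
    a_t, b_t, c_t = 1.0 / t**0.55, 1.0 / t**0.8, 1.0 / t

    rho += a_t * (c - rho)           # average-cost estimate
    v += a_t * delta * phi[s]        # critic update
    grad_log = -p
    grad_log[a] += 1.0               # grad of log softmax policy at (s, a)
    theta[s] -= b_t * delta * grad_log  # actor: descent, since we minimize cost
    lam = max(0.0, lam + c_t * (constraint_cost[s, a] - alpha))  # projected ascent

    s = s_next
```

The three step-size schedules sketch the usual multi-timescale structure of such algorithms: the critic tracks the current policy, the actor moves more slowly, and the Lagrange multiplier adapts on the slowest timescale.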
Cites Work
- Title not available
- Title not available
- Title not available
- Perturbation theory and finite Markov chains
- Natural actor-critic algorithms
- Multivariate stochastic approximation using a simultaneous perturbation gradient approximation
- Title not available
- On Actor-Critic Algorithms
- Simulation-based optimization of Markov reward processes
- Title not available
- Average cost temporal-difference learning
- Asynchronous Stochastic Approximations
- The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning
- An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
- An actor-critic algorithm for constrained Markov decision processes
- Optimal flow control of a class of queueing networks in equilibrium
- The Borkar-Meyn theorem for asynchronous stochastic approximations
Cited In (12)
- Queueing Network Controls via Deep Reinforcement Learning
- Event-based optimization approach for solving stochastic decision problems with probabilistic constraint
- Variance-constrained actor-critic algorithms for discounted and average reward MDPs
- On the sample complexity of actor-critic method for reinforcement learning with function approximation
- Risk-Constrained Reinforcement Learning with Percentile Risk Criteria
- Optimal deterministic controller synthesis from steady-state distributions
- Multiscale Q-learning with linear function approximation
- An actor-critic algorithm for constrained Markov decision processes
- Learning algorithms for finite horizon constrained Markov decision processes
- An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
- Suboptimal control for nonlinear systems with disturbance via integral sliding mode control and policy iteration
- An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions