New algorithms of the Q-learning type

From MaRDI portal

Revision as of 22:44, 2 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:2440701

Jump to:navigation, search

DOI10.1016/j.automatica.2007.09.009zbMath1283.93328OpenAlexW2118458590MaRDI QIDQ2440701

Shalabh Bhatnagar, K. Mohan Babu

Publication date: 19 March 2014

Published in: Automatica (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1016/j.automatica.2007.09.009

zbMATH Keywords

Markov decision processes reinforcement learning Q-learning SPSA two-timescale stochastic approximation

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Stochastic learning and adaptive control (93E35)

Related Items (3)

A constrained optimization perspective on actor-critic algorithms and application to network routing ⋮ Multiscale Q-learning with linear function approximation ⋮ Approximate stochastic annealing for online control of infinite horizon Markov decision processes

Cites Work

This page was built for publication: New algorithms of the Q-learning type

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2440701&oldid=15110641"