Real-time reinforcement learning by sequential actor-critics and experience replay
From MaRDI portal
Publication:1784532
DOI10.1016/J.NEUNET.2009.05.011zbMATH Open1396.68107OpenAlexW2089434629WikidataQ51616641 ScholiaQ51616641MaRDI QIDQ1784532FDOQ1784532
Authors: Paweł Wawrzyński
Publication date: 27 September 2018
Published in: Neural Networks (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.neunet.2009.05.011
Recommendations
Learning and adaptive systems in artificial intelligence (68T05) Adaptive control/observation systems (93C40)
Cites Work
- \({\mathcal Q}\)-learning
- Title not available (Why is that?)
- Natural actor-critic algorithms
- Least squares policy evaluation algorithms with linear function approximation
- OnActor-Critic Algorithms
- An analysis of temporal-difference learning with function approximation
- Title not available (Why is that?)
- Title not available (Why is that?)
- AN ANALYSIS OF EXPERIENCE REPLAY IN TEMPORAL DIFFERENCE LEARNING
- Efficient Dynamic Computer Simulation of Robotic Mechanisms
Cited In (13)
- Reward-weighted regression with sample reuse for direct policy search in reinforcement learning
- Deep reinforcement learning via good choice resampling experience replay memory
- TD-regularized actor-critic methods
- Safe adaptive output-feedback optimal control of a class of linear systems
- A Small Gain Analysis of Single Timescale Actor Critic
- Autonomous reinforcement learning with experience replay
- Recursive estimation in piecewise affine systems using parameter identifiers and concurrent learning
- Experience selection in deep reinforcement learning for control
- Artificial Intelligence and Soft Computing - ICAISC 2004
- Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems
- Efficient sample reuse in policy gradients with parameter-based exploration
- A Lie group PMP approach for optimal stabilization and tracking control of autonomous underwater vehicles
- Actor prioritized experience replay
This page was built for publication: Real-time reinforcement learning by sequential actor-critics and experience replay
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1784532)