Reinforcement learning algorithms with function approximation: recent advances and applications
DOI10.1016/j.ins.2013.08.037zbMath1328.68176OpenAlexW2113921460MaRDI QIDQ903601
Lei Zuo, Xin Xu, Zhen-Hua Huang
Publication date: 14 January 2016
Published in: Information Sciences (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.ins.2013.08.037
reinforcement learninggeneralizationlearning controlfunction approximationapproximate dynamic programming
Learning and adaptive systems in artificial intelligence (68T05) Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20)
Related Items (18)
Uses Software
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
- Policy search for motor primitives in robotics
- Model selection in reinforcement learning
- Multi-player non-zero-sum games: online adaptive learning solution of coupled Hamilton-Jacobi equations
- Integrating guidance into relational reinforcement learning
- Adaptive stepsizes for recursive estimation with applications in approximate dynamic programming
- Model-free \(Q\)-learning designs for linear discrete-time zero-sum games with application to \(H^\infty\) control
- Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- Natural actor-critic algorithms
- Instrumental variable methods for system identification
- Elevator group control using multiple reinforcement learning agents
- Asynchronous stochastic approximation and Q-learning
- An upper bound on the loss from approximate optimal-value functions
- Stochastic approximation with two time scales
- Robot learning with GA-based fuzzy reinforcement learning agents
- Reinforcement learning for long-run average cost.
- Convergence results for single-step on-policy reinforcement-learning algorithms
- Kernel-based reinforcement learning
- Technical update: Least-squares temporal difference learning
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- \({\mathcal Q}\)-learning
- The convergence of \(TD(\lambda)\) for general \(\lambda\)
- Least squares policy evaluation algorithms with linear function approximation
- Hybrid least-squares algorithms for approximate policy evaluation
- Graph kernels and Gaussian processes for relational reinforcement learning
- Markov decision processes with their applications
- Adaptive stock trading with dynamic asset allocation using reinforcement learning
- 10.1162/153244303768966085
- Algorithms for Reinforcement Learning
- On the Convergence of Stochastic Iterative Dynamic Programming Algorithms
- An analysis of temporal-difference learning with function approximation
- OnActor-Critic Algorithms
- Punish/Reward: Learning with a Critic in Adaptive Threshold Systems
- 10.1162/1532443041827907
- Adaptive-critic-based neural networks for aircraft optimal control
- Learning Theory and Kernel Machines
- Approximate Dynamic Programming
- Recent advances in hierarchical reinforcement learning
This page was built for publication: Reinforcement learning algorithms with function approximation: recent advances and applications