Reinforcement learning algorithms with function approximation: recent advances and applications
From MaRDI portal
Recommendations
- A tutorial on linear function approximators for dynamic programming and reinforcement learning
- scientific article; zbMATH DE number 1950579
- Basis function adaptation in temporal difference reinforcement learning
- Q-Learning with Linear Function Approximation
- Reinforcement learning: a tutorial survey and recent advances
Cites work
- scientific article; zbMATH DE number 5957269 (Why is no real title available?)
- scientific article; zbMATH DE number 5957492 (Why is no real title available?)
- scientific article; zbMATH DE number 5957504 (Why is no real title available?)
- scientific article; zbMATH DE number 5348356 (Why is no real title available?)
- scientific article; zbMATH DE number 1321699 (Why is no real title available?)
- scientific article; zbMATH DE number 1332320 (Why is no real title available?)
- scientific article; zbMATH DE number 1560499 (Why is no real title available?)
- scientific article; zbMATH DE number 1753141 (Why is no real title available?)
- scientific article; zbMATH DE number 1753152 (Why is no real title available?)
- scientific article; zbMATH DE number 1881082 (Why is no real title available?)
- 10.1162/153244303768966085
- 10.1162/1532443041827907
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- Adaptive stepsizes for recursive estimation with applications in approximate dynamic programming
- Adaptive stock trading with dynamic asset allocation using reinforcement learning
- Adaptive-critic-based neural networks for aircraft optimal control
- Algorithms for reinforcement learning.
- An analysis of temporal-difference learning with function approximation
- An upper bound on the loss from approximate optimal-value functions
- Approximate Dynamic Programming
- Asynchronous stochastic approximation and Q-learning
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Convergence results for single-step on-policy reinforcement-learning algorithms
- Elevator group control using multiple reinforcement learning agents
- Graph kernels and Gaussian processes for relational reinforcement learning
- Hybrid least-squares algorithms for approximate policy evaluation
- Instrumental variable methods for system identification
- Integrating guidance into relational reinforcement learning
- Kernel-based reinforcement learning
- Least squares policy evaluation algorithms with linear function approximation
- Linear least-squares algorithms for temporal difference learning
- Markov decision processes with their applications
- Model selection in reinforcement learning
- Model-free \(Q\)-learning designs for linear discrete-time zero-sum games with application to \(H^\infty\) control
- Multi-player non-zero-sum games: online adaptive learning solution of coupled Hamilton-Jacobi equations
- Natural actor-critic algorithms
- Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
- On graph kernels: hardness results and efficient alternatives.
- On the Convergence of Stochastic Iterative Dynamic Programming Algorithms
- OnActor-Critic Algorithms
- Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
- Policy search for motor primitives in robotics
- Punish/Reward: Learning with a Critic in Adaptive Threshold Systems
- Recent advances in hierarchical reinforcement learning
- Reinforcement learning for long-run average cost.
- Robot learning with GA-based fuzzy reinforcement learning agents
- Stochastic approximation with two time scales
- Technical update: Least-squares temporal difference learning
- The convergence of \(TD(\lambda)\) for general \(\lambda\)
- \({\mathcal Q}\)-learning
Cited in
(25)- Embedded adaptive fuzzy controller based on reinforcement learning for DC motor with flexible shaft
- Non-parametric value function approximation in robotics
- A lexicographic approach to constrained MDP admission control
- Restricted gradient-descent algorithm for value-function approximation in reinforcement learning
- Basis function adaptation in temporal difference reinforcement learning
- Editorial: some recent advances in learning and adaptation for uncertain feedback control systems
- An integrated data-driven Markov parameters sequence identification and adaptive dynamic programming method to design fault-tolerant optimal tracking control for completely unknown model systems
- Controlling estimation error in reinforcement learning via reinforced operation
- Improved SARSA and DQN algorithms for reinforcement learning
- An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method
- Error controlled actor-critic
- A systematic study on meta-heuristic approaches for solving the graph coloring problem
- Advances in Neural Networks – ISNN 2005
- Dual heuristic programming with just‐in‐time modeling for self‐learning fault‐tolerant control of mobile robots
- A Q-learning approach for investment decisions
- A unified DC programming framework and efficient DCA based approaches for large scale batch reinforcement learning
- Neural circuits for learning context-dependent associations of stimuli
- Optimal scheduling for data transmission between mobile devices and cloud
- Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming
- A reinforcement learning approach to distribution-free capacity allocation for sea cargo revenue management
- Two-phase iteration for value function approximation and hyperparameter optimization in Gaussian-kernel-based adaptive critic design
- Optimal distributed synchronization control for continuous-time heterogeneous multi-agent differential graphical games
- Reinforcement learning endowed with safe veto policies to learn the control of linked-multicomponent robotic systems
- Data-based robust optimal control of continuous-time affine nonlinear systems with matched uncertainties
- An approximate dynamic programming approach to resource management in multi-cloud scenarios
This page was built for publication: Reinforcement learning algorithms with function approximation: recent advances and applications
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q903601)