Reinforcement learning algorithms with function approximation: recent advances and applications (Q903601): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
Normalize DOI.
 
(One intermediate revision by one other user not shown)
Property / DOI
 
Property / DOI: 10.1016/j.ins.2013.08.037 / rank
Normal rank
 
Property / cites work
 
Property / cites work: Model-free \(Q\)-learning designs for linear discrete-time zero-sum games with application to \(H^\infty\) control / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/153244303768966085 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive-critic-based neural networks for aircraft optimal control / rank
 
Normal rank
Property / cites work
 
Property / cites work: Recent advances in hierarchical reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4533362 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4257216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Natural actor-critic algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximation with two time scales / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3527701 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Technical update: Least-squares temporal difference learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5477859 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Elevator group control using multiple reinforcement learning agents / rank
 
Normal rank
Property / cites work
 
Property / cites work: The convergence of \(TD(\lambda)\) for general \(\lambda\) / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4527272 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Integrating guidance into relational reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4797054 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3093261 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Model selection in reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Graph kernels and Gaussian processes for relational reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning Theory and Kernel Machines / rank
 
Normal rank
Property / cites work
 
Property / cites work: Reinforcement learning for long-run average cost. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive stepsizes for recursive estimation with applications in approximate dynamic programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3174169 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov decision processes with their applications / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Convergence of Stochastic Iterative Dynamic Programming Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Hybrid least-squares algorithms for approximate policy evaluation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Policy search for motor primitives in robotics / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Actor-Critic Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/1532443041827907 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3174155 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Least squares policy evaluation algorithms with linear function approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive stock trading with dynamic asset allocation using reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Kernel-based reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergence results for single-step on-policy reinforcement-learning algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: An upper bound on the loss from approximate optimal-value functions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Instrumental variable methods for system identification / rank
 
Normal rank
Property / cites work
 
Property / cites work: Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Algorithms for Reinforcement Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asynchronous stochastic approximation and Q-learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: An analysis of temporal-difference learning with function approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multi-player non-zero-sum games: online adaptive learning solution of coupled Hamilton-Jacobi equations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4261789 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive optimal control for continuous-time linear systems based on policy iteration / rank
 
Normal rank
Property / cites work
 
Property / cites work: Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: \({\mathcal Q}\)-learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Punish/Reward: Learning with a Critic in Adaptive Threshold Systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4533350 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Robot learning with GA-based fuzzy reinforcement learning agents / rank
 
Normal rank
Property / DOI
 
Property / DOI: 10.1016/J.INS.2013.08.037 / rank
 
Normal rank

Latest revision as of 07:57, 10 December 2024

scientific article
Language Label Description Also known as
English
Reinforcement learning algorithms with function approximation: recent advances and applications
scientific article

    Statements

    Reinforcement learning algorithms with function approximation: recent advances and applications (English)
    14 January 2016
    reinforcement learning
    function approximation
    approximate dynamic programming
    learning control
    generalization

    Identifiers