Multiscale Q-learning with linear function approximation (Q312650): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: Reinforcement learning based algorithms for average cost Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning Algorithms for Markov Decision Processes with Average Cost / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3324260 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A simple dynamic routing problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Some Pathological Traps for Stochastic Approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic Approximations and Differential Inclusions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic Approximations and Differential Inclusions, Part II: Applications / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3376698 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4257216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: New algorithms of the Q-learning type / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multiscale Stochastic Approximation for Parametric Optimization of Hidden Markov Models / rank
 
Normal rank
Property / cites work
 
Property / cites work: Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Simultaneous Perturbation Stochastic Approximation-Based Actor–Critic Algorithm for Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive multivariate three-timescale stochastic approximation algorithms for simulation based optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive Newton-based multivariate smoothed functional algorithms for simulation optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic recursive algorithms for optimization. Simultaneous perturbation methods / rank
 
Normal rank
Property / cites work
 
Property / cites work: Natural actor-critic algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: An online actor-critic algorithm with function approximation for constrained Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4858374 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximation with two time scales / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3527701 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Recursive Stochastic Algorithms for Global Optimization in $\mathbb{R}^d $ / rank
 
Normal rank
Property / cites work
 
Property / cites work: Actor-Critic--Type Learning Algorithms for Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: OnActor-Critic Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximation methods for constrained and unconstrained systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4346705 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q-Learning with Linear Function Approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Nonconvergence to unstable points in urn models and stochastic approximations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Perturbation theory and finite Markov chains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Average cost temporal-difference learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multivariate stochastic approximation using a simultaneous perturbation gradient approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: A one-measurement form of simultaneous perturbation stochastic approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asynchronous stochastic approximation and Q-learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: An analysis of temporal-difference learning with function approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4714399 / rank
 
Normal rank
Property / cites work
 
Property / cites work: \({\mathcal Q}\)-learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the optimal assignment of customers to parallel servers / rank
 
Normal rank

Latest revision as of 13:49, 12 July 2024

scientific article
Language Label Description Also known as
English
Multiscale Q-learning with linear function approximation
scientific article

    Statements

    Multiscale Q-learning with linear function approximation (English)
    0 references
    0 references
    0 references
    16 September 2016
    0 references
    Q-learning with linear function approximation
    0 references
    reinforcement learning
    0 references
    stochastic approximation
    0 references
    ordinary differential equation
    0 references
    differential inclusion
    0 references
    multi-stage stochastic shortest path problem
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references