Q5168859 (Q5168859): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Changed an Item
ReferenceBot (talk | contribs)
Changed an Item
Property / cites work
 
Property / cites work: Viability theory / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Theory of Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Dynamical System Approach to Stochastic Approximations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Mixed equilibria and dynamical systems arising from fictitious play in perturbed games / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic Approximations and Differential Inclusions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic Approximations and Differential Inclusions, Part II: Applications / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4269108 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximation with two time scales / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asynchronous Stochastic Approximations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximation with `controlled Markov' noise / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximation. A dynamical systems viewpoint. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3997244 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stabilization of stochastic approximation by step size adaptation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Actor-Critic--Type Learning Algorithms for Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: OnActor-Critic Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximation methods for constrained and unconstrained systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotic Properties of Distributed and Communicating Stochastic Approximation Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximation algorithms for parallel and distributed processing / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergent multiple-timescales reinforcement learning algorithms in normal form games / rank
 
Normal rank
Property / cites work
 
Property / cites work: Analysis of recursive stochastic algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximations for finite-state Markov chains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergence results for single-step on-policy reinforcement-learning algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asynchronous stochastic approximation and Q-learning / rank
 
Normal rank

Revision as of 18:21, 8 July 2024

scientific article; zbMATH DE number 6318809
Language Label Description Also known as
English
No label defined
scientific article; zbMATH DE number 6318809

    Statements

    0 references
    0 references
    21 July 2014
    0 references
    asynchronous stochastic approximation
    0 references
    set-valued mean field
    0 references
    differential inclusion
    0 references
    two-timescales
    0 references
    0 references
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references