Convergent multiple-timescales reinforcement learning algorithms in normal form games (Q1429103): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
Normalize DOI.
 
(One intermediate revision by one other user not shown)
Property / DOI
 
Property / DOI: 10.1214/aoap/1069786497 / rank
Normal rank
 
Property / cites work
 
Property / cites work: Q4938927 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Mixed equilibria and dynamical systems arising from fictitious play in perturbed games / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4257216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximation with two time scales / rank
 
Normal rank
Property / cites work
 
Property / cites work: REINFORCEMENT LEARNING IN MARKOVIAN EVOLUTIONARY GAMES / rank
 
Normal rank
Property / cites work
 
Property / cites work: The allocation of offensive and defensive resources in a territorial game / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning mixed equilibria / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4223194 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Games with randomly disturbed payoffs: a new rationale for mixed-strategy equilibrium points / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning in perturbed asymmetric games / rank
 
Normal rank
Property / cites work
 
Property / cites work: A note on best response dynamics. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4847945 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Three problems in learning mixed-strategy Nash equilibria / rank
 
Normal rank
Property / cites work
 
Property / cites work: Actor-Critic--Type Learning Algorithms for Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximation methods for constrained and unconstrained systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4739314 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Non-cooperative games / rank
 
Normal rank
Property / cites work
 
Property / cites work: Nonconvergence to unstable points in urn models and stochastic approximations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5332984 / rank
 
Normal rank
Property / DOI
 
Property / DOI: 10.1214/AOAP/1069786497 / rank
 
Normal rank

Latest revision as of 20:14, 10 December 2024

scientific article
Language Label Description Also known as
English
Convergent multiple-timescales reinforcement learning algorithms in normal form games
scientific article

    Statements

    Convergent multiple-timescales reinforcement learning algorithms in normal form games (English)
    0 references
    0 references
    0 references
    30 March 2004
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references