Convergent multiple-timescales reinforcement learning algorithms in normal form games (Q1429103): Difference between revisions

@@ Property / DOI @@
-10.1214/aoap/1069786497
@@ Property / DOI: 10.1214/aoap/1069786497 / rank @@
-Normal rank
@@ Property / cites work @@
+Q4938927
@@ Property / cites work: Q4938927 / rank @@
+Normal rank
@@ Property / cites work @@
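+Mixed equilibria and dynamical systems arising from fictitious play in perturbed games
@@ Property / cites work: Mixed equilibria and dynamical systems arising from fictitious play in perturbed games / rank @@
+Normal rank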
@@ Property / cites work @@
+Q4257216
@@ Property / cites work: Q4257216 / rank @@
+Normal rank
@@ Property / cites work @@
+Stochastic approximation with two time scales
@@ Property / cites work: Stochastic approximation with two time scales / rank @@
+Normal rank
@@ Property / cites work @@
+REINFORCEMENT LEARNING IN MARKOVIAN EVOLUTIONARY GAMES
@@ Property / cites work: REINFORCEMENT LEARNING IN MARKOVIAN EVOLUTIONARY GAMES / rank @@
+Normal rank
@@ Property / cites work @@
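+The allocation of offensive and defensive resources in a territorial game
@@ Property / cites work: The allocation of offensive and defensive resources in a territorial game / rank @@
+Normal rank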
@@ Property / cites work @@
+Learning mixed equilibria
@@ Property / cites work: Learning mixed equilibria / rank @@
+Normal rank
@@ Property / cites work @@
+Q4223194
@@ Property / cites work: Q4223194 / rank @@
+Normal rank
@@ Property / cites work @@
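+Games with randomly disturbed payoffs: a new rationale for mixed-strategy equilibrium points
@@ Property / cites work: Games with randomly disturbed payoffs: a new rationale for mixed-strategy equilibrium points / rank @@
+Normal rank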
@@ Property / cites work @@
+Learning in perturbed asymmetric games
@@ Property / cites work: Learning in perturbed asymmetric games / rank @@
+Normal rank
@@ Property / cites work @@
+A note on best response dynamics.
@@ Property / cites work: A note on best response dynamics. / rank @@
+Normal rank
@@ Property / cites work @@
+Q4847945
@@ Property / cites work: Q4847945 / rank @@
+Normal rank
@@ Property / cites work @@
+Three problems in learning mixed-strategy Nash equilibria
@@ Property / cites work: Three problems in learning mixed-strategy Nash equilibria / rank @@
+Normal rank
@@ Property / cites work @@
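+Actor-Critic--Type Learning Algorithms for Markov Decision Processes
@@ Property / cites work: Actor-Critic--Type Learning Algorithms for Markov Decision Processes / rank @@
+Normal rank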
@@ Property / cites work @@
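+Stochastic approximation methods for constrained and unconstrained systems
@@ Property / cites work: Stochastic approximation methods for constrained and unconstrained systems / rank @@
+Normal rank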
@@ Property / cites work @@
+Q4739314
@@ Property / cites work: Q4739314 / rank @@
+Normal rank
@@ Property / cites work @@
+Non-cooperative games
@@ Property / cites work: Non-cooperative games / rank @@
+Normal rank
@@ Property / cites work @@
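+Nonconvergence to unstable points in urn models and stochastic approximations
@@ Property / cites work: Nonconvergence to unstable points in urn models and stochastic approximations / rank @@
+Normal rank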
@@ Property / cites work @@
+Q5332984
@@ Property / cites work: Q5332984 / rank @@
+Normal rank
@@ Property / DOI @@
+10.1214/AOAP/1069786497
@@ Property / DOI: 10.1214/AOAP/1069786497 / rank @@
+Normal rank