Convergent multiple-timescales reinforcement learning algorithms in normal form games (Q1429103): Difference between revisions

@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / OpenAlex ID @@
+W2067018002
@@ Property / OpenAlex ID: W2067018002 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4938927
@@ Property / cites work: Q4938927 / rank @@
+Normal rank
@@ Property / cites work @@
+Mixed equilibria and dynamical systems arising from fictitious play in perturbed games
+Normal rank
@@ Property / cites work @@
+Q4257216
@@ Property / cites work: Q4257216 / rank @@
+Normal rank
@@ Property / cites work @@
+Stochastic approximation with two time scales
@@ Property / cites work: Stochastic approximation with two time scales / rank @@
+Normal rank
@@ Property / cites work @@
+REINFORCEMENT LEARNING IN MARKOVIAN EVOLUTIONARY GAMES
+Normal rank
@@ Property / cites work @@
+The allocation of offensive and defensive resources in a territorial game
+Normal rank
@@ Property / cites work @@
+Learning mixed equilibria
@@ Property / cites work: Learning mixed equilibria / rank @@
+Normal rank
@@ Property / cites work @@
+Q4223194
@@ Property / cites work: Q4223194 / rank @@
+Normal rank
@@ Property / cites work @@
+Games with randomly disturbed payoffs: a new rationale for mixed-strategy equilibrium points
+Normal rank
@@ Property / cites work @@
+Learning in perturbed asymmetric games
@@ Property / cites work: Learning in perturbed asymmetric games / rank @@
+Normal rank
@@ Property / cites work @@
+A note on best response dynamics.
@@ Property / cites work: A note on best response dynamics. / rank @@
+Normal rank
@@ Property / cites work @@
+Q4847945
@@ Property / cites work: Q4847945 / rank @@
+Normal rank
@@ Property / cites work @@
+Three problems in learning mixed-strategy Nash equilibria
+Normal rank
@@ Property / cites work @@
+Actor-Critic–Type Learning Algorithms for Markov Decision Processes
+Normal rank
@@ Property / cites work @@
+Stochastic approximation methods for constrained and unconstrained systems
+Normal rank
@@ Property / cites work @@
+Q4739314
@@ Property / cites work: Q4739314 / rank @@
+Normal rank
@@ Property / cites work @@
+Non-cooperative games
@@ Property / cites work: Non-cooperative games / rank @@
+Normal rank
@@ Property / cites work @@
+Nonconvergence to unstable points in urn models and stochastic approximations
+Normal rank
@@ Property / cites work @@
+Q5332984
@@ Property / cites work: Q5332984 / rank @@
+Normal rank
@@ links / mardi / name @@
+Publication:1429103