Learning Theory
From MaRDI portal
Publication:4680907
Recommendations
- Reinforcement learning for exploratory linear-quadratic two-person zero-sum stochastic differential games
- Convergent multiple-timescales reinforcement learning algorithms in normal form games
- scientific article; zbMATH DE number 3920249
- scientific article; zbMATH DE number 4102853
- scientific article; zbMATH DE number 4020880
Cited in
(5)- On the effect of clock offsets and quantization on learning-based adversarial games
- Reinforcement Learning rules in a repeated game
- Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium
- Reinforcement learning for exploratory linear-quadratic two-person zero-sum stochastic differential games
- The lagging anchor algorithm: Reinforcement learning in two-player zero-sum games with imperfect information
This page was built for publication: Learning Theory
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4680907)