Online calibrated forecasts: memory efficiency versus universality for learning in games
From MaRDI portal
Publication:2384142
Recommendations
- Learning in games with bounded memory
- Learning with bounded memory in games
- Prediction, Optimization, and Learning in Repeated Games
- Learning and equilibrium as useful approximations: accuracy of prediction on randomly selected constant sum games
- scientific article; zbMATH DE number 1804097
- Asymptotically optimal strategies for online prediction with history-dependent experts
- Smooth calibration, leaky forecasts, finite recall, and Nash dynamics
Cites work
- scientific article; zbMATH DE number 48363 (Why is no real title available?)
- scientific article; zbMATH DE number 51132 (Why is no real title available?)
- scientific article; zbMATH DE number 1232130 (Why is no real title available?)
- scientific article; zbMATH DE number 1233801 (Why is no real title available?)
- scientific article; zbMATH DE number 1043533 (Why is no real title available?)
- scientific article; zbMATH DE number 1134975 (Why is no real title available?)
- scientific article; zbMATH DE number 903638 (Why is no real title available?)
- scientific article; zbMATH DE number 1424768 (Why is no real title available?)
- scientific article; zbMATH DE number 3205074 (Why is no real title available?)
- A game of prediction with expert advice
- A general class of adaptive strategies
- A separation principle for the control of a class of nonlinear systems
- Adaptive Heuristics
- An easier way to calibrate.
- Asymptotic calibration
- Asynchronous stochastic approximation and Q-learning
- COMPLEXITY AND REAL COMPUTATION: A MANIFESTO
- Calibrated learning and correlated equilibrium
- Calibration with Many Checking Rules
- Convergence rate of linear two-time-scale stochastic approximation.
- Convergent multiple-timescales reinforcement learning algorithms in normal form games
- Dynamic fictitious play, dynamic gradient play, and distributed convergence to Nash equilibria
- Equivalent necessary and sufficient conditions on noise sequences for stochastic approximation algorithms
- Evolutionary Games and Population Dynamics
- Learning Theory
- Mixed equilibria and dynamical systems arising from fictitious play in perturbed games
- Regret in the on-line decision problem
- Self-Calibrating Priors Do Not Exist
- Stochastic approximation with two time scales
- Stochastic uncoupled dynamics and Nash equilibrium
- The Nonstochastic Multiarmed Bandit Problem
- The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning
Cited in
(3)
This page was built for publication: Online calibrated forecasts: memory efficiency versus universality for learning in games
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2384142)