An adjusted payoff-based procedure for normal form games
From MaRDI portal
Publication:2833112
Abstract: We study a simple adaptive model in the framework of an N-player normal form game. The model consists of a repeated game in which each player knows only her own action space and the payoff she scores at each stage, not those of the other agents. To update her mixed action, each player computes the average vector payoff she has obtained, using the number of times she has played each pure action. The resulting stochastic process is analyzed via the ODE method from stochastic approximation theory. We are interested in the convergence of the process to rest points of the related continuous-time dynamics. Results concerning almost sure convergence and convergence with positive probability are obtained and applied to a traffic game. We also provide examples where convergence occurs with probability zero.
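The procedure described in the abstract can be illustrated with a minimal simulation sketch. Each player tracks the empirical average payoff of each of her pure actions and nudges her mixed action with a vanishing step size, as in stochastic approximation. The specific adjustment used below (a logit smoothed best response to the averaged payoffs, with an assumed temperature of 0.1) is an illustrative choice, not the paper's exact rule; the function name and parameters are likewise hypothetical.

```python
import numpy as np

def payoff_based_procedure(payoffs, n_steps=20000, seed=0):
    """Simulate a payoff-based adaptive procedure in a 2-player normal form game.

    Each player observes only her own realized payoff, maintains the empirical
    average payoff per pure action (using the number of times that action was
    played), and moves her mixed action a small step toward a smoothed best
    response to those averages. This is a sketch, not the paper's exact rule.

    payoffs: [A, B] with A[i, j] the payoff to player 1 and B[i, j] the payoff
    to player 2 when player 1 plays i and player 2 plays j.
    """
    rng = np.random.default_rng(seed)
    n_actions = [payoffs[0].shape[0], payoffs[1].shape[1]]
    x = [np.full(n, 1.0 / n) for n in n_actions]   # initial mixed actions: uniform
    counts = [np.zeros(n) for n in n_actions]      # times each pure action was played
    avg = [np.zeros(n) for n in n_actions]         # empirical average payoff per action
    for t in range(1, n_steps + 1):
        a = [rng.choice(n_actions[i], p=x[i]) for i in range(2)]
        u = [payoffs[0][a[0], a[1]], payoffs[1][a[0], a[1]]]
        step = 1.0 / (t + 1)                       # Robbins-Monro vanishing step size
        for i in range(2):
            counts[i][a[i]] += 1
            # incremental update of the average payoff of the action just played
            avg[i][a[i]] += (u[i] - avg[i][a[i]]) / counts[i][a[i]]
            # logit smoothed best response to the averaged payoffs (assumed rule)
            br = np.exp(avg[i] / 0.1)
            br /= br.sum()
            x[i] += step * (br - x[i])             # stays on the simplex
    return x
```

In a game where action 0 is strictly dominant for both players, the averaged payoffs quickly separate the actions and both mixed actions concentrate on the dominant action, consistent with convergence to a rest point of the associated continuous-time dynamics.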
Recommendations
- On the payoff representations of normal form games
- Refinements of rationalizability for normal-form games
- The theory of normal form games from the differentiable viewpoint
- The refined best-response correspondence in normal form games
- On the core of normal form games with a continuum of players
- Essential equilibrium in normal-form games with perturbed actions and payoffs
- Decompositions and potentials for normal form games
- Quantal response equilibria for normal form games
- Essential equilibria in normal-form games
- Normal form structures in extensive form games
Cites work
- Scientific article; zbMATH DE number 3616736 (title unavailable)
- Scientific article; zbMATH DE number 1795161 (title unavailable)
- Scientific article; zbMATH DE number 1405930 (title unavailable)
- A Simple Adaptive Procedure Leading to Correlated Equilibrium
- A Stochastic Approximation Method
- A behavioral learning process in games
- A payoff-based learning procedure and its application to traffic games
- Adaptive game playing using multiplicative weights
- Do stochastic algorithms avoid traps?
- Individual Q-Learning in Normal Form Games
- Learning in perturbed asymmetric games
- Learning through reinforcement and replicator dynamics
- Nonconvergence to unstable points in urn models and stochastic approximations
- On the convergence of reinforcement learning
- Potential games
- Revisiting log-linear learning: asynchrony, completeness and payoff-based implementation
- The logit-response dynamics
- The statistical mechanics of strategic interaction
- Urn models, replicator processes, and random genetic drift
Cited in (5 documents)
- Learning and equilibrium transitions: stochastic stability in discounted stochastic fictitious play
- Identifying behaviorally robust strategies for normal form games under varying forms of uncertainty
- Penalty-regulated dynamics and robust learning procedures in games
- Payoff-based dynamics for multiplayer weakly acyclic games
- On the stability of an adaptive learning dynamics in traffic games