Adaptive game playing using multiplicative weights (Q1818286)

From MaRDI portal

Jump to:navigation, search

scientific article

Language	Label	Description	Also known as
English	Adaptive game playing using multiplicative weights	scientific article

Statements

scholarly article

0 references

Adaptive game playing using multiplicative weights (English)

0 references

zbMATH Open document ID

0 references

10.1006/game.1999.0738

0 references

0 references

Robert E. Schapire

0 references

Games and Economic Behavior

0 references

publication date

1 February 2000

0 references

This paper is devoted to the algorithm MW (Multiplicative Weights). The main theorem concerning this algorithm is the following: Theorem. For any matrix \(M\) with \(n\) rows and entries in \([0,1]\), and for any sequence of mixed strategies \(Q_1,\dots, Q_T\) play by the environment, the sequence of mixed strategies \(P_1,\dots, P_T\) produced by the algorithm MW satisfies \[ \sum_{t=1}^T M(P_t,Q_t)\leq \min_P \Biggl[ a_\beta \sum_{t=1}^T M(P,Q_t)+ c_\beta RE(P\parallel P_1)\Biggr], \] where \(a_\beta= \frac{\ln(1/\beta)} {1-\beta}\) and \(c_\beta= \frac{1}{1-\beta}\). The authors show how this algorithm can be used to give a simple proof of von Neumann's min-max theorem. A possible application of the algorithm to solve linear programming problems is also noted. A version of the algorithm whose distributions are guaranteed to converge to an optimal mixed strategy is also given. The authors show that the convergence rate of the second version of the algorithm is asymptotically optimal.

0 references

Mathematics Subject Classification ID

0 references

0 references

zbMATH DE Number

0 references

zbMATH Keywords

repeated game

0 references

multiplicative weights algorithm

0 references

asymptotic optimality

0 references

K. Chandrasekhara Rao

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1006/game.1999.0738

0 references

0 references

Fast probabilistic algorithms for Hamiltonian circuits and matchings

0 references

0 references

An analog of the minimax theorem for vector payoffs

0 references

0 references

How to use expert advice

0 references

Universal Portfolios

0 references

Universal portfolios with side information

0 references

0 references

Present Position and Potential Developments: Some Personal Views: Statistical Theory: The Prequential Approach

0 references

Universal prediction of individual sequences

0 references

0 references

Prediction in the worst case

0 references

A Randomization Rule for Selecting Forecasts

0 references

Asymptotic calibration

0 references

Regret in the on-line decision problem

0 references

Consistency and cautious fictitious play

0 references

A sublinear-time randomized approximation algorithm for matrix games

0 references

0 references

On‐Line Portfolio Selection Using Multiplicative Updates

0 references

Probability Inequalities for Sums of Bounded Random Variables

0 references

0 references

The weighted majority algorithm

0 references

0 references

Fast Approximation Algorithms for Fractional Packing and Covering Problems

0 references

Universal sequential coding of single messages

0 references

A game of prediction with expert advice

0 references

0 references

Coding theorems for individual sequences

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1818286

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q1818286&oldid=34232500"