Adaptive game playing using multiplicative weights (Q1818286)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Adaptive game playing using multiplicative weights
scientific article

    Statements

    Adaptive game playing using multiplicative weights (English)
    0 references
    0 references
    0 references
    1 February 2000
    0 references
    This paper is devoted to the algorithm MW (Multiplicative Weights). The main theorem concerning this algorithm is the following: Theorem. For any matrix \(M\) with \(n\) rows and entries in \([0,1]\), and for any sequence of mixed strategies \(Q_1,\dots, Q_T\) play by the environment, the sequence of mixed strategies \(P_1,\dots, P_T\) produced by the algorithm MW satisfies \[ \sum_{t=1}^T M(P_t,Q_t)\leq \min_P \Biggl[ a_\beta \sum_{t=1}^T M(P,Q_t)+ c_\beta RE(P\parallel P_1)\Biggr], \] where \(a_\beta= \frac{\ln(1/\beta)} {1-\beta}\) and \(c_\beta= \frac{1}{1-\beta}\). The authors show how this algorithm can be used to give a simple proof of von Neumann's min-max theorem. A possible application of the algorithm to solve linear programming problems is also noted. A version of the algorithm whose distributions are guaranteed to converge to an optimal mixed strategy is also given. The authors show that the convergence rate of the second version of the algorithm is asymptotically optimal.
    0 references
    0 references
    repeated game
    0 references
    multiplicative weights algorithm
    0 references
    asymptotic optimality
    0 references
    0 references