Learning in Games via Reinforcement and Regularization (Q2833105): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
 
(One intermediate revision by one other user not shown)
Property / arXiv ID
 
Property / arXiv ID: 1407.6267 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Domination or equilibrium / rank
 
Normal rank
Property / cites work
 
Property / cites work: Hessian Riemannian Gradient Flows in Convex Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Nonlinear Geometry of Linear Programming. I Affine and Projective Scaling Trajectories / rank
 
Normal rank
Property / cites work
 
Property / cites work: Mirror descent and nonlinear projected subgradient methods for convex optimization. / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the convergence of reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic Approximations and Differential Inclusions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Barrier Operators and Associated Gradient-Like Dynamical Systems for Constrained Minimization Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning through reinforcement and replicator dynamics / rank
 
Normal rank
Property / cites work
 
Property / cites work: A payoff-based learning procedure and its application to traffic games / rank
 
Normal rank
Property / cites work
 
Property / cites work: Penalty-Regulated Dynamics and Robust Learning Procedures in Games / rank
 
Normal rank
Property / cites work
 
Property / cites work: Perturbed variations of penalty function methods. Example: Projective SUMT / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive game playing using multiplicative weights / rank
 
Normal rank
Property / cites work
 
Property / cites work: Evolutionary Games in Economics / rank
 
Normal rank
Property / cites work
 
Property / cites work: Social Stability and Equilibrium / rank
 
Normal rank
Property / cites work
 
Property / cites work: Escort evolutionary game theory / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Simple Adaptive Procedure Leading to Correlated Equilibrium / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Global Convergence of Stochastic Fictitious Play / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive dynamics and evolutionary stability / rank
 
Normal rank
Property / cites work
 
Property / cites work: Evolutionary Games and Population Dynamics / rank
 
Normal rank
Property / cites work
 
Property / cites work: Time Average Replicator and Best-Reply Dynamics / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning, matching, and aggregation / rank
 
Normal rank
Property / cites work
 
Property / cites work: A note on best response dynamics. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Two Competing Models of How People Learn in Games / rank
 
Normal rank
Property / cites work
 
Property / cites work: Attainability of boundary points under reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Free-Steering Relaxation Methods for Problems with Strictly Convex Costs and Linear Constraints / rank
 
Normal rank
Property / cites work
 
Property / cites work: A continuous-time approach to online optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: The projection dynamic and the geometry of population games / rank
 
Normal rank
Property / cites work
 
Property / cites work: Higher order game dynamics / rank
 
Normal rank
Property / cites work
 
Property / cites work: Inertial Game Dynamics and Applications to Constrained Optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Individual <i>Q</i>-Learning in Normal Form Games / rank
 
Normal rank
Property / cites work
 
Property / cites work: The weighted majority algorithm / rank
 
Normal rank
Property / cites work
 
Property / cites work: Quantal response equilibria for normal form games / rank
 
Normal rank
Property / cites work
 
Property / cites work: The emergence of rational behavior in the presence of stochastic perturbations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Riemannian game dynamics / rank
 
Normal rank
Property / cites work
 
Property / cites work: ``Evolutionary'' selection dynamic in games: Convergence and limit properties / rank
 
Normal rank
Property / cites work
 
Property / cites work: Projected Dynamical Systems in the Formulation, Stability Analysis, and Computation of Fixed-Demand Traffic Network Equilibria / rank
 
Normal rank
Property / cites work
 
Property / cites work: Primal-dual subgradient methods for convex problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4888828 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal properties of stimulus-response learning models. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Evolutionary stability in asymmetric games / rank
 
Normal rank
Property / cites work
 
Property / cites work: The projection dynamic and the replicator dynamic / rank
 
Normal rank
Property / cites work
 
Property / cites work: Online Learning and Online Convex Optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Exponential weight algorithm in continuous time / rank
 
Normal rank
Property / cites work
 
Property / cites work: Possible generalization of Boltzmann-Gibbs statistics. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3474501 / rank
 
Normal rank
Property / cites work
 
Property / cites work: No-regret dynamics and fictitious play / rank
 
Normal rank

Latest revision as of 22:47, 12 July 2024

scientific article
Language Label Description Also known as
English
Learning in Games via Reinforcement and Regularization
scientific article

    Statements

    Learning in Games via Reinforcement and Regularization (English)
    0 references
    16 November 2016
    0 references
    Bregman divergence
    0 references
    dominated strategies
    0 references
    equilibrium stability
    0 references
    Fenchel coupling
    0 references
    penalty functions
    0 references
    projection dynamics
    0 references
    regularization
    0 references
    reinforcement learning
    0 references
    replicator dynamics
    0 references
    time averages
    0 references
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references