scientific article; zbMATH DE number 1931819

From MaRDI portal

Publication:4709178

Jump to:navigation, search

MaRDI QIDQ4709178zbMATH OpenFDO

Authors Bikramjit Banerjee, Jing Peng

Publication date 20 June 2003

Full work available at URL http://link.springer.de/link/service/series/0558/bibs/2430/24300001.htm

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Applications of game theory (91A80)

Recommendations

Cited in

(8)

Gradient methods for solving Stackelberg games
AWESOME: a general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents
DOMAIN EXTENSIONS OF THE ERLANG LOSS FUNCTION: THEIR SCALABILITY AND ITS APPLICATIONS TO COOPERATIVE GAMES
Fast convergence of optimistic gradient ascent in network zero-sum extensive form games
On gradient-based learning in continuous games
Continuous-Time Discounted Mirror Descent Dynamics in Monotone Concave Games
The evolutionary dynamics of soft-max policy gradient in multi-agent settings
Policy invariance under reward transformations for general-sum stochastic games

This page was built for publication:

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4709178)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4709178&oldid=18951070"