Mathematical Research Data Initiative
Main page
Recent changes
Random page
SPARQL
MaRDI@GitHub
New item
In other projects
MaRDI portal item
Discussion
View source
View history
English
Log in

Learning Theory

From MaRDI portal
Publication:4680907
Jump to:navigation, search

DOI10.1007/B98522zbMATH Open1078.91514OpenAlexW4206057230MaRDI QIDQ4680907FDOQ4680907

Shie Mannor

Publication date: 13 June 2005

Published in: Lecture Notes in Computer Science (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/b98522




Recommendations

  • Reinforcement learning for exploratory linear-quadratic two-person zero-sum stochastic differential games
  • Convergent multiple-timescales reinforcement learning algorithms in normal form games
  • scientific article; zbMATH DE number 3920249
  • scientific article
  • scientific article; zbMATH DE number 4020880


Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Stochastic games, stochastic differential games (91A15) Rationality and learning in game theory (91A26)



Cited In (3)

  • The lagging anchor algorithm: Reinforcement learning in two-player zero-sum games with imperfect information
  • Reinforcement Learning rules in a repeated game
  • Reinforcement learning for exploratory linear-quadratic two-person zero-sum stochastic differential games

Uses Software

  • R-MAX





This page was built for publication: Learning Theory

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4680907)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4680907&oldid=18903862"
Tools
What links here
Related changes
Printable version
Permanent link
Page information
This page was last edited on 7 February 2024, at 18:33. Warning: Page may not contain recent updates.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki