Mathematical Research Data Initiative
Main page
Recent changes
Random page
SPARQL
MaRDI@GitHub
New item
Special pages
In other projects
MaRDI portal item
Discussion
View source
View history
English
Log in

scientific article; zbMATH DE number 3822951

From MaRDI portal
Publication:3668675
Jump to:navigation, search

zbMATH Open0519.62065MaRDI QIDQ3668675FDOQ3668675


Authors: Radu Theodorescu, Dieter Kalin Edit this on Wikidata


Publication date: 1982



Title of this publication is not available (Why is that?)




zbMATH Keywords

finite horizonlearning algorithmtwo-armed bandit problemcharacterizations of optimal policiesmonotonicity properties for expected cumulative discounted reward


Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Sequential statistical design (62L05) Dynamic programming (90C39) Optimal stopping in statistics (62L15)







This page was built for publication:

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3668675)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3668675&oldid=17134076"
Tools
What links here
Related changes
Printable version
Permanent link
Page information
This page was last edited on 5 February 2024, at 07:11. Warning: Page may not contain recent updates.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki