Finite-time analysis of the multiarmed bandit problem (Q5959973): Difference between revisions
From MaRDI portal
Set profile property.
Created claim: Wikidata QID (P12): Q56675670, #quickstatements; #temporary_batch_1711565664090
Property / Wikidata QID: Q56675670
Property / Wikidata QID: Q56675670 / rank: Normal rank
Latest revision as of 21:57, 27 March 2024
scientific article; zbMATH DE number 1727089
Language | Label | Description | Also known as |
---|---|---|---|
English | Finite-time analysis of the multiarmed bandit problem | scientific article; zbMATH DE number 1727089 | |
Statements
Finite-time analysis of the multiarmed bandit problem (English)
11 April 2002
reinforcement learning