Finite-time analysis of the multiarmed bandit problem (Q5959973): Difference between revisions
From MaRDI portal
Added link to MaRDI item. |
Created claim: Wikidata QID (P12): Q56675670, #quickstatements; #temporary_batch_1711565664090 |
||
(One intermediate revision by one other user not shown) | |||
Property / MaRDI profile type | |||
Property / MaRDI profile type: Publication / rank | |||
Normal rank | |||
Property / Wikidata QID | |||
Property / Wikidata QID: Q56675670 / rank | |||
Normal rank |
Latest revision as of 22:57, 27 March 2024
scientific article; zbMATH DE number 1727089
Language | Label | Description | Also known as |
---|---|---|---|
English | Finite-time analysis of the multiarmed bandit problem |
scientific article; zbMATH DE number 1727089 |
Statements
Finite-time analysis of the multiarmed bandit problem (English)
0 references
11 April 2002
0 references
reinforcement learning
0 references