Finite-time analysis of the multiarmed bandit problem (Q5959973): Difference between revisions
From MaRDI portal
Created a new Item |
Created claim: Wikidata QID (P12): Q56675670, #quickstatements; #temporary_batch_1711565664090 |
||
(2 intermediate revisions by 2 users not shown) | |||
Property / MaRDI profile type | |||
Property / MaRDI profile type: Publication / rank | |||
Normal rank | |||
Property / Wikidata QID | |||
Property / Wikidata QID: Q56675670 / rank | |||
Normal rank | |||
links / mardi / name | links / mardi / name | ||
Latest revision as of 22:57, 27 March 2024
scientific article; zbMATH DE number 1727089
Language | Label | Description | Also known as |
---|---|---|---|
English | Finite-time analysis of the multiarmed bandit problem |
scientific article; zbMATH DE number 1727089 |
Statements
Finite-time analysis of the multiarmed bandit problem (English)
0 references
11 April 2002
0 references
reinforcement learning
0 references