Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems (Q5396763): Difference between revisions
From MaRDI portal
Created a new Item |
Changed an Item |
||
(4 intermediate revisions by 4 users not shown) | |||
Property / MaRDI profile type | |||
Property / MaRDI profile type: MaRDI publication profile / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W2950929549 / rank | |||
Normal rank | |||
Property / Wikidata QID | |||
Property / Wikidata QID: Q59538563 / rank | |||
Normal rank | |||
Property / arXiv ID | |||
Property / arXiv ID: 1204.5721 / rank | |||
Normal rank | |||
links / mardi / name | links / mardi / name | ||
Revision as of 01:50, 20 April 2024
scientific article; zbMATH DE number 6254309
Language | Label | Description | Also known as |
---|---|---|---|
English | Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems |
scientific article; zbMATH DE number 6254309 |
Statements
Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems (English)
0 references
3 February 2014
0 references
learning and statistical methods
0 references
game-theoretic learning
0 references
online learning
0 references
optimization
0 references
reinforcement learning
0 references