Regret bounds for Narendra-Shapiro bandit algorithms (Q5086451)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Regret bounds for Narendra-Shapiro bandit algorithms |
scientific article; zbMATH DE number 7553390
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined |
||
| English | Regret bounds for Narendra-Shapiro bandit algorithms |
scientific article; zbMATH DE number 7553390 |
Statements
Regret bounds for Narendra-Shapiro bandit algorithms (English)
0 references
5 July 2022
0 references
regret
0 references
stochastic bandit algorithms
0 references
piecewise deterministic Markov processes
0 references
0 references
0.9097681
0 references
0.9030225
0 references
0.89547646
0 references
0.88458246
0 references
0.88458246
0 references
0.8825656
0 references
0.8787991
0 references
0.8732257
0 references
0 references