Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems (Q6167036): Difference between revisions
From MaRDI portal
ReferenceBot (talk | contribs) Changed an Item |
Normalize DOI. |
||
Property / DOI | |||
Property / DOI: 10.1016/j.csda.2022.107610 / rank | |||
Property / DOI | |||
Property / DOI: 10.1016/J.CSDA.2022.107610 / rank | |||
Normal rank |
Latest revision as of 19:04, 30 December 2024
scientific article; zbMATH DE number 7708608
Language | Label | Description | Also known as |
---|---|---|---|
English | Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems |
scientific article; zbMATH DE number 7708608 |
Statements
Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems (English)
0 references
7 July 2023
0 references
multi-armed bandit problem
0 references
reinforcement learning
0 references
rewarded Markov process
0 references
Gittins index
0 references
empirical Gittins index
0 references
0 references
0 references