Multi-armed bandits with discount factor near one: The Bernoulli case (Q1161450): Difference between revisions
From MaRDI portal
Created a new Item |
Set OpenAlex properties. |
||
(2 intermediate revisions by 2 users not shown) | |||
Property / MaRDI profile type | |||
Property / MaRDI profile type: MaRDI publication profile / rank | |||
Normal rank | |||
Property / full work available at URL | |||
Property / full work available at URL: https://doi.org/10.1214/aos/1176345578 / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W2095160246 / rank | |||
Normal rank | |||
links / mardi / name | links / mardi / name | ||
Latest revision as of 21:49, 19 March 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Multi-armed bandits with discount factor near one: The Bernoulli case |
scientific article |
Statements
Multi-armed bandits with discount factor near one: The Bernoulli case (English)
0 references
1981
0 references
Bernoulli bandit process
0 references
limit rule
0 references
play-the-winner rule
0 references
least failures rule
0 references
discount optimality
0 references
expected average reward optimality
0 references
multi-armed bandit
0 references
optimal arm pulling strategy
0 references
infinite sequence of Bernoulli random variables
0 references
Gittins index
0 references
asymptotic bounds
0 references