Multi-armed bandits with discount factor near one: The Bernoulli case (Q1161450): Difference between revisions
From MaRDI portal
Set profile property. |
Set OpenAlex properties. |
||
Property / full work available at URL | |||
Property / full work available at URL: https://doi.org/10.1214/aos/1176345578 / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W2095160246 / rank | |||
Normal rank |
Latest revision as of 21:49, 19 March 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Multi-armed bandits with discount factor near one: The Bernoulli case |
scientific article |
Statements
Multi-armed bandits with discount factor near one: The Bernoulli case (English)
0 references
1981
0 references
Bernoulli bandit process
0 references
limit rule
0 references
play-the-winner rule
0 references
least failures rule
0 references
discount optimality
0 references
expected average reward optimality
0 references
multi-armed bandit
0 references
optimal arm pulling strategy
0 references
infinite sequence of Bernoulli random variables
0 references
Gittins index
0 references
asymptotic bounds
0 references