A Survey of Preference-Based Online Learning with Bandit Algorithms (Q2938721): Difference between revisions
From MaRDI portal
Added link to MaRDI item. |
Set OpenAlex properties. |
||
(One intermediate revision by one other user not shown) | |||
Property / MaRDI profile type | |||
Property / MaRDI profile type: MaRDI publication profile / rank | |||
Normal rank | |||
Property / full work available at URL | |||
Property / full work available at URL: https://doi.org/10.1007/978-3-319-11662-4_3 / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W1032589285 / rank | |||
Normal rank |
Latest revision as of 22:17, 19 March 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | A Survey of Preference-Based Online Learning with Bandit Algorithms |
scientific article |
Statements
A Survey of Preference-Based Online Learning with Bandit Algorithms (English)
0 references
14 January 2015
0 references
multi-armed bandits
0 references
online learning
0 references
preference learning
0 references
ranking
0 references
top-k selection
0 references
exploration/exploitation
0 references
cumulative regret
0 references
sample complexity
0 references
PAC learning
0 references