A Survey of Preference-Based Online Learning with Bandit Algorithms (Q2938721): Difference between revisions
From MaRDI portal
Created a new Item |
Set OpenAlex properties. |
||
(2 intermediate revisions by 2 users not shown) | |||
Property / MaRDI profile type | |||
Property / MaRDI profile type: MaRDI publication profile / rank | |||
Normal rank | |||
Property / full work available at URL | |||
Property / full work available at URL: https://doi.org/10.1007/978-3-319-11662-4_3 / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W1032589285 / rank | |||
Normal rank | |||
links / mardi / name | links / mardi / name | ||
Latest revision as of 22:17, 19 March 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | A Survey of Preference-Based Online Learning with Bandit Algorithms |
scientific article |
Statements
A Survey of Preference-Based Online Learning with Bandit Algorithms (English)
0 references
14 January 2015
0 references
multi-armed bandits
0 references
online learning
0 references
preference learning
0 references
ranking
0 references
top-k selection
0 references
exploration/exploitation
0 references
cumulative regret
0 references
sample complexity
0 references
PAC learning
0 references