A Survey of Preference-Based Online Learning with Bandit Algorithms (Q2938721): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
Set OpenAlex properties.
 
(One intermediate revision by one other user not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1007/978-3-319-11662-4_3 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W1032589285 / rank
 
Normal rank

Latest revision as of 22:17, 19 March 2024

scientific article
Language Label Description Also known as
English
A Survey of Preference-Based Online Learning with Bandit Algorithms
scientific article

    Statements

    A Survey of Preference-Based Online Learning with Bandit Algorithms (English)
    0 references
    0 references
    0 references
    14 January 2015
    0 references
    0 references
    multi-armed bandits
    0 references
    online learning
    0 references
    preference learning
    0 references
    ranking
    0 references
    top-k selection
    0 references
    exploration/exploitation
    0 references
    cumulative regret
    0 references
    sample complexity
    0 references
    PAC learning
    0 references
    0 references