Robustness of stochastic bandit policies (Q391739): Difference between revisions

From MaRDI portal
Import240304020342 (talk | contribs)
Set profile property.
ReferenceBot (talk | contribs)
Changed an Item
 
(2 intermediate revisions by 2 users not shown)
Property / OpenAlex ID
 
Property / OpenAlex ID: W1985558253 / rank
 
Normal rank
Property / arXiv ID
 
Property / arXiv ID: 1107.4506 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sample mean based index policies by O(log n) regret for the multi-armed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Exploration-exploitation tradeoff using variance estimates in multi-armed bandits / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stationary multi-choice bandit problems. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal adaptive policies for sequential allocation problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5302093 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotically efficient adaptive allocation rules / rank
 
Normal rank
Property / cites work
 
Property / cites work: When can the two-armed bandit algorithm be trusted? / rank
 
Normal rank
Property / cites work
 
Property / cites work: The tight constant in the Dvoretzky-Kiefer-Wolfowitz inequality / rank
 
Normal rank
Property / cites work
 
Property / cites work: Some aspects of the sequential design of experiments / rank
 
Normal rank

Latest revision as of 04:54, 7 July 2024

Language: English
Label: Robustness of stochastic bandit policies
Description: scientific article
Also known as: (none)
