An asymptotically optimal strategy for constrained multi-armed bandit problems (Q784789): Difference between revisions

From MaRDI portal
Importer
Changed an Item
Import241208061232
Normalize DOI.
 
(3 intermediate revisions by 3 users not shown)
Property / DOI
 
Property / DOI: 10.1007/s00186-019-00697-3 / rank
Normal rank
 
Property / Wikidata QID
 
Property / Wikidata QID: Q126414170 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Randomised allocation of treatments in sequential trials / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3809068 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Pure exploration in finitely-armed and continuous-armed bandits / rank
 
Normal rank
Property / cites work
 
Property / cites work: Prediction, Learning, and Games / rank
 
Normal rank
Property / cites work
 
Property / cites work: The multi-armed bandit, with constraints / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multi‐Armed Bandit Allocation Indices / rank
 
Normal rank
Property / cites work
 
Property / cites work: Probability Inequalities for Sums of Bounded Random Variables / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotically efficient adaptive allocation rules / rank
 
Normal rank
Property / cites work
 
Property / cites work: Algorithms for stochastic optimization with function or expectation constraints / rank
 
Normal rank
Property / cites work
 
Property / cites work: Penalty Function with Memory for Discrete Optimization via Simulation with Stochastic Constraints / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastically Constrained Ranking and Selection via SCORE / rank
 
Normal rank
Property / cites work
 
Property / cites work: Some aspects of the sequential design of experiments / rank
 
Normal rank
Property / cites work
 
Property / cites work: Introduction to Stochastic Search and Optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Online Learning Methods for Networking / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sample average approximation of expected value constrained stochastic programs / rank
 
Normal rank
Property / DOI
 
Property / DOI: 10.1007/S00186-019-00697-3 / rank
 
Normal rank

Latest revision as of 03:39, 10 December 2024

scientific article
Language: English
Label: An asymptotically optimal strategy for constrained multi-armed bandit problems
Description: scientific article
Also known as: (none listed)

    Statements

    An asymptotically optimal strategy for constrained multi-armed bandit problems (English)
    3 August 2020
    multi-armed bandit
    constrained stochastic optimization
    simulation optimization
    constrained Markov decision process

    Identifiers