Q4998863 (Q4998863): Difference between revisions

From MaRDI portal
Import240304020342 (talk | contribs)
Set profile property.
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: UCB revisited: improved regret bounds for the stochastic multi-armed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Bandits with Knapsacks / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3809068 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary Rewards / rank
 
Normal rank
Property / cites work
 
Property / cites work: New approaches to statistical learning theory / rank
 
Normal rank
Property / cites work
 
Property / cites work: Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: The multi-armed bandit problem with covariates / rank
 
Normal rank
Property / cites work
 
Property / cites work: Prediction, Learning, and Games / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Upper-Confidence Bound Policies for Switching Bandit Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multi‐Armed Bandit Allocation Indices / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5396640 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2810758 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Thompson Sampling: An Asymptotically Optimal Finite-Time Analysis / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5302093 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotically efficient adaptive allocation rules / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning in a Changing World: Restless Multiarmed Bandit With Unknown Dynamics / rank
 
Normal rank

Latest revision as of 05:12, 26 July 2024

scientific article; zbMATH DE number 7370520
Language Label Description Also known as
English
No label defined
scientific article; zbMATH DE number 7370520

    Statements

    0 references
    0 references
    0 references
    9 July 2021
    0 references
    multi-armed bandit
    0 references
    exploration-exploitation trade-off
    0 references
    retail management
    0 references
    online applications
    0 references
    regret bounds
    0 references
    incorporating time-series into bandits
    0 references

    Identifiers