Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems (Q6167036): Difference between revisions

Set OpenAlex properties.
Normalize DOI.
Property / DOI: 10.1016/j.csda.2022.107610 (Normal rank)
Property / cites work: Sample mean based index policies by \(O(\log n)\) regret for the multi-armed bandit problem (Normal rank)
Property / cites work: Finite-time analysis of the multiarmed bandit problem (Normal rank)
Property / cites work: Q4998881 (Normal rank)
Property / cites work: Methods and Applications of Statistics in Clinical Trials (Normal rank)
Property / cites work: On Gittins' index theorem in continuous time (Normal rank)
Property / cites work: A General Theory of MultiArmed Bandit Processes with Constrained Arm Switches (Normal rank)
Property / cites work: Optimal stochastic scheduling (Normal rank)
Property / cites work: On the Whittle index of Markov modulated restless bandits (Normal rank)
Property / cites work: General Gittins index processes in discrete time. (Normal rank)
Property / cites work: Dynamic allocation problems in continuous time (Normal rank)
Property / cites work: Synchronization and optimality for multi-armed bandit problems in continuous time (Normal rank)
Property / cites work: Continuous-time allocation indices and their discrete-time approximation (Normal rank)
Property / cites work: Consistency of Sequential Bayesian Sampling Policies (Normal rank)
Property / cites work: Q4197923 (Normal rank)
Property / cites work: Q4057976 (Normal rank)
Property / cites work: On Bayesian models in stochastic scheduling (Normal rank)
Property / cites work: Q4692329 (Normal rank)
Property / cites work: Multi‐Armed Bandit Allocation Indices (Normal rank)
Property / cites work: Multi-armed bandit problem revisited (Normal rank)
Property / cites work: Gittins indices in the dynamic allocation problem for diffusion processes (Normal rank)
Property / cites work: Lévy bandits: Multi-armed bandits driven by Lévy processes (Normal rank)
Property / cites work: Multi-armed bandits in discrete and continuous time (Normal rank)
Property / cites work: On Bayesian index policies for sequential resource allocation (Normal rank)
Property / cites work: A Survey of Some Results in Stochastic Adaptive Control (Normal rank)
Property / cites work: Asymptotically efficient adaptive allocation rules (Normal rank)
Property / cites work: Open bandit processes and optimal scheduling of queueing networks (Normal rank)
Property / cites work: Adaptive treatment allocation and the multi-armed bandit problem (Normal rank)
Property / cites work: The Multi-Armed Bandit With Stochastic Plays (Normal rank)
Property / cites work: ON THE OPTIMALITY OF AN INDEX RULE IN MULTICHANNEL ALLOCATION FOR SINGLE-HOP MOBILE NETWORKS WITH MULTIPLE SERVICE CLASSES (Normal rank)
Property / cites work: Bandit Algorithms (Normal rank)
Property / cites work: On the Whittle Index for Restless Multiarmed Hidden Markov Bandits (Normal rank)
Property / cites work: A minimax and asymptotically optimal algorithm for stochastic bandits (Normal rank)
Property / cites work: Dynamic priority allocation via restless bandit marginal productivity indices (Normal rank)
Property / cites work: Some aspects of the sequential design of experiments (Normal rank)
Property / cites work: Linearly Parameterized Bandits (Normal rank)
Property / cites work: Q4626283 (Normal rank)
Property / cites work: A short proof of the Gittins index theorem (Normal rank)
Property / cites work: Extensions of the multiarmed bandit problem: The discounted case (Normal rank)
Property / cites work: INDEXABILITY AND OPTIMAL INDEX POLICIES FOR A CLASS OF REINITIALISING RESTLESS BANDITS (Normal rank)
Property / cites work: Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges (Normal rank)
Property / cites work: On the Gittins index for multiarmed bandits (Normal rank)
Property / cites work: Branching Bandit Processes (Normal rank)
Property / cites work: Q3882215 (Normal rank)
Property / cites work: Arm-acquiring bandits (Normal rank)
Property / cites work: Q3221798 (Normal rank)
Property / cites work: Q3815845 (Normal rank)
Property / cites work: Open Bandit Processes with Uncountable States and Time-Backward Effects (Normal rank)
Property / DOI: 10.1016/J.CSDA.2022.107610 (Normal rank)

Latest revision as of 19:04, 30 December 2024

scientific article; zbMATH DE number 7708608
Language: English
Label: Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems
Description: scientific article; zbMATH DE number 7708608

    Statements

    Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems (English)
    7 July 2023
    multi-armed bandit problem
    reinforcement learning
    rewarded Markov process
    Gittins index
    empirical Gittins index
