Finite-Time Analysis for the Knowledge-Gradient Policy (Q4610155): Difference between revisions

From MaRDI portal
Changed an Item
Created claim: Wikidata QID (P12): Q130050586, #quickstatements; #temporary_batch_1726319863356
 
(5 intermediate revisions by 5 users not shown)
Property / describes a project that uses
 
Property / describes a project that uses: BayesDA / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / arXiv ID
 
Property / arXiv ID: 1606.04624 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sample mean based index policies by <i>O</i>(log <i>n</i>) regret for the multi-armed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Exploration-exploitation tradeoff using variance estimates in multi-armed bandits / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal learning for sequential sampling with non-parametric beliefs / rank
 
Normal rank
Property / cites work
 
Property / cites work: Selecting a Selection Procedure / rank
 
Normal rank
Property / cites work
 
Property / cites work: Bandits With Heavy Tail / rank
 
Normal rank
Property / cites work
 
Property / cites work: Kullback-Leibler upper confidence bounds for optimal sequential allocation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Efficient Dynamic Simulation Allocation in Ordinal Optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simulation budget allocation for further enhancing the efficiency of ordinal optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Knowledge-Gradient Policy for Correlated Normal Beliefs / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Knowledge-Gradient Policy for Sequential Information Collection / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Upper-Confidence Bound Policies for Switching Bandit Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2873072 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4197923 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Data-Correcting Algorithm for the Minimization of Supermodular Functions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5689624 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Bayesian look ahead one-stage sampling allocations for selection of the best population / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Bayesian Approach to Some Best Population Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Global optimization of stochastic black-box systems via sequential kriging meta-models / rank
 
Normal rank
Property / cites work
 
Property / cites work: Efficient global optimization of expensive black-box functions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Regret bounds for sleeping experts and bandits / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotically efficient adaptive allocation rules / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5396715 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3993195 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Knowledge-Gradient Algorithm for Sequencing Experiments in Drug Discovery / rank
 
Normal rank
Property / cites work
 
Property / cites work: An analysis of approximations for maximizing submodular set functions—I / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4497726 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2963389017 / rank
 
Normal rank
Property / Wikidata QID
 
Property / Wikidata QID: Q130050586 / rank
 
Normal rank

Latest revision as of 14:37, 14 September 2024

scientific article; zbMATH DE number 6856518
Language Label Description Also known as
English
Finite-Time Analysis for the Knowledge-Gradient Policy
scientific article; zbMATH DE number 6856518

    Statements

    Finite-Time Analysis for the Knowledge-Gradient Policy (English)
    0 references
    0 references
    0 references
    5 April 2018
    0 references
    ranking and selection
    0 references
    sequential decision analysis
    0 references
    stochastic control
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references