Optimal learning for sequential sampling with non-parametric beliefs (Q742143): Difference between revisions

From MaRDI portal
 
Property / cites work: The Continuum-Armed Bandit Problem (Normal rank)
Property / cites work: Q5396715 (Normal rank)
Property / cites work: Q4836494 (Normal rank)
Property / cites work: Widely Convergent Method for Finding Multiple Solutions of Simultaneous Nonlinear Equations (Normal rank)
Property / cites work: Sequential Procedures for Aggregating Arbitrary Estimators of a Conditional Mean (Normal rank)
Property / cites work: Monotone Approximation of Decision Problems (Normal rank)
Property / cites work: Q3241542 (Normal rank)
Property / cites work: Q3125064 (Normal rank)
Property / cites work: A Knowledge-Gradient Policy for Sequential Information Collection (Normal rank)
Property / cites work: The Knowledge-Gradient Policy for Correlated Normal Beliefs (Normal rank)
Property / cites work: A decision-theoretic generalization of on-line learning and an application to boosting (Normal rank)
Property / cites work: Q4845385 (Normal rank)
Property / cites work: Q3096184 (Normal rank)
Property / cites work: Q4057976 (Normal rank)
Property / cites work: Q4197923 (Normal rank)
Property / cites work: Bayesian look ahead one-stage sampling allocations for selection of the best population (Normal rank)
Property / cites work: Q3999325 (Normal rank)
Property / cites work: Nonparametric and semiparametric models. (Normal rank)
Property / cites work: Global optimization of stochastic black-box systems via sequential kriging meta-models (Normal rank)
Property / cites work: Functional aggregation for nonparametric regression. (Normal rank)
Property / cites work: The Knowledge-Gradient Algorithm for Sequencing Experiments in Drug Discovery (Normal rank)
Property / cites work: Approximate Dynamic Programming (Normal rank)
Property / cites work: A Stochastic Approximation Method (Normal rank)
Property / cites work: The Knowledge Gradient Algorithm for a General Class of Online Learning Problems (Normal rank)
Property / cites work: Introduction to Stochastic Search and Optimization (Normal rank)
Property / cites work: An informational approach to the global optimization of expensive-to-evaluate functions (Normal rank)

Latest revision as of 01:03, 9 July 2024

Language: English
Label: Optimal learning for sequential sampling with non-parametric beliefs
Description: scientific article

    Statements

    Optimal learning for sequential sampling with non-parametric beliefs (English)
    18 September 2014
    The authors consider the problem of maximizing an unknown function over a finite set of possible alternatives. They propose a sequential learning policy for ranking and selection problems, using a non-parametric procedure for estimating the value of a policy. Their estimation approach aggregates over a set of kernel functions in order to achieve a more consistent estimator. The final estimate is a weighted combination of the kernel estimators, with weights given by the inverse mean square errors of the individual estimators. This weighting scheme is shown to be optimal when the kernel estimators are independent. To choose the next measurement, the authors employ the knowledge gradient policy, which relies on predictive distributions to calculate the optimal sampling point. The method accommodates settings in which the beliefs are expected to be correlated but the correlation structure is not known beforehand. Moreover, the proposed policy is shown to be asymptotically optimal.
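The two ingredients named in the review can be illustrated with a minimal Python sketch. It is not the authors' implementation. The first part combines several estimates of the same quantity using weights proportional to their inverse mean square errors. The second part computes the textbook knowledge gradient for independent normal beliefs, which stands in here for the paper's non-parametric variant. All function and parameter names (`inverse_mse_weights`, `aggregate`, `knowledge_gradient`, `noise_var`) are hypothetical and chosen only for illustration.

```python
import math


def inverse_mse_weights(mses):
    """Return weights proportional to 1/MSE, normalized to sum to one."""
    inv = [1.0 / m for m in mses]
    total = sum(inv)
    return [v / total for v in inv]


def aggregate(estimates, mses):
    """Combine estimates of one quantity using inverse-MSE weights."""
    weights = inverse_mse_weights(mses)
    return sum(w * e for w, e in zip(weights, estimates))


def knowledge_gradient(mu, sigma2, noise_var):
    """Knowledge gradient per alternative under independent normal beliefs.

    Assumption: this is the standard independent-beliefs formula, not the
    non-parametric version from the reviewed paper.
    mu: posterior means; sigma2: posterior variances (all > 0);
    noise_var: variance of a single measurement.
    """
    values = []
    for x in range(len(mu)):
        # Standard deviation of the change in the mean after one measurement.
        sigma_tilde = sigma2[x] / math.sqrt(sigma2[x] + noise_var)
        best_other = max(m for i, m in enumerate(mu) if i != x)
        zeta = -abs(mu[x] - best_other) / sigma_tilde
        pdf = math.exp(-0.5 * zeta * zeta) / math.sqrt(2.0 * math.pi)
        cdf = 0.5 * (1.0 + math.erf(zeta / math.sqrt(2.0)))
        values.append(sigma_tilde * (zeta * cdf + pdf))
    return values


def next_measurement(mu, sigma2, noise_var):
    """Index of the alternative with the largest knowledge gradient."""
    kg = knowledge_gradient(mu, sigma2, noise_var)
    return max(range(len(kg)), key=kg.__getitem__)
```

In this sketch an estimator with a smaller mean square error receives a larger weight, and the alternative chosen for the next measurement is the one whose single observation is expected to improve the best posterior mean the most.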
    Bayesian global optimization
    knowledge gradient
    non-parametric estimation

    Identifiers