Optimal learning for sequential sampling with non-parametric beliefs (Q742143): Difference between revisions

The authors consider the problem of maximizing an unknown function over a finite set of possible alternatives. They propose a sequential learning policy for ranking and selection problems, using a non-parametric procedure for estimating the value of a policy. Their estimation approach aggregates over a set of kernel functions in order to achieve a more consistent estimator. The final estimate is a weighted combination of the kernel estimators, with the inverse mean square errors of the estimators as weights. This weighting scheme is shown to be optimal when the kernel estimators are independent. To choose the next measurement, the authors employ the knowledge gradient policy, which relies on predictive distributions to calculate the optimal sampling point. The method covers a setting in which the beliefs are expected to be correlated but the correlation structure is not known beforehand. Moreover, the proposed policy is shown to be asymptotically optimal.
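
The inverse mean square error weighting described in the review can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `inverse_mse_weights` and `aggregate_estimates` are hypothetical, and the sketch assumes that each kernel estimator's mean square error is already available as a strictly positive number, whereas in practice it would have to be estimated from data.

```python
def inverse_mse_weights(mse_values):
    """Return weights proportional to 1/MSE, normalized to sum to 1.

    Assumption: every MSE is known and strictly positive.
    """
    if any(m <= 0 for m in mse_values):
        raise ValueError("MSE values must be strictly positive")
    inverse = [1.0 / m for m in mse_values]
    total = sum(inverse)
    return [v / total for v in inverse]


def aggregate_estimates(estimates, mse_values):
    """Combine several estimates of the same quantity into one.

    Each estimate is weighted by its normalized inverse MSE, so
    estimators with a smaller error contribute more to the result.
    """
    if len(estimates) != len(mse_values):
        raise ValueError("need exactly one MSE per estimate")
    weights = inverse_mse_weights(mse_values)
    return sum(w * e for w, e in zip(weights, estimates))


# Hypothetical example: two kernel estimators of the same value.
weights = inverse_mse_weights([1.0, 3.0])
combined = aggregate_estimates([2.0, 4.0], [1.0, 3.0])
```

With mean square errors 1 and 3, the normalized weights are 0.75 and 0.25, so the first, more accurate estimator dominates the combined value.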

0 references

zbMATH Keywords

Bayesian global optimization

0 references

knowledge gradient

0 references

non-parametric estimation

0 references

reviewed by

Ioan M. Stancu-Minasian

0 references

describes a project that uses

BayesDA

0 references

AdaBoost.MH

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1007/s10898-013-0050-5

0 references

cites work

The Continuum-Armed Bandit Problem

0 references

Q5396715

0 references

Q4836494

0 references

Widely Convergent Method for Finding Multiple Solutions of Simultaneous Nonlinear Equations

0 references

Sequential Procedures for Aggregating Arbitrary Estimators of a Conditional Mean

0 references

Monotone Approximation of Decision Problems

0 references

Q3241542

0 references

Q3125064

0 references

A Knowledge-Gradient Policy for Sequential Information Collection

0 references

The Knowledge-Gradient Policy for Correlated Normal Beliefs

0 references

A decision-theoretic generalization of on-line learning and an application to boosting

0 references

Q4845385

0 references

Q3096184

0 references

Q4057976

0 references

Q4197923

0 references

Bayesian look ahead one-stage sampling allocations for selection of the best population

0 references

Q3999325

0 references

Nonparametric and semiparametric models.

0 references

Global optimization of stochastic black-box systems via sequential kriging meta-models

0 references

Functional aggregation for nonparametric regression.

0 references

The Knowledge-Gradient Algorithm for Sequencing Experiments in Drug Discovery

0 references

Approximate Dynamic Programming

0 references

A Stochastic Approximation Method

0 references

The Knowledge Gradient Algorithm for a General Class of Online Learning Problems

0 references

Introduction to Stochastic Search and Optimization

0 references

An informational approach to the global optimization of expensive-to-evaluate functions

0 references

Identifiers

zbMATH Open document ID

1331.90042

0 references

DOI

10.1007/s10898-013-0050-5

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

Sitelinks

Mathematics (1 entry)

mardi Publication:742143

@@ Property / cites work @@
+The Continuum-Armed Bandit Problem
@@ Property / cites work: The Continuum-Armed Bandit Problem / rank @@
+Normal rank
@@ Property / cites work @@
+Q5396715
@@ Property / cites work: Q5396715 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4836494
@@ Property / cites work: Q4836494 / rank @@
+Normal rank
@@ Property / cites work @@
+Widely Convergent Method for Finding Multiple Solutions of Simultaneous Nonlinear Equations
+Normal rank
@@ Property / cites work @@
+Sequential Procedures for Aggregating Arbitrary Estimators of a Conditional Mean
+Normal rank
@@ Property / cites work @@
+Monotone Approximation of Decision Problems
@@ Property / cites work: Monotone Approximation of Decision Problems / rank @@
+Normal rank
@@ Property / cites work @@
+Q3241542
@@ Property / cites work: Q3241542 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3125064
@@ Property / cites work: Q3125064 / rank @@
+Normal rank
@@ Property / cites work @@
+A Knowledge-Gradient Policy for Sequential Information Collection
+Normal rank
@@ Property / cites work @@
+The Knowledge-Gradient Policy for Correlated Normal Beliefs
+Normal rank
@@ Property / cites work @@
+A decision-theoretic generalization of on-line learning and an application to boosting
+Normal rank
@@ Property / cites work @@
+Q4845385
@@ Property / cites work: Q4845385 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3096184
@@ Property / cites work: Q3096184 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4057976
@@ Property / cites work: Q4057976 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4197923
@@ Property / cites work: Q4197923 / rank @@
+Normal rank
@@ Property / cites work @@
+Bayesian look ahead one-stage sampling allocations for selection of the best population
+Normal rank
@@ Property / cites work @@
+Q3999325
@@ Property / cites work: Q3999325 / rank @@
+Normal rank
@@ Property / cites work @@
+Nonparametric and semiparametric models.
@@ Property / cites work: Nonparametric and semiparametric models. / rank @@
+Normal rank
@@ Property / cites work @@
+Global optimization of stochastic black-box systems via sequential kriging meta-models
+Normal rank
@@ Property / cites work @@
+Functional aggregation for nonparametric regression.
+Normal rank
@@ Property / cites work @@
+The Knowledge-Gradient Algorithm for Sequencing Experiments in Drug Discovery
+Normal rank
@@ Property / cites work @@
+Approximate Dynamic Programming
@@ Property / cites work: Approximate Dynamic Programming / rank @@
+Normal rank
@@ Property / cites work @@
+A Stochastic Approximation Method
@@ Property / cites work: A Stochastic Approximation Method / rank @@
+Normal rank
@@ Property / cites work @@
+The Knowledge Gradient Algorithm for a General Class of Online Learning Problems
+Normal rank
@@ Property / cites work @@
+Introduction to Stochastic Search and Optimization
+Normal rank
@@ Property / cites work @@
+An informational approach to the global optimization of expensive-to-evaluate functions
+Normal rank