Optimal learning for sequential sampling with non-parametric beliefs (Q742143)

From MaRDI portal

Jump to:navigation, search

scientific article

Language	Label	Description	Also known as
English	Optimal learning for sequential sampling with non-parametric beliefs	scientific article

Statements

scholarly article

0 references

Optimal learning for sequential sampling with non-parametric beliefs (English)

0 references

zbMATH Open document ID

0 references

10.1007/s10898-013-0050-5

0 references

0 references

Warren B. Powell

0 references

Journal of Global Optimization

0 references

publication date

18 September 2014

0 references

The authors consider the problem of maximizing an unknown function over a finite set of possible alternatives. They propose a sequential learning policy for ranking and selection problems, using a non-parametric procedure for estimating the value of a policy. Their estimation approach aggregates over a set of kernel functions in order to achieve a more consistent estimator. The final estimate uses a weighting scheme with the inverse \ mean square errors of the kernel estimators as weights. This weighting scheme is shown to be optimal under the independent kernel estimators. For choosing the measurement, the authors employ the knowledge gradient policy that relies on predictive distributions to calculate the optimal sampling point. The method allows a setting where the beliefs are expected to be correlated but the correlation structure is unknown beforehand. Moreover, the proposed policy is shown to be asymptotically optimal.

0 references

Mathematics Subject Classification ID

0 references

zbMATH DE Number

0 references

zbMATH Keywords

Bayesian global optimization

0 references

knowledge gradient

0 references

non-parametric estimation

0 references

Ioan M. Stancu-Minasian

0 references

describes a project that uses

0 references

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1007/s10898-013-0050-5

0 references

0 references

The Continuum-Armed Bandit Problem

0 references

0 references

0 references

Widely Convergent Method for Finding Multiple Solutions of Simultaneous Nonlinear Equations

0 references

Sequential Procedures for Aggregating Arbitrary Estimators of a Conditional Mean

0 references

Monotone Approximation of Decision Problems

0 references

0 references

0 references

A Knowledge-Gradient Policy for Sequential Information Collection

0 references

The Knowledge-Gradient Policy for Correlated Normal Beliefs

0 references

A decision-theoretic generalization of on-line learning and an application to boosting

0 references

0 references

0 references

0 references

0 references

Bayesian look ahead one-stage sampling allocations for selection of the best population

0 references

0 references

Nonparametric and semiparametric models.

0 references

Global optimization of stochastic black-box systems via sequential kriging meta-models

0 references

Functional aggregation for nonparametric regression.

0 references

The Knowledge-Gradient Algorithm for Sequencing Experiments in Drug Discovery

0 references

Approximate Dynamic Programming

0 references

A Stochastic Approximation Method

0 references

The Knowledge Gradient Algorithm for a General Class of Online Learning Problems

0 references

Introduction to Stochastic Search and Optimization

0 references

An informational approach to the global optimization of expensive-to-evaluate functions

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:742143

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q742143&oldid=35411971"