Optimal Learning for Stochastic Optimization with Nonlinear Parametric Belief Models
From MaRDI portal
Abstract: We consider the problem of estimating the expected value of information (the knowledge gradient) for Bayesian learning problems where the belief model is nonlinear in the parameters. Our goal is to maximize some metric, while simultaneously learning the unknown parameters of the nonlinear belief model, by guiding a sequential experimentation process which is expensive. We overcome the problem of computing the expected value of an experiment, which is computationally intractable, by using a sampled approximation, which helps to guide experiments but does not provide an accurate estimate of the unknown parameters. We then introduce a resampling process which allows the sampled model to adapt to new information, exploiting past experiments. We show theoretically that the method converges asymptotically to the true parameters, while simultaneously maximizing our metric. We show empirically that the process exhibits rapid convergence, yielding good results with a very small number of experiments.
Recommendations
- Optimal learning for nonlinear parametric belief models over multidimensional continuous spaces
- Optimal learning with a local parametric belief model
- Optimal learning for sequential sampling with non-parametric beliefs
- Optimal learning with non-Gaussian rewards
- Optimal online learning for nonlinear belief models using discrete priors
- Solving stochastic optimization models with learning and rational expectations
- Stochastic learning approach for binary optimization: application to Bayesian optimal design of experiments
- Stochastic optimization on Bayesian nets
Cites work
- scientific article; zbMATH DE number 1906319 (Why is no real title available?)
- A Knowledge-Gradient Policy for Sequential Information Collection
- A Simple Distribution-Free Approach to the Max k-Armed Bandit Problem
- A literature survey of benchmark functions for global optimisation problems
- A survey on metaheuristics for stochastic combinatorial optimization
- Active learning
- Adaptation and tracking in system identification - a survey
- Approximate dynamic programming. Solving the curses of dimensionality
- Bayesian look ahead one-stage sampling allocations for selection of the best population
- Convergence results for single-step on-policy reinforcement-learning algorithms
- Efficient global optimization of expensive black-box functions
- Finite-time analysis of the multiarmed bandit problem
- Handbooks in operations research and management science: Simulation
- Introduction to Stochastic Search and Optimization
- Iterative learning control for deterministic systems
- Learning the demand function in a repeated Cournot oligopoly game
- Multi-armed bandit allocation indices. With a foreword by Peter Whittle.
- Nonlinear oligopolies. Stability and bifurcations
- On stable learning in dynamic oligopolies
- On the solution of stochastic optimization and variational problems in imperfect information regimes
- Optimal learning in experimental design using the knowledge gradient policy with application to characterizing nanoemulsion stability
- Simulation budget allocation for further enhancing the efficiency of ordinal optimization
- The knowledge-gradient algorithm for sequencing experiments in drug discovery
- The knowledge-gradient policy for correlated normal beliefs
Cited in
(8)- Optimal learning with local nonlinear parametric models over continuous designs
- Optimal learning in linear regression with combinatorial feature selection
- On ε-Optimality of the Pursuit Learning Algorithm
- Optimal learning for sequential sampling with non-parametric beliefs
- Optimal learning with a local parametric belief model
- Optimal learning for nonlinear parametric belief models over multidimensional continuous spaces
- Optimal online learning for nonlinear belief models using discrete priors
- Optimal learning in experimental design using the knowledge gradient policy with application to characterizing nanoemulsion stability
This page was built for publication: Optimal Learning for Stochastic Optimization with Nonlinear Parametric Belief Models
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4586173)