Optimal learning with non-Gaussian rewards
Publication:2806349
Cites work
- scientific article; zbMATH DE number 5016447
- scientific article; zbMATH DE number 605729
- scientific article; zbMATH DE number 1547390
- scientific article; zbMATH DE number 1402217
- scientific article; zbMATH DE number 6193745
- scientific article; zbMATH DE number 3215021
- scientific article; zbMATH DE number 3357742
- A Knowledge-Gradient Policy for Sequential Information Collection
- A generalized Gittins index for a class of multiarmed bandits with general resource requirements
- Bandit problems with Lévy processes
- Comparison methods for stochastic models and risks
- Conditional Lévy processes
- Consistency of sequential Bayesian sampling policies
- Continuous multi-armed bandits and multiparameter processes
- Convergence of values in optimal stopping and convergence of optimal stopping times
- Convergence properties of the expected improvement algorithm with fixed mean and covariance functions
- Discrete multiarmed bandits and multiparameter processes
- Dynamic allocation problems in continuous time
- Dynamic assortment with demand learning for seasonal consumer goods
- Dynamic pricing with a prior on market response
- Explicit Gittins Indices for a Class of Superdiffusive Processes
- Finite-time analysis of the multiarmed bandit problem
- How Does the Value Function of a Markov Decision Process Depend on the Transition Probabilities?
- Introductory lectures on fluctuations of Lévy processes with applications.
- Lévy bandits: Multi-armed bandits driven by Lévy processes
- Multi-armed bandit allocation indices. With a foreword by Peter Whittle.
- On optimal stopping and free boundary problems
- Optimal investment and consumption with stochastic dividends
- Optimal learning and experimentation in bandit problems.
- Optimal learning for sequential sampling with non-parametric beliefs
- Probability and stochastics.
- Processes that can be embedded in Brownian motion
- Properties of the Gittins index with application to optimal scheduling
- Sequential testing problems for Lévy processes
- Stalking information: Bayesian inventory management with unobserved lost sales
- Sur l'approximation des réduites. (On the approximation of residues)
- The Multi-Armed Bandit Problem: Decomposition and Computation
- The knowledge gradient algorithm for a general class of online learning problems
- The learning component of dynamic allocation indices
Cited in (11)
- Undiscounted bandit games
- Optimal learning with Q-aggregation
- The ratio index for budgeted learning, with applications
- Optimal Learning for Stochastic Optimization with Nonlinear Parametric Belief Models
- ON THE IDENTIFICATION AND MITIGATION OF WEAKNESSES IN THE KNOWLEDGE GRADIENT POLICY FOR MULTI-ARMED BANDITS
- Lévy bandits under Poissonian decision times
- A Framework of Learning Through Empirical Gain Maximization
- Lévy bandits: Multi-armed bandits driven by Lévy processes
- Learning Preferences Under Noise and Loss Aversion: An Optimization Approach
- ε-Optimal nonlinear reinforcement scheme under a nonstationary multiteacher environment
- Optimal stopping problems in Lévy models with random observations