Optimal learning with non-Gaussian rewards (Q2806349): Difference between revisions

From MaRDI portal
Import240304020342 (talk | contribs)
Set profile property.
Import241208061232 (talk | contribs)
Normalize DOI.
 
(2 intermediate revisions by 2 users not shown)
Property / DOI
 
Property / DOI: 10.1017/apr.2015.9 / rank
Normal rank
 
Property / OpenAlex ID
 
Property / OpenAlex ID: W2297707299 / rank
 
Normal rank
Property / cites work
 
Property / cites work: PROPERTIES OF THE GITTINS INDEX WITH APPLICATION TO OPTIMAL SCHEDULING / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal learning and experimentation in bandit problems. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sequential Testing Problems for Lévy Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Dynamic Assortment with Demand Learning for Seasonal Consumer Goods / rank
 
Normal rank
Property / cites work
 
Property / cites work: Conditional Lévy processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Probability and Stochastics / rank
 
Normal rank
Property / cites work
 
Property / cites work: Bandit Problems with Lévy Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergence of values in optimal stopping and convergence of optimal stopping times / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5631860 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5342182 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Dynamic allocation problems in continuous time / rank
 
Normal rank
Property / cites work
 
Property / cites work: Dynamic Pricing with a Prior on Market Response / rank
 
Normal rank
Property / cites work
 
Property / cites work: Explicit Gittins Indices for a Class of Superdiffusive Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Consistency of Sequential Bayesian Sampling Policies / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Knowledge-Gradient Policy for Sequential Information Collection / rank
 
Normal rank
Property / cites work
 
Property / cites work: The learning component of dynamic allocation indices / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multi‐Armed Bandit Allocation Indices / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Generalized Gittins Index for a Class of Multiarmed Bandits with General Resource Requirements / rank
 
Normal rank
Property / cites work
 
Property / cites work: Lévy bandits: Multi-armed bandits driven by Lévy processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Multi-Armed Bandit Problem: Decomposition and Computation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Introductory lectures on fluctuations of Lévy processes with applications. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sur l'approximation des réduites. (On the approximation of residues) / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stalking Information: Bayesian Inventory Management with Unobserved Lost Sales / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discrete multiarmed bandits and multiparameter processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Continuous multi-armed bandits and multiparameter processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Processes that can be embedded in Brownian motion / rank
 
Normal rank
Property / cites work
 
Property / cites work: How Does the Value Function of a Markov Decision Process Depend on the Transition Probabilities? / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2778807 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3378055 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal learning for sequential sampling with non-parametric beliefs / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Knowledge Gradient Algorithm for a General Class of Online Learning Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4937701 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4301147 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4521614 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On optimal stopping and free boundary problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergence properties of the expected improvement algorithm with fixed mean and covariance functions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal investment and consumption with stochastic dividends / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5326975 / rank
 
Normal rank
Property / DOI
 
Property / DOI: 10.1017/APR.2015.9 / rank
 
Normal rank

Latest revision as of 23:13, 19 December 2024

scientific article
Language: English
Label: Optimal learning with non-Gaussian rewards
Description: scientific article

    Statements

    Optimal learning with non-Gaussian rewards (English)
    17 May 2016
    optimal learning
    Gittins indices
    multi-armed bandit
    optimal stopping
    Lévy process
    non-Gaussian rewards
    probabilistic interpolation
    partial integro-differential equation
