Optimal anytime regret with two experts (Q6062702): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
Property / cites work
 
Property / cites work: Minimax option pricing meets black-scholes in the limit / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2913806 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite-time 4-expert prediction problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the asymptotic optimality of the comb strategy for prediction with expert advice / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5651973 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4003876 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal learning and experimentation in bandit problems. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Analysis of two gradient-based algorithms for on-line regression / rank
 
Normal rank
Property / cites work
 
Property / cites work: How to use expert advice / rank
 
Normal rank
Property / cites work
 
Property / cites work: Prediction, Learning, and Games / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5606230 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the \(L^p\) norms of stochastic integrals and other martingales / rank
 
Normal rank
Property / cites work
 
Property / cites work: Online trading algorithms and robust option pricing / rank
 
Normal rank
Property / cites work
 
Property / cites work: Brownian motion hitting probabilities for general two-sided square-root boundaries / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3342857 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Prediction with expert advice: a PDE perspective / rank
 
Normal rank
Property / cites work
 
Property / cites work: Probability / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5538132 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A random walk analogue of Lévy’s Theorem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4320535 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Towards Optimal Algorithms for Prediction with Expert Advice / rank
 
Normal rank
Property / cites work
 
Property / cites work: Tight Lower Bounds for Multiplicative Weights Algorithmic Families / rank
 
Normal rank
Property / cites work
 
Property / cites work: A conditioned limit theorem for random walk and Brownian local time on square root boundaries / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2744679 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3245635 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5739092 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Probability theory. A comprehensive course. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Ito's formula for a random walk / rank
 
Normal rank
Property / cites work
 
Property / cites work: The weighted majority algorithm / rank
 
Normal rank
Property / cites work
 
Property / cites work: Primal-dual subgradient methods for convex problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Hausdorff dimension of the Brownian slow points / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3378055 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On a Property of Real Plane Curves of Even Degree / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4303982 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4508926 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Online Learning and Online Convex Optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: A First Passage Problem for the Wiener Process / rank
 
Normal rank
Property / cites work
 
Property / cites work: Probability with Martingales / rank
 
Normal rank

Revision as of 12:23, 3 August 2024

scientific article; zbMATH DE number 7761502
Language Label Description Also known as
English
Optimal anytime regret with two experts
scientific article; zbMATH DE number 7761502

    Statements

    Optimal anytime regret with two experts (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    6 November 2023
    0 references
    Summary: We consider the classical problem of prediction with expert advice. In the fixedtime setting, where the time horizon is known in advance, algorithms that achieve the optimal regret are known when there are two, three, or four experts or when the number of experts is large. Much less is known about the problem in the anytime setting, where the time horizon is \textit{not} known in advance. No minimax optimal algorithm was previously known in the anytime setting, regardless of the number of experts. Even for the case of two experts, Luo and Schapire have left open the problem of determining the optimal algorithm. We design the first minimax optimal algorithm for minimizing regret in the anytime setting. We consider the case of two experts, and prove that the optimal regret is \(\gamma \sqrt{t}/2\) at all time steps \(t\), where \(\gamma\) is a natural constant that arose 35 years ago in studying fundamental properties of Brownian motion. The algorithm is designed by considering a continuous analog of the regret problem, which is solved using ideas from stochastic calculus.
    0 references
    prediction with expert advice
    0 references
    online learning
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references