Optimal anytime regret with two experts (Q6062702): Difference between revisions

Summary: We consider the classical problem of prediction with expert advice. In the fixedtime setting, where the time horizon is known in advance, algorithms that achieve the optimal regret are known when there are two, three, or four experts or when the number of experts is large. Much less is known about the problem in the anytime setting, where the time horizon is \textit{not} known in advance. No minimax optimal algorithm was previously known in the anytime setting, regardless of the number of experts. Even for the case of two experts, Luo and Schapire have left open the problem of determining the optimal algorithm. We design the first minimax optimal algorithm for minimizing regret in the anytime setting. We consider the case of two experts, and prove that the optimal regret is \(\gamma \sqrt{t}/2\) at all time steps \(t\), where \(\gamma\) is a natural constant that arose 35 years ago in studying fundamental properties of Brownian motion. The algorithm is designed by considering a continuous analog of the regret problem, which is solved using ideas from stochastic calculus.

0 references

zbMATH Keywords

prediction with expert advice

0 references

online learning

0 references

MaRDI profile type

Publication

0 references

cites work

Minimax option pricing meets black-scholes in the limit

0 references

Q2913806

0 references

Finite-time 4-expert prediction problem

0 references

On the asymptotic optimality of the comb strategy for prediction with expert advice

0 references

Q5651973

0 references

Q4003876

0 references

Optimal learning and experimentation in bandit problems.

0 references

Analysis of two gradient-based algorithms for on-line regression

0 references

How to use expert advice

0 references

Prediction, Learning, and Games

0 references

Q5606230

0 references

On the \(L^p\) norms of stochastic integrals and other martingales

0 references

Online trading algorithms and robust option pricing

0 references

Brownian motion hitting probabilities for general two-sided square-root boundaries

0 references

Q3342857

0 references

Prediction with expert advice: a PDE perspective

0 references

Probability

0 references

Q5538132

0 references

A random walk analogue of Lévy’s Theorem

0 references

Q4320535

0 references

Towards Optimal Algorithms for Prediction with Expert Advice

0 references

Tight Lower Bounds for Multiplicative Weights Algorithmic Families

0 references

A conditioned limit theorem for random walk and Brownian local time on square root boundaries

0 references

Q2744679

0 references

Q3245635

0 references

Q5739092

0 references

Probability theory. A comprehensive course.

0 references

Ito's formula for a random walk

0 references

The weighted majority algorithm

0 references

Primal-dual subgradient methods for convex problems

0 references

On the Hausdorff dimension of the Brownian slow points

0 references

Q3378055

0 references

On a Property of Real Plane Curves of Even Degree

0 references

Q4303982

0 references

Q4508926

0 references

Online Learning and Online Convex Optimization

0 references

A First Passage Problem for the Wiener Process

0 references

Probability with Martingales

0 references

Identifiers

0 references

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:6062702

@@ Property / cites work @@
+Minimax option pricing meets black-scholes in the limit
+Normal rank
@@ Property / cites work @@
+Q2913806
@@ Property / cites work: Q2913806 / rank @@
+Normal rank
@@ Property / cites work @@
+Finite-time 4-expert prediction problem
@@ Property / cites work: Finite-time 4-expert prediction problem / rank @@
+Normal rank
@@ Property / cites work @@
+On the asymptotic optimality of the comb strategy for prediction with expert advice
+Normal rank
@@ Property / cites work @@
+Q5651973
@@ Property / cites work: Q5651973 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4003876
@@ Property / cites work: Q4003876 / rank @@
+Normal rank
@@ Property / cites work @@
+Optimal learning and experimentation in bandit problems.
+Normal rank
@@ Property / cites work @@
+Analysis of two gradient-based algorithms for on-line regression
+Normal rank
@@ Property / cites work @@
+How to use expert advice
@@ Property / cites work: How to use expert advice / rank @@
+Normal rank
@@ Property / cites work @@
+Prediction, Learning, and Games
@@ Property / cites work: Prediction, Learning, and Games / rank @@
+Normal rank
@@ Property / cites work @@
+Q5606230
@@ Property / cites work: Q5606230 / rank @@
+Normal rank
@@ Property / cites work @@
+On the \(L^p\) norms of stochastic integrals and other martingales
+Normal rank
@@ Property / cites work @@
+Online trading algorithms and robust option pricing
+Normal rank
@@ Property / cites work @@
+Brownian motion hitting probabilities for general two-sided square-root boundaries
+Normal rank
@@ Property / cites work @@
+Q3342857
@@ Property / cites work: Q3342857 / rank @@
+Normal rank
@@ Property / cites work @@
+Prediction with expert advice: a PDE perspective
@@ Property / cites work: Prediction with expert advice: a PDE perspective / rank @@
+Normal rank
@@ Property / cites work @@
+Probability
@@ Property / cites work: Probability / rank @@
+Normal rank
@@ Property / cites work @@
+Q5538132
@@ Property / cites work: Q5538132 / rank @@
+Normal rank
@@ Property / cites work @@
+A random walk analogue of Lévy’s Theorem
@@ Property / cites work: A random walk analogue of Lévy’s Theorem / rank @@
+Normal rank
@@ Property / cites work @@
+Q4320535
@@ Property / cites work: Q4320535 / rank @@
+Normal rank
@@ Property / cites work @@
+Towards Optimal Algorithms for Prediction with Expert Advice
+Normal rank
@@ Property / cites work @@
+Tight Lower Bounds for Multiplicative Weights Algorithmic Families
+Normal rank
@@ Property / cites work @@
+A conditioned limit theorem for random walk and Brownian local time on square root boundaries
+Normal rank
@@ Property / cites work @@
+Q2744679
@@ Property / cites work: Q2744679 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3245635
@@ Property / cites work: Q3245635 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5739092
@@ Property / cites work: Q5739092 / rank @@
+Normal rank
@@ Property / cites work @@
+Probability theory. A comprehensive course.
@@ Property / cites work: Probability theory. A comprehensive course. / rank @@
+Normal rank
@@ Property / cites work @@
+Ito's formula for a random walk
@@ Property / cites work: Ito's formula for a random walk / rank @@
+Normal rank
@@ Property / cites work @@
+The weighted majority algorithm
@@ Property / cites work: The weighted majority algorithm / rank @@
+Normal rank
@@ Property / cites work @@
+Primal-dual subgradient methods for convex problems
+Normal rank
@@ Property / cites work @@
+On the Hausdorff dimension of the Brownian slow points
+Normal rank
@@ Property / cites work @@
+Q3378055
@@ Property / cites work: Q3378055 / rank @@
+Normal rank
@@ Property / cites work @@
+On a Property of Real Plane Curves of Even Degree
@@ Property / cites work: On a Property of Real Plane Curves of Even Degree / rank @@
+Normal rank
@@ Property / cites work @@
+Q4303982
@@ Property / cites work: Q4303982 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4508926
@@ Property / cites work: Q4508926 / rank @@
+Normal rank
@@ Property / cites work @@
+Online Learning and Online Convex Optimization
@@ Property / cites work: Online Learning and Online Convex Optimization / rank @@
+Normal rank
@@ Property / cites work @@
+A First Passage Problem for the Wiener Process
@@ Property / cites work: A First Passage Problem for the Wiener Process / rank @@
+Normal rank
@@ Property / cites work @@
+Probability with Martingales
@@ Property / cites work: Probability with Martingales / rank @@
+Normal rank