Optimal anytime regret with two experts (Q6062702)

From MaRDI portal

Jump to:navigation, search

scientific article; zbMATH DE number 7761502

Language	Label	Description	Also known as
English	Optimal anytime regret with two experts	scientific article; zbMATH DE number 7761502

Statements

scholarly article

0 references

Optimal anytime regret with two experts (English)

0 references

Nicholas J. A. Harvey

0 references

Christopher Liaw

0 references

Edwin A. Perkins

0 references

Sikander Randhawa

0 references

Mathematical Statistics and Learning

0 references

publication date

6 November 2023

0 references

full work available at URL

https://arxiv.org/abs/2002.08994

0 references

Summary: We consider the classical problem of prediction with expert advice. In the fixedtime setting, where the time horizon is known in advance, algorithms that achieve the optimal regret are known when there are two, three, or four experts or when the number of experts is large. Much less is known about the problem in the anytime setting, where the time horizon is \textit{not} known in advance. No minimax optimal algorithm was previously known in the anytime setting, regardless of the number of experts. Even for the case of two experts, Luo and Schapire have left open the problem of determining the optimal algorithm. We design the first minimax optimal algorithm for minimizing regret in the anytime setting. We consider the case of two experts, and prove that the optimal regret is \(\gamma \sqrt{t}/2\) at all time steps \(t\), where \(\gamma\) is a natural constant that arose 35 years ago in studying fundamental properties of Brownian motion. The algorithm is designed by considering a continuous analog of the regret problem, which is solved using ideas from stochastic calculus.

0 references

zbMATH Keywords

prediction with expert advice

0 references

online learning

0 references

MaRDI profile type

MaRDI publication profile

0 references

Minimax option pricing meets black-scholes in the limit

0 references

0 references

Finite-time 4-expert prediction problem

0 references

On the asymptotic optimality of the comb strategy for prediction with expert advice

0 references

0 references

0 references

Optimal learning and experimentation in bandit problems.

0 references

Analysis of two gradient-based algorithms for on-line regression

0 references

How to use expert advice

0 references

Prediction, Learning, and Games

0 references

0 references

On the \(L^p\) norms of stochastic integrals and other martingales

0 references

Online trading algorithms and robust option pricing

0 references

Brownian motion hitting probabilities for general two-sided square-root boundaries

0 references

0 references

Prediction with expert advice: a PDE perspective

0 references

0 references

0 references

A random walk analogue of Lévy’s Theorem

0 references

0 references

Towards Optimal Algorithms for Prediction with Expert Advice

0 references

Tight Lower Bounds for Multiplicative Weights Algorithmic Families

0 references

A conditioned limit theorem for random walk and Brownian local time on square root boundaries

0 references

0 references

0 references

0 references

Probability theory. A comprehensive course.

0 references

Ito's formula for a random walk

0 references

The weighted majority algorithm

0 references

Primal-dual subgradient methods for convex problems

0 references

On the Hausdorff dimension of the Brownian slow points

0 references

0 references

On a Property of Real Plane Curves of Even Degree

0 references

0 references

0 references

Online Learning and Online Convex Optimization

0 references

A First Passage Problem for the Wiener Process

0 references

Probability with Martingales

0 references

Identifiers

0 references

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

zbMATH DE Number

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:6062702

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q6062702&oldid=37574709"