Scale-free online learning
Abstract: We design and analyze algorithms for online linear optimization that achieve optimal regret while requiring no prior upper or lower bounds on the norms of the loss vectors. Our algorithms are instances of the Follow the Regularized Leader (FTRL) and Mirror Descent (MD) meta-algorithms. They adapt to the norms of the loss vectors through scale invariance: each algorithm makes exactly the same decisions if the sequence of loss vectors is multiplied by any positive constant. The FTRL-based algorithm works for any decision set, bounded or unbounded; for unbounded decision sets, it is the first adaptive algorithm for online linear optimization with a non-vacuous regret bound. In contrast, we prove lower bounds for scale-free MD-based algorithms on unbounded domains.
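To make the scale-invariance claim concrete, here is a minimal Python sketch of an unconstrained scale-free FTRL update in the spirit of the paper's SOLO FTRL; the function name and the choice of the Euclidean regularizer are illustrative assumptions, not the paper's exact formulation. At round t it plays w_t = -(sum of past loss vectors) / sqrt(sum of past squared norms), so both numerator and denominator scale linearly with any constant c > 0 applied to the losses, and the decisions are unchanged.

```python
import numpy as np

def solo_ftrl_plays(loss_vectors):
    """Unconstrained scale-free FTRL sketch (Euclidean regularizer).

    Plays w_t = -(sum_{s<t} g_s) / sqrt(sum_{s<t} ||g_s||^2).
    Rescaling all g_s by c > 0 rescales numerator and denominator by the
    same factor c, so the decisions are unchanged (scale invariance).
    """
    d = len(loss_vectors[0])
    grad_sum = np.zeros(d)   # running sum of observed loss vectors
    sq_norms = 0.0           # running sum of squared loss-vector norms
    plays = []
    for g in loss_vectors:
        g = np.asarray(g, dtype=float)
        # Before any nonzero loss has been observed, play the origin.
        w = np.zeros(d) if sq_norms == 0.0 else -grad_sum / np.sqrt(sq_norms)
        plays.append(w)
        grad_sum += g
        sq_norms += float(g @ g)
    return plays

losses = [np.array([1.0, -2.0]), np.array([0.5, 0.5]), np.array([-1.0, 0.0])]
for c in (1.0, 10.0, 0.01):
    print([w.round(6) for w in solo_ftrl_plays([c * g for g in losses])])
# All three printed lines are identical: the plays do not depend on c.
```

The check at the bottom prints the same sequence of plays for every scaling constant c, which is exactly the scale-invariance property described in the abstract.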
Cites work
- scientific article (no title available); zbMATH DE number 3790208
- A decision-theoretic generalization of on-line learning and an application to boosting
- A game of prediction with expert advice
- A generalized online mirror descent with applications to classification and regression
- Adaptive and self-confident on-line learning algorithms
- Adaptive subgradient methods for online learning and stochastic optimization
- Convex optimization: algorithms and complexity
- Dual averaging methods for regularized stochastic learning and online optimization
- Efficient algorithms for online decision problems
- Exponentiated gradient versus gradient descent for linear predictors
- Follow the leader if you can, hedge if you must
- How to use expert advice
- Large margin classification using the perceptron algorithm
- Learning permutations with exponential weights
- Mirror descent and nonlinear projected subgradient methods for convex optimization
- Near-optimal regret bounds for reinforcement learning
- Online learning and online convex optimization
- Prediction, Learning, and Games
- Primal-dual subgradient methods for convex problems
- Regret analysis of stochastic and nonstochastic multi-armed bandit problems
- Scale-free algorithms for online linear optimization
- The weighted majority algorithm
Cited in
- Provably efficient reinforcement learning in decentralized general-sum Markov games
- SOLO FTRL algorithm for production management with transfer prices
- Following the leader and fast rates in online linear prediction: curved constraint sets and other regularities
- Nonstationary online convex optimization with multiple predictions
- A generalized online mirror descent with applications to classification and regression
- Adaptive and optimal online linear regression on \(\ell^1\)-balls
- Scale-free algorithms for online linear optimization
- Online learning via congregational gradient descent
- A modular analysis of adaptive (non-)convex optimization: optimism, composite objectives, and variational bounds
- Online Learning Meets Optimization in the Dual
- A survey of algorithms and analysis for adaptive online learning
- Multi-scale online learning: theory and applications to online auctions and pricing
- A modular analysis of adaptive (non-)convex optimization: optimism, composite objectives, variance reduction, and variational bounds
- Scale-invariant unconstrained online learning
- scientific article (no title available); zbMATH DE number 7415104
- Optimistic optimisation of composite objective with exponentiated update
- Adaptive and self-confident on-line learning algorithms
- Principal component analysis and optimal portfolio