Multilevel stochastic gradient methods for nested composition optimization

DOI10.1137/18M1164846MaRDI QIDQ4629336zbMATH OpenWikidataFDO

Authors Shuoguang Yang, Mengdi Wang, Ethan X. Fang

Publication date 22 March 2019

Published in SIAM Journal on Optimization (Search for Journal in Brave)

Full work available at URL https://arxiv.org/abs/1801.03600

simulation convex optimization stochastic gradient statistical learning stochastic optimization sample complexity

Convex programming (90C25) Online algorithms; streaming algorithms (68W27) Large-scale problems in mathematical programming (90C06) Stochastic programming (90C15)

Abstract: Stochastic gradient methods are scalable for solving large-scale optimization problems that involve empirical expectations of loss functions. Existing results mainly apply to optimization problems where the objectives are one- or two-level expectations. In this paper, we consider the multi-level compositional optimization problem that involves compositions of multi-level component functions and nested expectations over a random path. It finds applications in risk-averse optimization and sequential planning. We propose a class of multi-level stochastic gradient methods that are motivated from the method of multi-timescale stochastic approximation. First we propose a basic

T

-level stochastic compositional gradient algorithm, establish its almost sure convergence and obtain an

n

-iteration error bound

O (n^{- 1 / 2^{T}})

. Then we develop accelerated multi-level stochastic gradient methods by using an extrapolation-interpolation scheme to take advantage of the smoothness of individual component functions. When all component functions are smooth, we show that the convergence rate improves to

O (n^{- 4 / (7 + T)})

for general objectives and

O (n^{- 4 / (3 + T)})

for strongly convex objectives. We also provide almost sure convergence and rate of convergence results for nonconvex problems. The proposed methods and theoretical results are validated using numerical experiments.

Recommendations

Cites work

Cited in

(19)

Describes a project that uses

Uses Software

This page was built for publication: Multilevel stochastic gradient methods for nested composition optimization

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4629336)