Learning theory estimates with observations from general stationary stochastic processes

DOI10.1162/NECO_A_00870zbMATH Open1476.68229arXiv1605.02887OpenAlexW2963651118WikidataQ50499957 ScholiaQ50499957MaRDI QIDQ5380606FDOQ5380606

Authors: Hanyuan Hang, Yun-Long Feng, Ingo Steinwart, Johan A. K. Suykens

Publication date: 5 June 2019

Published in: Neural Computation (Search for Journal in Brave)

Abstract: This paper investigates the supervised learning problem with observations drawn from certain general stationary stochastic processes. Here by emph{general}, we mean that many stationary stochastic processes can be included. We show that when the stochastic processes satisfy a generalized Bernstein-type inequality, a unified treatment on analyzing the learning schemes with various mixing processes can be conducted and a sharp oracle inequality for generic regularized empirical risk minimization schemes can be established. The obtained oracle inequality is then applied to derive convergence rates for several learning schemes such as empirical risk minimization (ERM), least squares support vector machines (LS-SVMs) using given generic kernels, and SVMs using Gaussian kernels for both least squares and quantile regression. It turns out that for i.i.d.~processes, our learning rates for ERM recover the optimal rates. On the other hand, for non-i.i.d.~processes including geometrically

a l p h a

-mixing Markov processes, geometrically

a l p h a

-mixing processes with restricted decay,

p h i

-mixing processes, and (time-reversed) geometrically

m a t h c a l C

-mixing processes, our learning rates for SVMs with Gaussian kernels match, up to some arbitrarily small extra term in the exponent, the optimal rates. For the remaining cases, our rates are at least close to the optimal rates. As a by-product, the assumed generalized Bernstein-type inequality also provides an interpretation of the so-called "effective number of observations" for various mixing processes.

Full work available at URL: https://arxiv.org/abs/1605.02887

Recommendations

Mathematics Subject Classification ID

Nonparametric estimation (62G05) Classification and discrimination; cluster analysis (statistical aspects) (62H30) Learning and adaptive systems in artificial intelligence (68T05) Stationary stochastic processes (60G10)

Cites Work

Cited In (12)

This page was built for publication: Learning theory estimates with observations from general stationary stochastic processes

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5380606)