Local Rademacher complexities and oracle inequalities in risk minimization. (2004 IMS Medallion Lecture). (With discussions and rejoinder)
From MaRDI portal
Publication:2373576
Mathematics Subject Classification:
- Nonparametric regression and quantile regression (62G08)
- Classification and discrimination; cluster analysis (statistical aspects) (62H30)
- Learning and adaptive systems in artificial intelligence (68T05)
- Pattern recognition, speech recognition (68T10)
- Computational learning theory (68Q32)
- Probability theory on algebraic and topological structures (60B99)
Abstract: Let \(\mathcal{F}\) be a class of measurable functions defined on a probability space \((S,\mathcal{A},P)\). Given a sample \((X_1,\dots,X_n)\) of i.i.d. random variables taking values in \(S\) with common distribution \(P\), let \(P_n\) denote the empirical measure based on \((X_1,\dots,X_n)\). We study the empirical risk minimization problem \(P_n f \to \min\), \(f\in\mathcal{F}\). Given a solution \(\hat{f}_n\) of this problem, the goal is to obtain very general upper bounds on its excess risk \[\mathcal{E}_P(\hat{f}_n) := P\hat{f}_n - \inf_{f\in\mathcal{F}} Pf,\] expressed in terms of relevant geometric parameters of the class \(\mathcal{F}\). Using concentration inequalities and other empirical process tools, we obtain both distribution-dependent and data-dependent upper bounds on the excess risk that are of asymptotically correct order in many examples. The bounds involve localized sup-norms of empirical and Rademacher processes indexed by functions from the class. We use these bounds to develop model selection techniques in abstract risk minimization problems that can be applied to more specialized frameworks of regression and classification.
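To make the abstract's objects concrete, the following sketch simulates empirical risk minimization over a small finite class and estimates the resulting excess risk \(\mathcal{E}_P(\hat f_n) = P\hat f_n - \inf_{f\in\mathcal{F}} Pf\) by Monte Carlo. The toy class of threshold classifiers, the sample sizes, and the label-noise level are illustrative assumptions, not constructions from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical finite class F: 0-1 losses of threshold classifiers
# x -> sign(x - t). This is an illustrative stand-in for the abstract
# function class in the paper, not the paper's own construction.
thresholds = np.linspace(-2.0, 2.0, 41)

# Sample X_1,...,X_n i.i.d. from P: x ~ N(0,1), label y = sign(x)
# flipped with probability 0.2.
n = 500
x = rng.normal(size=n)
y = np.where(rng.random(n) < 0.8, np.sign(x), -np.sign(x))

def empirical_risk(t):
    """P_n f_t: average 0-1 loss of the classifier sign(x - t)."""
    return np.mean(np.sign(x - t) != y)

# Empirical risk minimization: hat{f}_n = argmin_{f in F} P_n f.
risks = np.array([empirical_risk(t) for t in thresholds])
t_hat = thresholds[np.argmin(risks)]

# Excess risk E_P(hat{f}_n) = P hat{f}_n - inf_{f in F} P f,
# with P f approximated on a large fresh sample from P.
m = 200_000
x_test = rng.normal(size=m)
y_test = np.where(rng.random(m) < 0.8, np.sign(x_test), -np.sign(x_test))

def true_risk(t):
    return np.mean(np.sign(x_test - t) != y_test)

true_risks = np.array([true_risk(t) for t in thresholds])
excess = true_risk(t_hat) - true_risks.min()
print(f"t_hat = {t_hat:.2f}, estimated excess risk = {excess:.4f}")
```

The paper's bounds control how fast this excess risk shrinks with \(n\), in terms of localized Rademacher complexities of the class rather than Monte Carlo evaluation.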
Cites Work
- scientific article; zbMATH DE number 2089352
- scientific article; zbMATH DE number 2089354
- scientific article; zbMATH DE number 5654889
- scientific article; zbMATH DE number 49190
- scientific article; zbMATH DE number 1332320
- scientific article; zbMATH DE number 1064642
- scientific article; zbMATH DE number 2034518
- scientific article; zbMATH DE number 1552503
- scientific article; zbMATH DE number 3446442
- scientific article; zbMATH DE number 893887
- 10.1162/1532443041424319
- A Bennett concentration inequality and its application to suprema of empirical processes
- A distribution-free theory of nonparametric regression
- A new look at independence
- A sharp concentration inequality with applications
- An empirical process approach to the uniform consistency of kernel-type function estimators
- Bounding the generalization error of convex combinations of classifiers: Balancing the dimensionality and the margins.
- Complexities of convex combinations and bounding the generalization error in classification
- Complexity regularization via localized random penalties
- Concentration inequalities and asymptotic results for ratio type empirical processes
- Consistency of Support Vector Machines and Other Regularized Kernel Classifiers
- Convergence rate of sieve estimates
- Convexity, Classification, and Risk Bounds
- Efficient agnostic learning of neural networks with bounded fan-in
- Empirical margin distributions and bounding the generalization error of combined classifiers
- Empirical minimization
- Improving the sample complexity using global data
- Inequalities for uniform deviations of averages from expectations with applications to nonparametric regression
- Left concentration inequalities for empirical processes
- Local Rademacher complexities
- Model selection and error estimation
- Model selection for regression on a random design
- Moment inequalities for functions of independent random variables
- Neural Network Learning
- New concentration inequalities in product spaces
- On consistency of kernel density estimators for randomly censored data: Rates holding uniformly over adaptive intervals
- On the Bayes-risk consistency of regularized boosting methods.
- Optimal aggregation of classifiers in statistical learning.
- Oracle inequalities and nonparametric function estimation
- Rademacher penalties and structural risk minimization
- Risk bounds for model selection via penalization
- Sharper bounds for Gaussian and empirical processes
- Smooth discrimination analysis
- Some applications of concentration inequalities to statistics
- Some limit theorems for empirical processes (with discussion)
- Square root penalty: Adaption to the margin in classification and in edge estimation
- Statistical behavior and consistency of classification methods based on convex risk minimization.
- Statistical performance of support vector machines
- Uniform Central Limit Theorems
- Weak convergence and empirical processes. With applications to statistics
Cited In (first 100 items shown)
- Convergence rates for shallow neural networks learned by gradient descent
- Nonparametric regression using deep neural networks with ReLU activation function
- Aggregation of estimators and stochastic optimization
- Localized Gaussian width of \(M\)-convex hulls with applications to Lasso and convex aggregation
- Suboptimality of constrained least squares and improvements via non-linear predictors
- Relative deviation learning bounds and generalization with unbounded loss functions
- On least squares estimation under heteroscedastic and heavy-tailed errors
- Sample average approximation with heavier tails. I: Non-asymptotic bounds with weak assumptions and stochastic constraints
- Performance guarantees for policy learning
- Convergence rates for empirical barycenters in metric spaces: curvature, convexity and extendable geodesics
- Optimal linear discriminators for the discrete choice model in growing dimensions
- Sample average approximation with heavier tails II: localization in stochastic convex optimization and persistence results for the Lasso
- Local Rademacher complexity-based learning guarantees for multi-task learning
- Concentration inequalities for two-sample rank processes with application to bipartite ranking
- Measuring the capacity of sets of functions in the analysis of ERM
- Robust multicategory support vector machines using difference convex algorithm
- Wild bootstrap inference for penalized quantile regression for longitudinal data
- ERM and RERM are optimal estimators for regression problems when malicious outliers corrupt the labels
- Estimation bounds and sharp oracle inequalities of regularized procedures with Lipschitz loss functions
- Empirical variance minimization with applications in variance reduction and optimal control
- Locally simultaneous inference
- Statistical inference using regularized M-estimation in the reproducing kernel Hilbert space for handling missing data
- Minimax adaptive dimension reduction for regression
- Noisy discriminant analysis with boundary assumptions
- Sample average approximation for stochastic programming with equality constraints
- Measuring distributional asymmetry with Wasserstein distance and Rademacher symmetrization
- Joint regression analysis of mixed-type outcome data via efficient scores
- Surrogate losses in passive and active learning
- Concentration bounds for the empirical angular measure with statistical learning applications
- Optimal robust mean and location estimation via convex programs with respect to any pseudo-norms
- Robust statistical learning with Lipschitz and convex loss functions
- On the minimax optimality and superiority of deep neural network learning over sparse parameter spaces
- Mass volume curves and anomaly ranking
- Fast generalization error bound of deep learning without scale invariance of activation functions
- Set structured global empirical risk minimizers are rate optimal in general dimensions
- Complexity versus agreement for many views. Co-regularization for multi-view semi-supervised learning
- Fast rates for general unbounded loss functions: from ERM to generalized Bayes
- Complex sampling designs: uniform limit theorems and applications
- Inference on covariance operators via concentration inequalities: \(k\)-sample tests, classification, and clustering via Rademacher complexities
- Solving PDEs on spheres with physics-informed convolutional neural networks
- Bandwidth selection in kernel empirical risk minimization via the gradient
- A moment-matching approach to testable learning and a new characterization of Rademacher complexity
- Discussion of "On concentration for (regularized) empirical risk minimization" by Sara van de Geer and Martin Wainwright
- Deep learning: a statistical viewpoint
- Multiplier \(U\)-processes: sharp bounds and applications
- On the optimality of the empirical risk minimization procedure for the convex aggregation problem
- Optimal upper and lower bounds for the true and empirical excess risks in heteroscedastic least-squares regression
- Optimal model selection in heteroscedastic regression using piecewise polynomial functions
- Gibbs posterior concentration rates under sub-exponential type losses
- Nonasymptotic analysis of robust regression with modified Huber's loss
- From Gauss to Kolmogorov: localized measures of complexity for ellipses
- Robust supervised learning with coordinate gradient descent
- On the optimality of sample-based estimates of the expectation of the empirical minimizer
- Concentration inequalities for samples without replacement
- A universal procedure for aggregating estimators
- Bayesian fractional posteriors
- Learning Theory
- Rademacher penalties and structural risk minimization
- Parametric or nonparametric? A parametricness index for model selection
- Oracle inequalities in empirical risk minimization and sparse recovery problems. École d'Été de Probabilités de Saint-Flour XXXVIII-2008.
- Fast learning rate of non-sparse multiple kernel learning and optimal regularization strategies
- Local learning estimates by integral operators
- Fast learning rates in statistical inference through aggregation
- Sampling and empirical risk minimization
- Rho-estimators revisited: general theory and applications
- Tests and estimation strategies associated to some loss functions
- Tikhonov, Ivanov and Morozov regularization for support vector machine learning
- A no-free-lunch theorem for multitask learning
- Model selection by resampling penalization
- A high-dimensional Wilks phenomenon
- Empirical minimization
- Oracle inequalities for cross-validation type procedures
- A new method for estimation and model selection: \(\rho\)-estimation
- Singularity, misspecification and the convergence rate of EM
- Convergence rates of least squares regression estimators with heavy-tailed errors
- Square root penalty: Adaption to the margin in classification and in edge estimation
- Adaptive estimation of a distribution function and its density in sup-norm loss by wavelet and spline projections
- Sharper lower bounds on the performance of the empirical risk minimization algorithm
- Global uniform risk bounds for wavelet deconvolution estimators
- Risk bounds for CART classifiers under a margin condition
- Margin-adaptive model selection in statistical learning
- Honest confidence sets in nonparametric IV regression and other ill-posed models
- A local Vapnik-Chervonenkis complexity
- Nonasymptotic bounds for vector quantization in Hilbert spaces
- Aggregation for Gaussian regression
- Fast learning rates for plug-in classifiers
- Local Rademacher complexities
- Random design analysis of ridge regression
- Classifiers of support vector machine type with \(\ell_1\) complexity regularization
- Rademacher complexity for Markov chains: applications to kernel smoothing and Metropolis-Hastings
- An elementary analysis of ridge regression with random design
- Compressive statistical learning with random feature moments
- Concentration inequalities and confidence bands for needlet density estimators on compact homogeneous manifolds