Doubly penalized estimation in additive regression with high-dimensional data (Q2328052)

Language: English
Label: Doubly penalized estimation in additive regression with high-dimensional data
Description: scientific article

    Statements

    Doubly penalized estimation in additive regression with high-dimensional data (English)
    Publication date: 9 October 2019
    The authors consider high-dimensional nonparametric additive regression: given independent observations \((X_1,Y_1),\ldots,(X_n,Y_n)\), where each \(Y_i\in\mathbb{R}\) is a response variable and each \(X_i\in\mathbb{R}^d\) is a vector of covariates, consider the model \(Y_i=g^*(X_i)+\varepsilon_i\), where \[ g^*(x)=\sum_{j=1}^p g_j^*\left(x^{(j)}\right), \] each \(\varepsilon_i\) is a noise term and, for each \(j\), \(x^{(j)}\) is a vector formed from a (small, possibly overlapping) subset of the components of \(x\in\mathbb{R}^d\), with \(p\) possibly larger than \(n\). The class of estimators of \(g^*\) studied has two penalty components: one using an empirical \(L_2\) norm to induce sparsity of the estimator, and another using functional semi-norms to induce smoothness.

    The main results of the paper are oracle inequalities for predictive performance in this setting, giving upper bounds on the penalized predictive loss for both fixed and random designs. In the fixed-design setting, new observations have covariates drawn from the sample \((X_1,\ldots,X_n)\), whereas in the random-design setting covariates are drawn from the distributions of the \(X_i\). These oracle inequalities are established under sub-Gaussian tail assumptions on the noise, an entropy condition on the relevant functional classes, and an empirical compatibility condition. In the random-design setting, this sample compatibility condition may be replaced by a population compatibility condition together with a condition ensuring convergence of empirical norms. Compared to existing results in the literature, these conditions are weaker and the resulting inequalities give better rates of convergence. The framework is flexible in that it allows a decoupling of the sparsity and smoothness conditions. The authors treat the special cases of Sobolev and bounded variation spaces, where the explicit convergence rates obtained from the oracle inequalities are shown to match minimax lower bounds, and also give results on convergence of empirical norms that may be of independent interest.
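    For concreteness, a schematic form of such a doubly penalized estimator is the following; the tuning parameters \(\lambda\) and \(\mu\) and the semi-norms \(\rho_j\) are generic placeholders used for illustration and need not match the paper's exact notation: \[ \hat g \in \arg\min_{g=\sum_{j=1}^p g_j}\left\{\frac{1}{n}\sum_{i=1}^n\left(Y_i-g(X_i)\right)^2+\sum_{j=1}^p\left(\lambda\,\|g_j\|_n+\mu\,\rho_j(g_j)\right)\right\}, \] where \(\|g_j\|_n=\left(n^{-1}\sum_{i=1}^n g_j(X_i^{(j)})^2\right)^{1/2}\) is the empirical \(L_2\) norm of the \(j\)-th component and \(\rho_j\) is a functional semi-norm (for instance a Sobolev semi-norm, or a total variation penalty as in trend filtering). The two terms play distinct roles: the empirical-norm penalty can set whole components \(g_j\) to zero (sparsity), while the semi-norm penalty controls the roughness of the components that remain (smoothness), which is what permits the decoupling of the sparsity and smoothness conditions noted above.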
    additive model
    bounded variation space
    ANOVA model
    high-dimensional data
    metric entropy
    penalized estimation
    reproducing kernel Hilbert space
    Sobolev space
    total variation
    trend filtering
