Post selection shrinkage estimation for high-dimensional data analysis

DOI10.1002/ASMB.2193MaRDI QIDQ4620187zbMATH OpenOpenAlexFDO

Authors Xiaoli Gao, Yang Feng, S. Ejaz Ahmed

Publication date 8 February 2019

Published in Applied Stochastic Models in Business and Industry (Search for Journal in Brave)

Full work available at URL https://arxiv.org/abs/1603.07277, http://libres.uncg.edu/ir/uncg/f/X_Gao_Post_2017.pdf

Lasso ridge regression asymptotic risk sparse model post selection (positive) shrinkage estimation

Time series, auto-correlation, regression, etc. in statistics (GARCH) (62M10) Ridge regression; shrinkage estimators (Lasso) (62J07)

Abstract: In high-dimensional data settings where

p g g n

, many penalized regularization approaches were studied for simultaneous variable selection and estimation. However, with the existence of covariates with weak effect, many existing variable selection methods, including Lasso and its generations, cannot distinguish covariates with weak and no contribution. Thus, prediction based on a subset model of selected covariates only can be inefficient. In this paper, we propose a post selection shrinkage estimation strategy to improve the prediction performance of a selected subset model. Such a post selection shrinkage estimator (PSE) is data-adaptive and constructed by shrinking a post selection weighted ridge estimator in the direction of a selected candidate subset. Under an asymptotic distributional quadratic risk criterion, its prediction performance is explored analytically. We show that the proposed post selection PSE performs better than the post selection weighted ridge estimator. More importantly, it improves the prediction performance of any candidate subset model selected from most existing Lasso-type variable selection methods significantly. The relative performance of the post selection PSE is demonstrated by both simulation studies and real data analysis.

Recommendations

Cited in

(15)

This page was built for publication: Post selection shrinkage estimation for high-dimensional data analysis

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4620187)