A survey of cross-validation procedures for model selection
From MaRDI portal
(Redirected from Publication:975579)
Abstract: Used to estimate the risk of an estimator or to perform model selection, cross-validation is a widespread strategy because of its simplicity and its apparent universality. Many results exist on the model selection performances of cross-validation procedures. This survey intends to relate these results to the most recent advances of model selection theory, with a particular emphasis on distinguishing empirical statements from rigorous theoretical results. As a conclusion, guidelines are provided for choosing the best cross-validation procedure according to the particular features of the problem in hand.
Recommendations
Cites work
- scientific article; zbMATH DE number 3174053 (Why is no real title available?)
- scientific article; zbMATH DE number 3860199 (Why is no real title available?)
- scientific article; zbMATH DE number 3928119 (Why is no real title available?)
- scientific article; zbMATH DE number 3949528 (Why is no real title available?)
- scientific article; zbMATH DE number 3789676 (Why is no real title available?)
- scientific article; zbMATH DE number 20176 (Why is no real title available?)
- scientific article; zbMATH DE number 3483405 (Why is no real title available?)
- scientific article; zbMATH DE number 3591259 (Why is no real title available?)
- scientific article; zbMATH DE number 1332320 (Why is no real title available?)
- scientific article; zbMATH DE number 597913 (Why is no real title available?)
- scientific article; zbMATH DE number 1034037 (Why is no real title available?)
- scientific article; zbMATH DE number 2062404 (Why is no real title available?)
- scientific article; zbMATH DE number 1522808 (Why is no real title available?)
- scientific article; zbMATH DE number 3441460 (Why is no real title available?)
- scientific article; zbMATH DE number 3444596 (Why is no real title available?)
- scientific article; zbMATH DE number 3446442 (Why is no real title available?)
- scientific article; zbMATH DE number 835699 (Why is no real title available?)
- scientific article; zbMATH DE number 845714 (Why is no real title available?)
- scientific article; zbMATH DE number 893887 (Why is no real title available?)
- scientific article; zbMATH DE number 5056254 (Why is no real title available?)
- scientific article; zbMATH DE number 3266204 (Why is no real title available?)
- scientific article; zbMATH DE number 3279684 (Why is no real title available?)
- scientific article; zbMATH DE number 3366380 (Why is no real title available?)
- scientific article; zbMATH DE number 3374797 (Why is no real title available?)
- scientific article; zbMATH DE number 3053501 (Why is no real title available?)
- 10.1162/153244302760200704
- A comparative study of ordinary cross-validation, v-fold cross-validation and the repeated learning-testing methods
- A cross-validatory method for dependent data
- A distribution-free theory of nonparametric regression
- A local cross-validation algorithm
- A predictive approach to the random effect model
- A universal prior for integers and estimation by minimum description length
- Adaptive Regression by Mixing
- An asymptotically optimal window selection rule for kernel density estimates
- Analysis of variance of cross-validation estimators of the generalization error
- Approximate efficiency of a selection procedure for the number of regression variables
- Asymptotic comparison of (partial) cross-validation, GCV and randomized GCV in nonparametric regression
- Asymptotic optimality for \(C_ p\), \(C_ L\), cross-validation and generalized cross-validation: Discrete index set
- Asymptotic properties of criteria for selection of variables in multiple regression
- Asymptotics for and against cross-validation
- Bandwidth selection in robust smoothing
- Bayesian model averaging: A tutorial. (with comments and a rejoinder).
- Bootstrap Model Selection
- Can the strengths of AIC and BIC be shared? A conflict between model indentification and regression estimation
- Comparison of two bandwidth selectors with dependent errors
- Concentration inequalities and model selection. Ecole d'Eté de Probabilités de Saint-Flour XXXIII -- 2003.
- Consistency of cross validation for comparing regression procedures
- Consistent cross-validated density estimation
- Cross validation model selection criteria for linear regression based on the Kullback-Leibler discrepancy
- Cross-Validation of Regression Models
- Cross-validation in nonparametric regression with outliers
- DATA-DEPENDENT ESTIMATION OF PREDICTION FUNCTIONS
- Data-driven bandwidth choice for density estimation based on dependent data
- Distribution-free performance bounds for potential function rules
- Estimating the Error Rate of a Prediction Rule: Improvement on Cross-Validation
- Estimating the dimension of a model
- Estimation of dependences based on empirical data. Transl. from the Russian by Samuel Kotz
- Estimation of the conditional risk in classification: the swapping method
- From Stein's unbiased risk estimates to the method of generalized cross- validation
- Gaussian model selection
- Heuristics of instability and stabilization in model selection
- Histogram selection in non Gaussian regression
- How Biased is the Apparent Error Rate of a Prediction Rule?
- How Far Are Automatically Chosen Regression Smoothing Parameters From Their Optimum?
- Improvements on Cross-Validation: The .632+ Bootstrap Method
- Inference for the generalization error
- Kernel Regression Estimation Using Repeated Measurements Data
- Large sample optimality of least squares cross-validation in density estimation
- Least angle and \(\ell _{1}\) penalized regression: a review
- Linear Model Selection by Cross-Validation
- Minimal penalties for Gaussian model selection
- Model Selection and Multimodel Inference
- Model selection and error estimation
- Model selection by resampling penalization
- Model selection for regression on a random design
- Model selection in nonparametric regression
- Model selection via multifold cross validation
- No unbiased estimator of the variance of K-fold cross-validation
- Nonparametric density estimation by exact leave-\(p\)-out cross-validation
- Nonparametric regression with correlated errors.
- On Kullback-Leibler loss and density estimation
- On bandwidth choice in nonparametric regression with both short- and long-range dependent errors
- On the bias and variability of bootstrap and cross-validation estimates of error rate in discrimination problems
- Optimal Oracle Inequality for Aggregation of Classifiers Under Low Noise Condition
- Oracle inequalities for multi-fold cross validation
- Periodic splines for spectral density estimation: the use of cross validation for determining the degree of smoothing
- Practical Approximate Solutions to Linear Operator Equations When the Data are Noisy
- Rademacher penalties and structural risk minimization
- Risk bounds for model selection via penalization
- Robust Estimation of a Location Parameter
- Robust Linear Model Selection by Cross-Validation
- Segmentation of the mean of heteroscedastic data via cross-validation
- Smoothed cross-validation
- Smoothing noisy data with spline functions: Estimating the correct degree of smoothing by the method of generalized cross-validation
- Some Comments on C P
- Statistical predictor identification
- Suboptimality of Penalized Empirical Risk Minimization in Classification
- The Estimation of Prediction Error
- The Predictive Sample Reuse Method with Applications
- The Relationship between Variable Selection and Data Agumentation and a Method for Prediction
- The cross-validated adaptive epsilon-net estimator
- Theory of Classification: a Survey of Some Recent Advances
- Weak convergence of dependent empirical measures with application to subsampling in function spaces
Cited in
(only showing first 100 items - show all)- Data-driven robust optimization
- Generalised density forecast combinations
- Penalized likelihood methods for modeling count data
- Cross-Validation, Risk Estimation, and Model Selection: Comment on a Paper by Rosset and Tibshirani
- Cross-validation for selecting a model selection procedure
- Portfolio approaches for constraint optimization problems
- Extensions of stability selection using subsamples of observations and covariates
- Linear Model Selection by Cross-Validation
- Slope heuristics: overview and implementation
- Parallel cross-validation: a scalable fitting method for Gaussian process models
- SUNNY: a lazy portfolio approach for constraint solving
- Physically-constrained data-driven inversions to infer the bed topography beneath glaciers flows. Application to East Antarctica
- Framing reinforcement learning from human reward: reward positivity, temporal discounting, episodicity, and performance
- Rank adaptive tensor recovery based model reduction for partial differential equations with high-dimensional random inputs
- Bayesian functional linear regression with sparse step functions
- SSC-EKE: semi-supervised classification with extensive knowledge exploitation
- Prediction error bounds for linear regression with the TREX
- Segmentation of the mean of heteroscedastic data via cross-validation
- Model selection in reinforcement learning
- On cross-validated Lasso in high dimensions
- Minimization and estimation of the variance of prediction errors for cross-validation designs
- Efficient approximate \(k\)-fold and leave-one-out cross-validation for ridge regression
- Virtual model validation of complex multiscale systems: applications to nonlinear elastostatics
- Ranking in evolving complex networks
- An Empirical Study of Indirect Cross-Validation
- High-dimensional regression with unknown variance
- Compressive sampling of polynomial chaos expansions: convergence analysis and sampling strategies
- Support vector regression based on grid-search method for short-term wind power forecasting
- Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC
- Cross-validation with confidence
- Global resolution of the support vector machine regression parameters selection problem with LPCC
- A note on the validity of cross-validation for evaluating autoregressive time series prediction
- Estimator selection in the Gaussian setting
- Hierarchical Bayesian models of reinforcement learning: introduction and comparison to alternative methods
- A new technique for postsample model selection and validation
- Fast Cross-validation for Multi-penalty High-dimensional Ridge Regression
- Leave-one-out cross-validation is risk consistent for Lasso
- Network cross-validation by edge sampling
- On the usefulness of cross-validation for directional forecast evaluation
- Consistency of cross validation for comparing regression procedures
- Modeling uncertainty of expert elicitation for use in risk-based optimization
- Generative adversarial networks for financial trading strategies fine-tuning and combination
- Cross-validation-based adaptive sampling for Gaussian process models
- Spectral likelihood expansions for Bayesian inference
- Semiparametric zero-inflated modeling in multi-ethnic study of atherosclerosis (MESA)
- A cross-validation based estimation of the proportion of true null hypotheses
- Oracle inequalities for multi-fold cross validation
- Derivation, identification and validation of a computational model of a novel synthetic regulatory network in yeast
- Data science, big data and statistics
- Data partition methodology for validation of predictive models
- On polynomial chaos expansion via gradient-enhanced \(\ell_1\)-minimization
- Oracle inequalities for cross-validation type procedures
- A survey on learning approaches for undirected graphical models. Application to scene object recognition
- A survey of Bayesian predictive methods for model assessment, selection and comparison
- Robust machine learning by median-of-means: theory and practice
- A comparative study of ordinary cross-validation, v-fold cross-validation and the repeated learning-testing methods
- Optimal cross-validation in density estimation with the \(L^{2}\)-loss
- Data-based models for the prediction of dam behaviour: a review and some methodological considerations
- Estimating the Kullback–Liebler risk based on multifold cross‐validation
- Asymptotics for regression models under loss of identifiability
- Use of a novel grammatical inference approach in classification of amyloidogenic hexapeptides
- Estimating the trace of the matrix inverse by interpolating from the diagonal of an approximate inverse
- A comparison of regularization methods applied to the linear discriminant function with high-dimensional microarray data
- Fast cross-validation via sequential testing
- Semiparametric stochastic volatility modelling using penalized splines
- R package rjmcmc: reversible jump MCMC using post‐processing
- Bayesian calibration, validation and uncertainty quantification for predictive modelling of tumour growth: a tutorial
- Model selection properties of forward selection and sequential cross‐validation for high‐dimensional regression
- The adaptive and the thresholded Lasso for potentially misspecified models (and a lower bound for the Lasso)
- Estimation of nonbinary random response
- Prediction of arch dam deformation via correlated multi-target stacking
- Multiple predicting \(K\)-fold cross-validation for model selection
- Risk prediction of hypertension complications based on the intelligent algorithm optimized Bayesian network
- Equity clusters through the lens of realized semicorrelations
- \textsf{StreaMRAK} a streaming multi-resolution adaptive kernel algorithm
- Estimation of the global mode of a density: minimaxity, adaptation, and computational complexity
- A MOM-based ensemble method for robustness, subsampling and hyperparameter tuning
- A negative correlation ensemble transfer learning method for fault diagnosis based on convolutional neural network
- Comorbidity of chronic diseases in the elderly: patterns identified by a copula design for mixed responses
- European exchange trading funds trading with locally weighted support vector regression
- Variational approach for learning Markov processes from time series data
- A data driven equivariant approach to constrained Gaussian mixture modeling
- MLP-ANN-based execution time prediction model and assessment of input parameters through structural modeling
- Upward and downward bias when measuring inequality of opportunity
- Sequential Learning of Regression Models by Penalized Estimation
- Lasso regression and its application in forecasting macro economic indicators: a study on Vietnam's exports
- State-by-state minimax adaptive estimation for nonparametric hidden Markov models
- Scale-constrained approaches for maximum likelihood estimation and model selection of clusterwise linear regression models
- An algorithm for computationally expensive engineering optimization problems
- Toward robust early-warning models: a horse race, ensembles and model uncertainty
- An introduction to nonparametric adaptive estimation
- Clustered active-subspace based local Gaussian process emulator for high-dimensional and complex computer models
- Aggregated hold out for sparse linear regression with a robust loss function
- Modelling of count data using nonparametric mixtures
- Modeling strength and failure variability due to porosity in additively manufactured metals
- Slope heuristics and V-fold model selection in heteroscedastic regression using strongly localized bases
- Targeted cross-validation
- Model selection in utility-maximizing binary prediction
- scientific article; zbMATH DE number 7370537 (Why is no real title available?)
- Machine-learning in optimization of expensive black-box functions
This page was built for publication: A survey of cross-validation procedures for model selection
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q975579)