A survey of cross-validation procedures for model selection
From MaRDI portal
(Redirected from Publication:975579)
Abstract: Used to estimate the risk of an estimator or to perform model selection, cross-validation is a widespread strategy because of its simplicity and its apparent universality. Many results exist on the model selection performances of cross-validation procedures. This survey intends to relate these results to the most recent advances of model selection theory, with a particular emphasis on distinguishing empirical statements from rigorous theoretical results. As a conclusion, guidelines are provided for choosing the best cross-validation procedure according to the particular features of the problem in hand.
Recommendations
Cites work
- scientific article; zbMATH DE number 3174053 (Why is no real title available?)
- scientific article; zbMATH DE number 3860199 (Why is no real title available?)
- scientific article; zbMATH DE number 3928119 (Why is no real title available?)
- scientific article; zbMATH DE number 3949528 (Why is no real title available?)
- scientific article; zbMATH DE number 3789676 (Why is no real title available?)
- scientific article; zbMATH DE number 20176 (Why is no real title available?)
- scientific article; zbMATH DE number 3483405 (Why is no real title available?)
- scientific article; zbMATH DE number 3591259 (Why is no real title available?)
- scientific article; zbMATH DE number 1332320 (Why is no real title available?)
- scientific article; zbMATH DE number 597913 (Why is no real title available?)
- scientific article; zbMATH DE number 1034037 (Why is no real title available?)
- scientific article; zbMATH DE number 2062404 (Why is no real title available?)
- scientific article; zbMATH DE number 1522808 (Why is no real title available?)
- scientific article; zbMATH DE number 3441460 (Why is no real title available?)
- scientific article; zbMATH DE number 3444596 (Why is no real title available?)
- scientific article; zbMATH DE number 3446442 (Why is no real title available?)
- scientific article; zbMATH DE number 835699 (Why is no real title available?)
- scientific article; zbMATH DE number 845714 (Why is no real title available?)
- scientific article; zbMATH DE number 893887 (Why is no real title available?)
- scientific article; zbMATH DE number 5056254 (Why is no real title available?)
- scientific article; zbMATH DE number 3266204 (Why is no real title available?)
- scientific article; zbMATH DE number 3279684 (Why is no real title available?)
- scientific article; zbMATH DE number 3366380 (Why is no real title available?)
- scientific article; zbMATH DE number 3374797 (Why is no real title available?)
- scientific article; zbMATH DE number 3053501 (Why is no real title available?)
- 10.1162/153244302760200704
- A comparative study of ordinary cross-validation, v-fold cross-validation and the repeated learning-testing methods
- A cross-validatory method for dependent data
- A distribution-free theory of nonparametric regression
- A local cross-validation algorithm
- A predictive approach to the random effect model
- A universal prior for integers and estimation by minimum description length
- Adaptive Regression by Mixing
- An asymptotically optimal window selection rule for kernel density estimates
- Analysis of variance of cross-validation estimators of the generalization error
- Approximate efficiency of a selection procedure for the number of regression variables
- Asymptotic comparison of (partial) cross-validation, GCV and randomized GCV in nonparametric regression
- Asymptotic optimality for \(C_ p\), \(C_ L\), cross-validation and generalized cross-validation: Discrete index set
- Asymptotic properties of criteria for selection of variables in multiple regression
- Asymptotics for and against cross-validation
- Bandwidth selection in robust smoothing
- Bayesian model averaging: A tutorial. (with comments and a rejoinder).
- Bootstrap Model Selection
- Can the strengths of AIC and BIC be shared? A conflict between model indentification and regression estimation
- Comparison of two bandwidth selectors with dependent errors
- Concentration inequalities and model selection. Ecole d'Eté de Probabilités de Saint-Flour XXXIII -- 2003.
- Consistency of cross validation for comparing regression procedures
- Consistent cross-validated density estimation
- Cross validation model selection criteria for linear regression based on the Kullback-Leibler discrepancy
- Cross-Validation of Regression Models
- Cross-validation in nonparametric regression with outliers
- DATA-DEPENDENT ESTIMATION OF PREDICTION FUNCTIONS
- Data-driven bandwidth choice for density estimation based on dependent data
- Distribution-free performance bounds for potential function rules
- Estimating the Error Rate of a Prediction Rule: Improvement on Cross-Validation
- Estimating the dimension of a model
- Estimation of dependences based on empirical data. Transl. from the Russian by Samuel Kotz
- Estimation of the conditional risk in classification: the swapping method
- From Stein's unbiased risk estimates to the method of generalized cross- validation
- Gaussian model selection
- Heuristics of instability and stabilization in model selection
- Histogram selection in non Gaussian regression
- How Biased is the Apparent Error Rate of a Prediction Rule?
- How Far Are Automatically Chosen Regression Smoothing Parameters From Their Optimum?
- Improvements on Cross-Validation: The .632+ Bootstrap Method
- Inference for the generalization error
- Kernel Regression Estimation Using Repeated Measurements Data
- Large sample optimality of least squares cross-validation in density estimation
- Least angle and \(\ell _{1}\) penalized regression: a review
- Linear Model Selection by Cross-Validation
- Minimal penalties for Gaussian model selection
- Model Selection and Multimodel Inference
- Model selection and error estimation
- Model selection by resampling penalization
- Model selection for regression on a random design
- Model selection in nonparametric regression
- Model selection via multifold cross validation
- No unbiased estimator of the variance of K-fold cross-validation
- Nonparametric density estimation by exact leave-\(p\)-out cross-validation
- Nonparametric regression with correlated errors.
- On Kullback-Leibler loss and density estimation
- On bandwidth choice in nonparametric regression with both short- and long-range dependent errors
- On the bias and variability of bootstrap and cross-validation estimates of error rate in discrimination problems
- Optimal Oracle Inequality for Aggregation of Classifiers Under Low Noise Condition
- Oracle inequalities for multi-fold cross validation
- Periodic splines for spectral density estimation: the use of cross validation for determining the degree of smoothing
- Practical Approximate Solutions to Linear Operator Equations When the Data are Noisy
- Rademacher penalties and structural risk minimization
- Risk bounds for model selection via penalization
- Robust Estimation of a Location Parameter
- Robust Linear Model Selection by Cross-Validation
- Segmentation of the mean of heteroscedastic data via cross-validation
- Smoothed cross-validation
- Smoothing noisy data with spline functions: Estimating the correct degree of smoothing by the method of generalized cross-validation
- Some Comments on C P
- Statistical predictor identification
- Suboptimality of Penalized Empirical Risk Minimization in Classification
- The Estimation of Prediction Error
- The Predictive Sample Reuse Method with Applications
- The Relationship between Variable Selection and Data Agumentation and a Method for Prediction
- The cross-validated adaptive epsilon-net estimator
- Theory of Classification: a Survey of Some Recent Advances
- Weak convergence of dependent empirical measures with application to subsampling in function spaces
Cited in
(only showing first 100 items - show all)- Prediction of arch dam deformation via correlated multi-target stacking
- Multiple predicting \(K\)-fold cross-validation for model selection
- Risk prediction of hypertension complications based on the intelligent algorithm optimized Bayesian network
- Equity clusters through the lens of realized semicorrelations
- \textsf{StreaMRAK} a streaming multi-resolution adaptive kernel algorithm
- Estimation of the global mode of a density: minimaxity, adaptation, and computational complexity
- A MOM-based ensemble method for robustness, subsampling and hyperparameter tuning
- A negative correlation ensemble transfer learning method for fault diagnosis based on convolutional neural network
- Comorbidity of chronic diseases in the elderly: patterns identified by a copula design for mixed responses
- European exchange trading funds trading with locally weighted support vector regression
- Variational approach for learning Markov processes from time series data
- A data driven equivariant approach to constrained Gaussian mixture modeling
- MLP-ANN-based execution time prediction model and assessment of input parameters through structural modeling
- Upward and downward bias when measuring inequality of opportunity
- Sequential Learning of Regression Models by Penalized Estimation
- Lasso regression and its application in forecasting macro economic indicators: a study on Vietnam's exports
- State-by-state minimax adaptive estimation for nonparametric hidden Markov models
- Scale-constrained approaches for maximum likelihood estimation and model selection of clusterwise linear regression models
- An algorithm for computationally expensive engineering optimization problems
- Toward robust early-warning models: a horse race, ensembles and model uncertainty
- An introduction to nonparametric adaptive estimation
- Clustered active-subspace based local Gaussian process emulator for high-dimensional and complex computer models
- Aggregated hold out for sparse linear regression with a robust loss function
- Modelling of count data using nonparametric mixtures
- Modeling strength and failure variability due to porosity in additively manufactured metals
- Slope heuristics and V-fold model selection in heteroscedastic regression using strongly localized bases
- Targeted cross-validation
- Model selection in utility-maximizing binary prediction
- scientific article; zbMATH DE number 7370537 (Why is no real title available?)
- Machine-learning in optimization of expensive black-box functions
- Clusterwise linear regression modeling with soft scale constraints
- Surprises in high-dimensional ridgeless least squares interpolation
- Low-rank approximation and completion of positive tensors
- Correcting for unknown errors in sparse high-dimensional function approximation
- Sparse broadband beamformer design via proximal optimization Techniques
- Nonasymptotic control of the MLE for misspecified nonparametric hidden Markov models
- Optimality of training/test size and resampling effectiveness in cross-validation
- Best subset selection via cross-validation criterion
- Efficient semiparametric estimation and model selection for multidimensional mixtures
- A phase transition for finding needles in nonlinear haystacks with LASSO artificial neural networks
- From zero to hero: realized partial (co)variances
- An augmented MFS approach for brain activity reconstruction
- From Fixed-X to Random-X Regression: Bias-Variance Decompositions, Covariance Penalties, and Prediction Error Estimation
- Stochastic local interaction model: an alternative to kriging for massive datasets
- Estimating the index of increase via balancing deterministic and random data
- Volatility forecasting via SVR-GARCH with mixture of Gaussian kernels
- Metamodel construction for sensitivity analysis
- A new approach for classifier model selection and tuning using logistic regression and genetic algorithms
- Cross-validation based weights and structure determination of Chebyshev-polynomial neural networks for pattern classification
- Projection-based text line segmentation with a variable threshold
- Fermentation of \textit{Saccharomyces cerevisiae} -- combining kinetic modeling and optimization techniques points out avenues to effective process design
- Model selection criteria based on cross-validatory concordance statistics
- Bayesian inversion using adaptive polynomial chaos kriging within subset simulation
- Extended differential geometric LARS for high-dimensional GLMs with general dispersion parameter
- Decision-based model selection
- Data-driven robust optimization
- Generalised density forecast combinations
- Penalized likelihood methods for modeling count data
- Cross-Validation, Risk Estimation, and Model Selection: Comment on a Paper by Rosset and Tibshirani
- Cross-validation for selecting a model selection procedure
- Portfolio approaches for constraint optimization problems
- Extensions of stability selection using subsamples of observations and covariates
- Linear Model Selection by Cross-Validation
- Slope heuristics: overview and implementation
- Parallel cross-validation: a scalable fitting method for Gaussian process models
- SUNNY: a lazy portfolio approach for constraint solving
- Physically-constrained data-driven inversions to infer the bed topography beneath glaciers flows. Application to East Antarctica
- Framing reinforcement learning from human reward: reward positivity, temporal discounting, episodicity, and performance
- Rank adaptive tensor recovery based model reduction for partial differential equations with high-dimensional random inputs
- Bayesian functional linear regression with sparse step functions
- SSC-EKE: semi-supervised classification with extensive knowledge exploitation
- Prediction error bounds for linear regression with the TREX
- Segmentation of the mean of heteroscedastic data via cross-validation
- Model selection in reinforcement learning
- On cross-validated Lasso in high dimensions
- Minimization and estimation of the variance of prediction errors for cross-validation designs
- Efficient approximate \(k\)-fold and leave-one-out cross-validation for ridge regression
- Virtual model validation of complex multiscale systems: applications to nonlinear elastostatics
- Ranking in evolving complex networks
- An Empirical Study of Indirect Cross-Validation
- High-dimensional regression with unknown variance
- Compressive sampling of polynomial chaos expansions: convergence analysis and sampling strategies
- Support vector regression based on grid-search method for short-term wind power forecasting
- Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC
- Cross-validation with confidence
- Global resolution of the support vector machine regression parameters selection problem with LPCC
- A note on the validity of cross-validation for evaluating autoregressive time series prediction
- Estimator selection in the Gaussian setting
- Hierarchical Bayesian models of reinforcement learning: introduction and comparison to alternative methods
- A new technique for postsample model selection and validation
- Fast Cross-validation for Multi-penalty High-dimensional Ridge Regression
- Leave-one-out cross-validation is risk consistent for Lasso
- Network cross-validation by edge sampling
- On the usefulness of cross-validation for directional forecast evaluation
- Consistency of cross validation for comparing regression procedures
- Modeling uncertainty of expert elicitation for use in risk-based optimization
- Generative adversarial networks for financial trading strategies fine-tuning and combination
- Cross-validation-based adaptive sampling for Gaussian process models
- Spectral likelihood expansions for Bayesian inference
- Semiparametric zero-inflated modeling in multi-ethnic study of atherosclerosis (MESA)
This page was built for publication: A survey of cross-validation procedures for model selection
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q975579)