A survey of cross-validation procedures for model selection
From MaRDI portal
(Redirected from Publication:975579)
Abstract: Used to estimate the risk of an estimator or to perform model selection, cross-validation is a widespread strategy because of its simplicity and its apparent universality. Many results exist on the model selection performances of cross-validation procedures. This survey intends to relate these results to the most recent advances of model selection theory, with a particular emphasis on distinguishing empirical statements from rigorous theoretical results. As a conclusion, guidelines are provided for choosing the best cross-validation procedure according to the particular features of the problem in hand.
Recommendations
Cites work
- scientific article; zbMATH DE number 3174053 (Why is no real title available?)
- scientific article; zbMATH DE number 3860199 (Why is no real title available?)
- scientific article; zbMATH DE number 3928119 (Why is no real title available?)
- scientific article; zbMATH DE number 3949528 (Why is no real title available?)
- scientific article; zbMATH DE number 3789676 (Why is no real title available?)
- scientific article; zbMATH DE number 20176 (Why is no real title available?)
- scientific article; zbMATH DE number 3483405 (Why is no real title available?)
- scientific article; zbMATH DE number 3591259 (Why is no real title available?)
- scientific article; zbMATH DE number 1332320 (Why is no real title available?)
- scientific article; zbMATH DE number 597913 (Why is no real title available?)
- scientific article; zbMATH DE number 1034037 (Why is no real title available?)
- scientific article; zbMATH DE number 2062404 (Why is no real title available?)
- scientific article; zbMATH DE number 1522808 (Why is no real title available?)
- scientific article; zbMATH DE number 3441460 (Why is no real title available?)
- scientific article; zbMATH DE number 3444596 (Why is no real title available?)
- scientific article; zbMATH DE number 3446442 (Why is no real title available?)
- scientific article; zbMATH DE number 835699 (Why is no real title available?)
- scientific article; zbMATH DE number 845714 (Why is no real title available?)
- scientific article; zbMATH DE number 893887 (Why is no real title available?)
- scientific article; zbMATH DE number 5056254 (Why is no real title available?)
- scientific article; zbMATH DE number 3266204 (Why is no real title available?)
- scientific article; zbMATH DE number 3279684 (Why is no real title available?)
- scientific article; zbMATH DE number 3366380 (Why is no real title available?)
- scientific article; zbMATH DE number 3374797 (Why is no real title available?)
- scientific article; zbMATH DE number 3053501 (Why is no real title available?)
- 10.1162/153244302760200704
- A comparative study of ordinary cross-validation, v-fold cross-validation and the repeated learning-testing methods
- A cross-validatory method for dependent data
- A distribution-free theory of nonparametric regression
- A local cross-validation algorithm
- A predictive approach to the random effect model
- A universal prior for integers and estimation by minimum description length
- Adaptive Regression by Mixing
- An asymptotically optimal window selection rule for kernel density estimates
- Analysis of variance of cross-validation estimators of the generalization error
- Approximate efficiency of a selection procedure for the number of regression variables
- Asymptotic comparison of (partial) cross-validation, GCV and randomized GCV in nonparametric regression
- Asymptotic optimality for \(C_ p\), \(C_ L\), cross-validation and generalized cross-validation: Discrete index set
- Asymptotic properties of criteria for selection of variables in multiple regression
- Asymptotics for and against cross-validation
- Bandwidth selection in robust smoothing
- Bayesian model averaging: A tutorial. (with comments and a rejoinder).
- Bootstrap Model Selection
- Can the strengths of AIC and BIC be shared? A conflict between model indentification and regression estimation
- Comparison of two bandwidth selectors with dependent errors
- Concentration inequalities and model selection. Ecole d'Eté de Probabilités de Saint-Flour XXXIII -- 2003.
- Consistency of cross validation for comparing regression procedures
- Consistent cross-validated density estimation
- Cross validation model selection criteria for linear regression based on the Kullback-Leibler discrepancy
- Cross-Validation of Regression Models
- Cross-validation in nonparametric regression with outliers
- DATA-DEPENDENT ESTIMATION OF PREDICTION FUNCTIONS
- Data-driven bandwidth choice for density estimation based on dependent data
- Distribution-free performance bounds for potential function rules
- Estimating the Error Rate of a Prediction Rule: Improvement on Cross-Validation
- Estimating the dimension of a model
- Estimation of dependences based on empirical data. Transl. from the Russian by Samuel Kotz
- Estimation of the conditional risk in classification: the swapping method
- From Stein's unbiased risk estimates to the method of generalized cross- validation
- Gaussian model selection
- Heuristics of instability and stabilization in model selection
- Histogram selection in non Gaussian regression
- How Biased is the Apparent Error Rate of a Prediction Rule?
- How Far Are Automatically Chosen Regression Smoothing Parameters From Their Optimum?
- Improvements on Cross-Validation: The .632+ Bootstrap Method
- Inference for the generalization error
- Kernel Regression Estimation Using Repeated Measurements Data
- Large sample optimality of least squares cross-validation in density estimation
- Least angle and \(\ell _{1}\) penalized regression: a review
- Linear Model Selection by Cross-Validation
- Minimal penalties for Gaussian model selection
- Model Selection and Multimodel Inference
- Model selection and error estimation
- Model selection by resampling penalization
- Model selection for regression on a random design
- Model selection in nonparametric regression
- Model selection via multifold cross validation
- No unbiased estimator of the variance of K-fold cross-validation
- Nonparametric density estimation by exact leave-\(p\)-out cross-validation
- Nonparametric regression with correlated errors.
- On Kullback-Leibler loss and density estimation
- On bandwidth choice in nonparametric regression with both short- and long-range dependent errors
- On the bias and variability of bootstrap and cross-validation estimates of error rate in discrimination problems
- Optimal Oracle Inequality for Aggregation of Classifiers Under Low Noise Condition
- Oracle inequalities for multi-fold cross validation
- Periodic splines for spectral density estimation: the use of cross validation for determining the degree of smoothing
- Practical Approximate Solutions to Linear Operator Equations When the Data are Noisy
- Rademacher penalties and structural risk minimization
- Risk bounds for model selection via penalization
- Robust Estimation of a Location Parameter
- Robust Linear Model Selection by Cross-Validation
- Segmentation of the mean of heteroscedastic data via cross-validation
- Smoothed cross-validation
- Smoothing noisy data with spline functions: Estimating the correct degree of smoothing by the method of generalized cross-validation
- Some Comments on C P
- Statistical predictor identification
- Suboptimality of Penalized Empirical Risk Minimization in Classification
- The Estimation of Prediction Error
- The Predictive Sample Reuse Method with Applications
- The Relationship between Variable Selection and Data Agumentation and a Method for Prediction
- The cross-validated adaptive epsilon-net estimator
- Theory of Classification: a Survey of Some Recent Advances
- Weak convergence of dependent empirical measures with application to subsampling in function spaces
Cited in
(only showing first 100 items - show all)- Estimation of scale functions to model heteroscedasticity by regularised kernel-based quantile methods
- On cross-validated Lasso in high dimensions
- High-dimensional regression with unknown variance
- Bayesian functional linear regression with sparse step functions
- A cross-validation based estimation of the proportion of true null hypotheses
- A survey of Bayesian predictive methods for model assessment, selection and comparison
- Prediction of arch dam deformation via correlated multi-target stacking
- Probabilities of discrepancy between minima of cross-validation, Vapnik bounds and true risks
- Fast cross-validation via sequential testing
- A phase transition for finding needles in nonlinear haystacks with LASSO artificial neural networks
- From zero to hero: realized partial (co)variances
- Bayesian estimation of ridge parameter under different loss functions
- Parallel cross-validation: a scalable fitting method for Gaussian process models
- Oracle inequalities for multi-fold cross validation
- Model selection via marginal likelihood estimation by combining thermodynamic integration and gradient matching
- On the usefulness of cross-validation for directional forecast evaluation
- MBLDA: a novel multiple between-class linear discriminant analysis
- Forbidden Knowledge and Specialized Training: A Versatile Solution for the Two Main Sources of Overfitting in Linear Regression
- Data science, big data and statistics
- Targeted cross-validation
- On the use of cross-validation for the calibration of the adaptive Lasso
- Portfolio approaches for constraint optimization problems
- An algorithm for computationally expensive engineering optimization problems
- Bayesian calibration, validation and uncertainty quantification for predictive modelling of tumour growth: a tutorial
- scientific article; zbMATH DE number 7370537 (Why is no real title available?)
- Clusterwise linear regression modeling with soft scale constraints
- On polynomial chaos expansion via gradient-enhanced \(\ell_1\)-minimization
- Model selection in reinforcement learning
- Generalised density forecast combinations
- Reducing bias and mitigating the influence of excess of zeros in regression covariates with multi-outcome adaptive LAD-lasso
- From zero crossings to quantile-frequency analysis of time series with an application to nondestructive evaluation
- Modeling swine population dynamics at a finer temporal resolution
- New techniques to perform cross-validation for time series models
- Metamodel construction for sensitivity analysis
- Virtual model validation of complex multiscale systems: applications to nonlinear elastostatics
- An Integrated machine learning and DEA-predefined performance outcome prediction framework with high-dimensional imbalanced data
- On model selection criteria for climate change impact studies
- Using Monte Carlo particle methods to estimate and quantify uncertainty in periodic parameters (research)
- Modelling of count data using nonparametric mixtures
- Efficient semiparametric estimation and model selection for multidimensional mixtures
- Cross-validation based weights and structure determination of Chebyshev-polynomial neural networks for pattern classification
- Projection-based text line segmentation with a variable threshold
- The adaptive and the thresholded Lasso for potentially misspecified models (and a lower bound for the Lasso)
- Equity clusters through the lens of realized semicorrelations
- Efficient estimation and correction of selection-induced bias with order statistics
- The out-of-source error in multi-source cross validation-type procedures
- Semiparametric zero-inflated modeling in multi-ethnic study of atherosclerosis (MESA)
- Carpal tunnel syndrome automatic classification: electromyography vs. ultrasound imaging
- Multi-population mortality forecasting using tensor decomposition
- Using Machine Learning Methods to Support Causal Inference in Econometrics
- Consistency of cross validation for comparing regression procedures
- Oracle inequalities for cross-validation type procedures
- An augmented MFS approach for brain activity reconstruction
- Low-rank approximation and completion of positive tensors
- Hold-out estimates of prediction models for Markov processes
- Slope heuristics and V-fold model selection in heteroscedastic regression using strongly localized bases
- MLP-ANN-based execution time prediction model and assessment of input parameters through structural modeling
- Estimating the Kullback–Liebler risk based on multifold cross‐validation
- Robust machine learning by median-of-means: theory and practice
- Trading signals in VIX futures
- Cross-validation with confidence
- Unbiased estimator for the variance of the leave-one-out cross-validation estimator for a Bayesian normal model with fixed variance
- Cross-validation-based adaptive sampling for Gaussian process models
- From Fixed-X to Random-X Regression: Bias-Variance Decompositions, Covariance Penalties, and Prediction Error Estimation: Rejoinder
- Clustered active-subspace based local Gaussian process emulator for high-dimensional and complex computer models
- Aggregated hold out for sparse linear regression with a robust loss function
- Modeling strength and failure variability due to porosity in additively manufactured metals
- Model selection in utility-maximizing binary prediction
- Trade-off between predictive performance and FDR control for high-dimensional Gaussian model selection
- Parameter estimation for models of chemical reaction networks from experimental data of reaction rates
- Modeling uncertainty of expert elicitation for use in risk-based optimization
- Cross-validation for change-point regression: pitfalls and solutions
- Variational approach for learning Markov processes from time series data
- Spectral likelihood expansions for Bayesian inference
- A data driven equivariant approach to constrained Gaussian mixture modeling
- Ranking in evolving complex networks
- Regular, median and Huber cross‐validation: A computational comparison
- Cross-Validation for Correlated Data
- A new approach for classifier model selection and tuning using logistic regression and genetic algorithms
- Fermentation of \textit{Saccharomyces cerevisiae} -- combining kinetic modeling and optimization techniques points out avenues to effective process design
- Model selection criteria based on cross-validatory concordance statistics
- \textsf{StreaMRAK} a streaming multi-resolution adaptive kernel algorithm
- Generative adversarial networks for financial trading strategies fine-tuning and combination
- Information criteria for model selection
- Prediction error bounds for linear regression with the TREX
- On foundation of the dimensionality reduction method for explanatory variables
- MDR method for nonbinary response variable
- Leave-one-out cross-validation is risk consistent for Lasso
- Decision-based model selection
- A comparative study of ordinary cross-validation, v-fold cross-validation and the repeated learning-testing methods
- \(\ell_1\)-penalised ordinal polytomous regression estimators with application to gene expression studies
- Data partition methodology for validation of predictive models
- Multiple predicting \(K\)-fold cross-validation for model selection
- Model selection properties of forward selection and sequential cross‐validation for high‐dimensional regression
- Estimation of nonbinary random response
- Root-finding approaches for computing conformal prediction set
- Bayesian synergistic metamodeling (BSM) for physical information infused data-driven metamodeling
- Global resolution of the support vector machine regression parameters selection problem with LPCC
- Comorbidity of chronic diseases in the elderly: patterns identified by a copula design for mixed responses
- A MOM-based ensemble method for robustness, subsampling and hyperparameter tuning
This page was built for publication: A survey of cross-validation procedures for model selection
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q975579)