On Measuring and Correcting the Effects of Data Mining and Model Selection

From MaRDI portal
Publication:3839585

DOI10.2307/2669609zbMath0920.62056OpenAlexW4237415315MaRDI QIDQ3839585

Jianming Ye

Publication date: 9 August 1998

Full work available at URL: https://doi.org/10.2307/2669609



Related Items

A discussion of prior-based Bayesian information criterion (PBIC), Greedy algorithms for prediction, Using simulated annealing to optimize the feature selection problem in marketing applications, Small area mean estimation after effect clustering, Tuning parameter selection in sparse regression modeling, Least angle regression. (With discussion), Estimation of an oblique structure via penalized likelihood factor analysis, Testing conditional mean through regression model sequence using Yanai's generalized coefficient of determination, On generalized degrees of freedom with application in linear mixed models selection, On improved loss estimation for shrinkage estimators, Extending AIC to best subset regression, False Discovery Rates to Detect Signals from Incomplete Spatially Aggregated Data, Degrees of freedom for piecewise Lipschitz estimators, Estimation of Lyapunov spectrum and model selection for a chaotic time series, The dual and degrees of freedom of linearly constrained generalized Lasso, Component selection and smoothing in multivariate nonparametric regression, Improving Reliability Estimation for Individual Numeric Predictions: A Machine Learning Approach, Computing AIC for black-box models using generalized degrees of freedom: A comparison with cross-validation, Selection model for domains across time: application to labour force survey by economic activities, A note on the generalized degrees of freedom under the \(L_{1}\) loss function, Computing the degrees of freedom of rank-regularized estimators and cousins, On the association between a random parameter and an observable, Resampling-based information criteria for best-subset regression, Geometrically designed variable knot splines in generalized (non-)linear models, Estimation of nonlinear differential equation model for glucose-insulin dynamics in type I diabetic patients using generalized smoothing, Automatic identification of curve shapes with applications to ultrasonic vocalization, The truth about the effective dimension, Combining Multiple Biomarker Models in Logistic Regression, Prediction error after model search, Model selection uncertainty and stability in beta regression models: a study of bootstrap-based model averaging with an empirical application to clickstream data, Optimal Simulator Selection, Combining models in longitudinal data analysis, Criterion constrained Bayesian hierarchical models, Efficient regularized isotonic regression with application to gene-gene interaction search, A Generalization Gap Estimation for Overparameterized Models via the Langevin Functional Variance, Model selection for two-sample problems with right-censored data: an application of Cox model, Conditional and unconditional methods for selecting variables in linear mixed models, Generalized degrees of freedom and adaptive model selection in linear mixed-effects models, Conditional Akaike information criterion for generalized linear mixed models, Optimal variance estimation without estimating the mean function, Selection of model selection criteria for multivariate ridge regression, Model selection in regression based on pre-smoothing, Autoregressive model selection based on a prediction perspective, Feasible generalized least squares using support vector regression, A method for choosing the smoothing parameter in a semi-parametric model for detecting change-points in blood flow, Effective degrees of freedom and its application to conditional AIC for linear mixed-effects models with correlated error structures, Measuring the prediction error. A comparison of cross-validation, bootstrap and covariance penalty methods, A new approach for selecting the number of factors, An introduction to model selection, Discussion of “From Fixed-X to Random-X Regression: Bias-Variance Decompositions, Covariance Penalties, and Prediction Error Estimation”, On the ``degrees of freedom of the lasso, Reducing over-dispersion by generalized degree of freedom and propensity score, On the choice of difference sequence in a unified framework for variance estimation in nonparametric regression, Model Selection for Generalized Estimating Equations Accommodating Dropout Missingness, On the degrees of freedom of mixed matrix regression, Degrees of freedom in low rank matrix estimation, Local behavior of sparse analysis regularization: applications to risk estimation, Detecting and handling outlying trajectories in irregularly sampled functional datasets, Markov chain estimation for test theory without an answer key, Generalized \(\ell_1\)-penalized quantile regression with linear constraints, PARSIMONIOUS PARAMETERIZATION OF AGE-PERIOD-COHORT MODELS BY BAYESIAN SHRINKAGE, A new adaptive local linear prediction method and its application in hydrological time series, The generalized degrees of freedom of multilinear principal component analysis, Selection Strategy for Covariance Structure of Random Effects in Linear Mixed-effects Models, A flexible shrinkage operator for fussy grouped variable selection, Fence methods for mixed model selection, Statistical significance of the Netflix challenge, Low Complexity Regularization of Linear Inverse Problems, An algebraic characterization of the optimum of regularized kernel methods, A likelihood‐based comparison of temporal models for physical processes, Bayesian P-spline estimation in hierarchical models specified by systems of affine differential equations, Adaptive order selection for autoregressive models, Estimation of the conditional risk in classification: the swapping method, Degrees of freedom for regularized regression with Huber loss and linear constraints, An improved model averaging scheme for logistic regression, Excess Optimism: How Biased is the Apparent Error of an Estimator Tuned by SURE?, A fast algorithm for optimizing ridge parameters in a generalized ridge regression by minimizing a model selection criterion, Model averaging for linear mixed models via augmented Lagrangian, On the predictive risk in misspecified quantile regression, Information criteria bias correction for group selection, Smoothing spline ANOVA models for large data sets with Bernoulli observations and the randomized GACV., Generalized cross validation in variable selection with and without shrinkage, Data enriched linear regression, CLEAR: Covariant LEAst-Square Refitting with Applications to Image Restoration, Model selection in linear mixed models, Sparse estimation via nonconcave penalized likelihood in factor analysis model, Prediction errors for penalized regressions based on generalized approximate message passing


Uses Software