Distribution-free predictive inference for regression
From MaRDI portal
Abstract: We develop a general framework for distribution-free predictive inference in regression, using conformal inference. The proposed methodology allows for the construction of a prediction band for the response variable using any estimator of the regression function. The resulting prediction band preserves the consistency properties of the original estimator under standard assumptions, while guaranteeing finite-sample marginal coverage even when these assumptions do not hold. We analyze and compare, both empirically and theoretically, the two major variants of our conformal framework: full conformal inference and split conformal inference, along with a related jackknife method. These methods offer different tradeoffs between statistical accuracy (length of resulting prediction intervals) and computational efficiency. As extensions, we develop a method for constructing valid in-sample prediction intervals called {it rank-one-out} conformal inference, which has essentially the same computational efficiency as split conformal inference. We also describe an extension of our procedures for producing prediction bands with locally varying length, in order to adapt to heteroskedascity in the data. Finally, we propose a model-free notion of variable importance, called {it leave-one-covariate-out} or LOCO inference. Accompanying this paper is an R package { t conformalInference} that implements all of the proposals we have introduced. In the spirit of reproducibility, all of our empirical results can also be easily (re)generated using this package.
Recommendations
Cites work
- scientific article; zbMATH DE number 3145583 (Why is no real title available?)
- scientific article; zbMATH DE number 2168212 (Why is no real title available?)
- scientific article; zbMATH DE number 1931847 (Why is no real title available?)
- scientific article; zbMATH DE number 845714 (Why is no real title available?)
- A conformal prediction approach to explore functional data
- Asymptotics of selective inference
- Classification with confidence
- Conditional validity of inductive conformal predictors
- Confidence Intervals and Hypothesis Testing for High-Dimensional Regression
- Confidence intervals for low dimensional parameters in high dimensional linear models
- Discussion: ``A significance test for the lasso
- Distribution-Free Prediction Sets
- Distribution-free Prediction Bands for Non-parametric Regression
- Exact post-selection inference, with application to the Lasso
- On asymptotically optimal confidence regions and tests for high-dimensional models
- On-line predictive linear regression
- Predictive Intervals Based on Reuse of the Sample
- Random forests
- Regularization and Variable Selection Via the Elastic Net
- Selective inference with a randomized response
- Simultaneous analysis of Lasso and Dantzig selector
- Sparse additive models
- Sparse models and methods for optimal instruments with an application to eminent domain
- Sparsity oracle inequalities for the Lasso
- Stability Selection
- Statistical significance in high-dimensional linear models
- Valid post-selection inference
Cited in
(93)- Methods to compute prediction intervals: a review and new results
- The limits of distribution-free conditional predictive inference
- Conformal Prediction: A Gentle Introduction
- Comparing six shrinkage estimators with large sample theory and asymptotically optimal prediction intervals
- Model-free model-fitting and predictive distributions
- Honest Confidence Sets for High-Dimensional Regression by Projection and Shrinkage
- GAMLSS: A distributional regression approach
- Predictive inference with the jackknife+
- Valid model-free prediction of future insurance claims
- A General Framework for Inference on Algorithm-Agnostic Variable Importance
- Conformal prediction: a unified review of theory and new challenges
- The Holdout Randomization Test for Feature Selection in Black Box Models
- Nested conformal prediction sets for classification with applications to probation data
- scientific article; zbMATH DE number 7370525 (Why is no real title available?)
- Rejoinder: Models as approximations
- Is distribution-free inference possible for binary regression?
- Random Forest Prediction Intervals
- Clustering on the torus by conformal prediction
- Conformal prediction bands for multivariate functional data
- Unsupervised streaming anomaly detection for instrumented infrastructure
- A survey on the explainability of supervised machine learning
- Distributional anchor regression
- Post-model-selection inference in linear regression models: an integrated review
- Discussion of the Paper “Prediction, Estimation, and Attribution” by B. Efron
- Discussion of Professor Bradley Efron’s Article on “Prediction, Estimation, and Attribution”
- scientific article; zbMATH DE number 1254128 (Why is no real title available?)
- Discussion of the Paper “Prediction, Estimation, and Attribution” by B. Efron
- Discussion of Professor Bradley Efron’s Article on “Prediction, Estimation, and Attribution”
- Cross-validation with confidence
- Nonparametric variable importance assessment using machine learning techniques
- Conformal prediction beyond exchangeability
- Nonparametric predictive distributions based on conformal prediction
- A Conformal Approach for Distribution-free Prediction of Functional Data
- Fast exact conformalization of the Lasso using piecewise linear homotopy
- Distribution-free conditional median inference
- Grouped feature importance and combined features effect plot
- An Exact and Robust Conformal Inference Method for Counterfactual and Synthetic Controls
- Model-free prediction and regression. A transformation-based approach to inference
- Discussion of Kallus and Mo, Qi, and Liu: New Objectives for Policy Learning
- Robust prediction interval estimation for Gaussian processes by cross-validation method
- Unrestricted permutation forces extrapolation: variable importance requires at least one more model, or there is no free variable importance
- Quantile regression approach to conditional mode estimation
- conformalInference.multi
- Homeostasis phenomenon in conformal prediction and predictive distribution functions
- Validity, consonant plausibility measures, and Conformal prediction
- Conditional predictive inference for stable algorithms
- Bootstrapping and sample splitting for high-dimensional, assumption-lean inference
- Comment: Statistical inference from a predictive perspective
- Multi split conformal prediction
- Prediction intervals for GLMs, GAMs, and some survival regression models
- Testing conditional independence in supervised learning algorithms
- scientific article; zbMATH DE number 7626724 (Why is no real title available?)
- Accelerating difficulty estimation for conformal regression forests
- Stochastic Tree Ensembles for Regularized Nonlinear Regression
- Set-Valued Support Vector Machine with Bounded Error Rates
- A minimax framework for quantifying risk-fairness trade-off in regression
- Conditional feature importance for mixed data
- Root-finding approaches for computing conformal prediction set
- Selective review of biased sampling problems with applications in modern statistics
- Inference for sparse linear regression based on the leave-one-covariate-out solution path
- Conformal Sensitivity Analysis for Individual Treatment Effects
- Hoeffding and Bernstein inequalities for weighted sums of exchangeable random variables
- What is a Randomization Test?
- Optimal Subsampling via Predictive Inference
- Robust Validation: Confident Predictions Even When Distributions Shift
- Variable selection in function-on-scalar single-index model via the alternating direction method of multipliers
- Supervised Machine Learning Techniques: An Overview with Applications to Banking
- De Finetti's theorem and related results for infinite weighted exchangeable sequences
- Understanding complex predictive models with ghost variables
- Total effects with constrained features
- A confidence machine for sparse high-order interaction model
- Multi-split conformal prediction via Cauchy aggregation
- Regression trees for fast and adaptive prediction intervals
- Feature importance: a closer look at Shapley values and LOCO
- Multivariate scalar on multidimensional distribution regression with application to modeling the association between physical activity and cognitive functions
- Distribution-Free Prediction Sets for Two-Layer Hierarchical Models
- Nonparametric Estimation and Conformal Inference of the Sufficient Forecasting With a Diverging Number of Factors
- Dimension-agnostic inference using cross U-statistics
- Explainable contextual anomaly detection using quantile regression forests
- Can a single neuron learn predictive uncertainty?
- Prediction intervals with controlled length in the heteroscedastic Gaussian regression
- Synthetic control as online linear regression
- Variable Selection Via Thompson Sampling
- Prediction in measurement error models
- Training-conditional coverage for distribution-free predictive inference
- Incorporating relative error criterion to conformal prediction for positive data
- Model-independent detection of new physics signals using interpretable semisupervised classifier tests
- A Two-Sample Conditional Distribution Test Using Conformal Prediction and Weighted Rank Sum
- Valid Model-Free Spatial Prediction
- Uncertainty in lung cancer stage for survival estimation via set-valued classification
- A latent variable mixture model for composition-on-composition regression with application to chemical recycling
- Neural networks for extreme quantile regression with an application to forecasting of flood risk
- Gaussian copula function-on-scalar regression in reproducing kernel Hilbert space
This page was built for publication: Distribution-free predictive inference for regression
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q112972)