Boosting algorithms: regularization, prediction and model fitting
From MaRDI portal
Abstract: We present a statistical perspective on boosting. Special emphasis is given to estimating potentially complex parametric or nonparametric models, including generalized linear and additive models as well as regression models for survival analysis. Concepts of degrees of freedom and corresponding Akaike or Bayesian information criteria, particularly useful for regularization and variable selection in high-dimensional covariate spaces, are discussed as well. The practical aspects of boosting procedures for fitting statistical models are illustrated by means of the dedicated open-source software package mboost. This package implements functions which can be used for model fitting, prediction and variable selection. It is flexible, allowing for the implementation of new boosting algorithms optimizing user-specified loss functions.
Recommendations
- Comment: Boosting algorithms: regularization, prediction and model fitting
- Rejoinder: Boosting algorithms: regularization, prediction and model fitting
- Comment on: Boosting algorithms: regularization, prediction and model fitting
- Boosting methods for regression
- scientific article; zbMATH DE number 2086343
- Boosting. Foundations and algorithms.
- Stochastic boosting algorithms
- Stochastic boosting algorithms
- The boosting approach to machine learning: an overview
- Robust boosting for regression problems
Cites work
- scientific article; zbMATH DE number 5957408 (Why is no real title available?)
- scientific article; zbMATH DE number 5957506 (Why is no real title available?)
- scientific article; zbMATH DE number 3123424 (Why is no real title available?)
- scientific article; zbMATH DE number 47282 (Why is no real title available?)
- scientific article; zbMATH DE number 3637090 (Why is no real title available?)
- scientific article; zbMATH DE number 700016 (Why is no real title available?)
- scientific article; zbMATH DE number 1950578 (Why is no real title available?)
- scientific article; zbMATH DE number 845714 (Why is no real title available?)
- scientific article; zbMATH DE number 5056247 (Why is no real title available?)
- 10.1162/1532443041424319
- 10.1162/153244304773936108
- A decision-theoretic generalization of on-line learning and an application to boosting
- A multivariate FGD technique to improve VaR computation in equity markets
- A new approach to variable selection in least squares problems
- AdaBoost is consistent
- Adaptive Lasso for sparse high-dimensional regression models
- Additive logistic regression: a statistical view of boosting. (With discussion and a rejoinder by the authors)
- Aggregating classifiers with ordinal response structure
- Arcing classifiers. (With discussion)
- Bagging predictors
- Better Subset Regression Using the Nonnegative Garrote
- BoosTexter: A boosting-based system for text categorization
- Boosted classification trees and class probability/quantile estimation
- Boosting With theL2Loss
- Boosting algorithms: with an application to bootstrapping multivariate time series
- Boosting for high-dimensional linear models
- Boosting ridge regression
- Boosting the margin: a new explanation for the effectiveness of voting methods
- Boosting with early stopping: convergence and consistency
- Convergence Rates of General Regularization Methods for Statistical Inverse Problems and Applications
- Convexity, Classification, and Risk Bounds
- Cryptographic limitations on learning Boolean formulae and finite automata
- ElemStatLearn
- Empirical margin distributions and bounding the generalization error of combined classifiers
- Generalized Additive Modeling with Implicit Variable Selection by Likelihood‐Based Boosting
- Generalized additive models
- Generalized monotonic regression based on B-splines with an application to air pollution data
- Greedy function approximation: A gradient boosting machine.
- High-dimensional graphs and variable selection with the Lasso
- Knot selection by boosting techniques
- Least angle regression. (With discussion)
- Looking for lumps: boosting and bagging for density estimation.
- Matching pursuits with time-frequency dictionaries
- Model Selection and the Principle of Minimum Description Length
- Multi-class AdaBoost
- On boosting kernel regression
- On early stopping in gradient descent learning
- On the Bayes-risk consistency of regularized boosting methods.
- Persistene in high-dimensional linear predictor-selection and the virtue of overparametrization
- Process consistency for AdaBoost.
- Random forests
- Smoothing Parameter Selection in Nonparametric Regression Using an Improved Akaike Information Criterion
- Soft margins for AdaBoost
- Sparse boosting
- Survival ensembles
- The Adaptive Lasso and Its Oracle Properties
- The boosting approach to machine learning: an overview
- The elements of statistical learning. Data mining, inference, and prediction
- Unified methods for censored longitudinal data and causality
- Weak greedy algorithms
Cited in
(only showing first 100 items - show all)- Model building in nonproportional hazard regression
- High-dimensional L₂-Boosting: rate of convergence
- Generalised joint regression for count data: a penalty extension for competitive settings
- Kullback-Leibler-based discrete failure time models for integration of published prediction models with new time-to-event dataset
- Semiparametric regression during 2003--2007
- Spike-and-slab priors for function selection in structured additive regression models
- Sparse-Group Boosting: Unbiased Group and Variable Selection
- A penalty approach to differential item functioning in Rasch models
- scientific article; zbMATH DE number 1944955 (Why is no real title available?)
- Sparse kernel deep stacking networks
- Calibrating machine learning approaches for probability estimation: a comprehensive comparison
- Sample size and predictive performance of machine learning methods with survival data: a simulation study
- Lifetime analysis with monotonic degradation: a boosted first hitting time model based on a homogeneous gamma process
- A unified framework of constrained regression
- Ridge estimation for multinomial logit models with symmetric side constraints
- Boosted coefficient models
- Buckley-James boosting model based on extreme learning machine and random survival forests
- Boosting for real and functional samples: an application to an environmental problem
- Delta Boosting Machine with Application to General Insurance
- Two-step sparse boosting for high-dimensional longitudinal data with varying coefficients
- Early stopping for statistical inverse problems via truncated SVD estimation
- An update on statistical boosting in biomedicine
- Dimension reduction boosting
- Gradient boosting for generalised additive mixed models
- Boosting for detection of gene-environment interactions
- Model-based boosting 2.0
- A simple extension of boosting for asymmetric mislabeled data
- Forecasting retained earnings of privately held companies with PCA and \(L^1\) regression
- Pseudo-value regression trees
- Boosting iterative stochastic ensemble method for nonlinear calibration of subsurface flow models
- A boosting first-hitting-time model for survival analysis in high-dimensional settings
- Sparse and smooth additive isotonic model in high-dimensional settings
- Random gradient boosting for predicting conditional quantiles
- On the relevance of prognostic information for clinical trials: A theoretical quantification
- Loss-guided stability selection
- Three categories customer churn prediction based on the adjusted real AdaBoost
- Transformation boosting machines
- Sequential double cross-validation for assessment of added predictive ability in high-dimensional omic applications
- General sparse boosting: improving feature selection of \(L_{2}\) boosting by correlation-based penalty family
- A novel localized least-squares collocation method for coupled bulk-surface problems
- New multicategory boosting algorithms based on multicategory Fisher-consistent losses
- Invariance, causality and robustness
- Subject-specific Bradley–Terry–Luce models with implicit variable selection
- Boosting Distributional Copula Regression
- Boosting beyond the mean -- extending component-wise gradient boosting algorithms to multiple dimensions
- Promoting similarity of model sparsity structures in integrative analysis of cancer genetic data
- Variable selection and model choice in structured survival models
- A review on instance ranking problems in statistical learning
- Determining cutoff point of ensemble trees based on sample size in predicting clinical dose with DNA microarray data
- The reliability of classification of terminal nodes in GUIDE decision tree to predict the nonalcoholic fatty liver disease
- Boosting multi-state models
- Prediction-based variable selection for component-wise gradient boosting
- The functional linear array model
- Wavelet-based gradient boosting
- Predicting 5G throughput with BAMMO, a boosted additive model for data with missing observations
- Gradient boosting for linear mixed models
- Significance tests for boosted location and scale models with linear base-learners
- Automatic model selection for high-dimensional survival analysis
- On the choice and influence of the number of boosting steps for high-dimensional linear Cox-models
- Insurance loss modeling with gradient tree-boosted mixture models
- Boosting in Cox regression: a comparison between the likelihood-based and the model-based approaches with focus on the R-packages \textit{CoxBoost} and \textit{mboost}
- Boosting techniques for nonlinear time series models
- Probing for sparse and fast variable selection with model-based boosting
- Quantitative robustness of instance ranking problems
- De-noising boosting methods for variable selection and estimation subject to error-prone variables
- Boosting with missing predictors
- Predicting the Whole Distribution with Methods for Depth Data Analysis Demonstrated on a Colorectal Cancer Treatment Study
- Conditional transformation models for survivor function estimation
- Rank-based estimation in the \(\ell_1\)-regularized partly linear model for censored outcomes with application to integrated analyses of clinical predictors and gene expression data
- Empirical investigations of boosting with pseudo-outcome imputation for missing responses
- Accelerated gradient boosting
- Gradient-boosted generalized linear models for conditional vine copulas
- Boosting for statistical modelling-A non-technical introduction
- Advances in estimation and inference for historical functional linear models
- Use of majority votes in statistical learning
- Detection of differential item functioning in Rasch models by boosting techniques
- Rejoinder: Boosting algorithms: regularization, prediction and model fitting
- Functional gradient ascent for probit regression
- Boosting nonlinear additive autoregressive time series
- Boosting ridge regression
- Data-driven state-of-charge prediction of a storage cell using ABC/GBRT, ABC/MLP and Lasso machine learning techniques
- Genetic prediction modeling in large cohort studies via boosting targeted loss functions
- Feature selection filter for classification of power system operating states
- Comparison and contrast of two general functional regression modelling frameworks
- Comment on: Boosting algorithms: regularization, prediction and model fitting
- Modeling Postoperative Mortality in Older Patients by Boosting Discrete-Time Competing Risks Models
- Fast and scalable variable selection for spatial autoregressive models
- Unbiased Boosting Estimation for Censored Survival Data
- Boosting kernel-based dimension reduction for jointly propagating spatial variability and parameter uncertainty in long-running flow simulators
- A spatio-temporal machine learning model for mortgage credit risk: default probabilities and loan portfolios
- Boosting multiplicative model combination
- Boosting multivariate structured additive distributional regression models
- Modelling additive extremile regression by iteratively penalized least asymmetric weighted squares and gradient descent boosting
- Robust nonparametric integrative analysis to decipher heterogeneity and commonality across subgroups using sparse boosting
- The integrated calibration index (ICI) and related metrics for quantifying the calibration of logistic regression models
- An empirical comparison of classification algorithms for mortgage default prediction: evidence from a distressed mortgage market
- Bayesian variable selection and estimation in semiparametric joint models of multivariate longitudinal and survival data
- Boosting Prediction with Data Missing Not at Random
- Stochastic boosting algorithms
- Stochastic boosting algorithms
Describes a project that uses
Uses Software
This page was built for publication: Boosting algorithms: regularization, prediction and model fitting
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q449780)