The elements of statistical learning. Data mining, inference, and prediction



zbMath: 0973.62007, MaRDI QID: Q5943421

Robert Tibshirani, Jerome H. Friedman, Trevor Hastie

Publication date: 23 September 2001

Published in: Springer Series in Statistics


62-01: Introductory exposition (textbooks, tutorial papers, etc.) pertaining to statistics

68T05: Learning and adaptive systems in artificial intelligence

62C99: Statistical decision theory

68T99: Artificial intelligence


Related Items

Predictive learning via rule ensembles, Ant colony optimization for continuous domains, SVM-Maj: a majorization approach to linear support vector machines with different hinge errors, ElemStatLearn, Fast exact leave-one-out cross-validation of sparse least-squares support vector machines, Auto-associative models and generalized principal component analysis, Statistical properties of distance estimators, Dynamic path analysis -- a new approach to analyzing time-dependent covariates, Adaptive stepsizes for recursive estimation with applications in approximate dynamic programming, Image classification with the use of radial basis function neural networks and the minimization of the localized generalization error, Logistic regression using covariates obtained by product-unit neural network models, A survey of content-based image retrieval with high-level semantics, Handwritten digit classification using higher order singular value decomposition, A two-stage algorithm for identification of nonlinear dynamic systems, An effective architecture for learning and evolving flexible job-shop schedules, Filter-based classification of training image patterns for spatial simulation, Rudiments of rough sets, Rough sets: some extensions, Rough sets and Boolean reasoning, Constrained estimation and the theorem of Kuhn-Tucker, Self-generating prototypes for pattern classification, Modeling and optimizing a vendor managed replenishment system using machine learning and genetic algorithms, Best subset selection, persistence in high-dimensional statistical learning and optimization under \(l_1\) constraint, Improved customer choice predictions using ensemble methods, Genetic algorithms for the selection of smoothing parameters in additive models, Regularization in statistics, Identification of critical genes in microarray experiments by a neuro-fuzzy approach, Modeling nonlinearities with mixtures-of-experts of time series models, On the computational complexity of the minimum committee 
problem, An extension of Fisher's discriminant analysis for stochastic processes, Reliable computing with unreliable components: Using separable environments to stabilize long-term information storage, Rejoinder: One-step sparse estimates in nonconcave penalized likelihood models, A comparison of several nearest neighbor classifier metrics using tabu search algorithm for the feature selection problem, Biometric dispersion matcher, Locally linear reconstruction for instance-based learning, Augmenting the bootstrap to analyze high dimensional genomic data, A new bootstrap-based forecast evaluation method tested on time series, Feature selection using localized generalization error for supervised classification problems using RBFNN, Partial least squares Cox regression for genome-wide data, Bayesian shrinkage prediction for the regression problem, Enhanced piecewise regression based on deterministic annealing, Counting and enumerating aggregate classifiers, PCA and SVD with nonnegative loadings, Variable selection bias in regression trees with constant fits, Partial least-squares vs. Lanczos bidiagonalization. 
I: Analysis of a projection method for multiple regression, On Stein's lemma, dependent covariates and functional monotonicity in multi-dimensional modeling, Separation index and partial membership for clustering, Identification of interaction patterns and classification with applications to microarray data, Performing hypothesis tests on the shape of functional data, On optimum choice of \(k\) in nearest neighbor classification, Bandwidth selection for a class of difference-based variance estimators in the nonparametric regression: a possible approach, Visualizing “typical” and “exotic” internet traffic data, Optimization of nearest neighbor classifiers via metaheuristic algorithms for credit risk assessment, When do stepwise algorithms meet subset selection criteria?, Asymptotic normality of the recursive M-estimators of the scale parameters, Functional dissipation microarrays for classification, Iterative sliced inverse regression for segmentation of ultrasound and MR images, Tree-structured regression and the differentiation of integrals, Deriving weights in multiple-criteria decision making with support vector machines, Regularized finite mixture models for probability trajectories, New multicategory boosting algorithms based on multicategory Fisher-consistent losses, Bayesian multinomial regression with class-specific predictor selection, Controlled stratification for quantile estimation, Spectrum estimation for large dimensional covariance matrices using random matrix theory, Support vector machines for classification of aggregates by means of IR-spectra, Optimization of gene-environment networks in the presence of errors and uncertainty with Chebychev approximation, Symmetric measures via moments, An algorithm for the recognition of levels of congestion in road traffic problems, Degrees of conditional (in)dependence: A framework for approximate Bayesian networks and examples related to the rough set-based feature selection, Full-body person recognition 
system., Concentration of measure and cluster analysis., Compound decision theory and empirical Bayes methods, Mining data to find subsets of high activity., Assessing model mimicry using the parametric bootstrap., Functional multi-layer perceptron: A nonlinear tool for functional data analysis, Model-based mixture discriminant analysis -- an experimental study, Bayesian methods for neural networks and related models, On data depth and distribution-free discriminant analysis using separating surfaces, A note on margin-based loss functions in classification, Automatic feature extraction for classifying audio data, Backfitting neural networks, General empirical Bayes wavelet methods and exactly adaptive minimax estimation, On the connections between statistical disclosure control for microdata and some artificial intelligence tools, Quasi-regression with shrinkage, Radial basis function interpolation in the quantum trajectory method: optimization of the multi-quadric shape parameter., Problems in gene clustering based on gene expression data, Exploring interactions in high-dimensional genomic data: an overview of logic regression, with applications, Least angle regression. 
(With discussion), Nonparametric regression analysis of uncertain and imprecise data using belief functions, A list-based compact representation for large decision tables management, Practical selection of SVM parameters and noise estimation for SVM regression, Complexity control in statistical learning, A survey of temporal data mining, Optimization and applications of echo state networks with leaky- integrator neurons, Nonlinear principal component analysis of noisy data, Nonlinear analog predictor analysis: A coupled neural network/analog model for climate downscaling, Environmentally adaptive acoustic transmission loss prediction in turbulent and nonturbulent atmospheres, Applications of regularized least squares to pattern classification, Opportunities and challenges applying functional data analysis to the study of open source software evolution, Data mining in electronic commerce, Stein's identity, Fisher information, and projection pursuit: A triangulation, Optimally regularised kernel Fisher discriminant classification, Learning payoff functions in infinite games, Predictive modelling of heterogeneous sequence collections by topographic ordering of histories, Annealing stochastic approximation Monte Carlo algorithm for neural network training, Clustering and combinatorial optimization in recursive supervised learning, The analysis of ordered categorical data: An overview and a survey of recent developments. 
(With discussion), A combined approach for segment-specific market basket analysis, Consistency of spectral clustering, High-dimensional generalized linear models and the lasso, Bounds for Bayesian order identification with application to mixtures, Quadratic distances on probabilities: A unified foundation, Structured variable selection in support vector machines, Generative models for similarity-based classification, Clustering of biological time series by cepstral coefficients based distances, Masking effects on linear regression in multi-class classification, Using virtual sample generation to build up management knowledge in the early manufacturing stages, Randomised restarted search in ILP, Quantitative pharmacophore models with inductive logic programming, Bayesian variable selection for high dimensional generalized linear models: convergence rates of the fitted densities, Regularized estimation for preference disaggregation in multiple criteria decision making, Automatic construction of feedforward/recurrent fuzzy systems by clustering-aided simplex particle swarm optimization, Optimal threshold analysis of segmentation methods for identifying target customers, Extensions of vector quantization for incremental clustering, Analysis of an alignment algorithm for nonlinear dimensionality reduction, Describing disability through individual-level mixture models for multivariate binary data, On the “degrees of freedom” of the lasso, Accelerated convergence for nonparametric regression with coarsened predictors, Robust multiclass kernel-based classifiers, Two-parameter ridge regression and its convergence to the eventual pairwise model, Metric learning by discriminant neighborhood embedding, Rodeo: Sparse, greedy nonparametric regression, Approximation and learning by greedy algorithms, Generalized mixture models, semi-supervised learning, and unknown class inference, An optimal choice of window width for LOWESS normalization of microarray data, Sparse estimation 
of large covariance matrices via a nested Lasso penalty, Identification of MIMO Hammerstein models using least squares support vector machines, Frequency-based views to pattern collections, Reducing multiclass cancer classification to binary by output coding and SVM, Computational intelligence in earth sciences and environmental applications: issues and challenges., Additive regularization trade-off: fusion of training and validation levels in kernel methods, Propagation-separation approach for local likelihood estimation, An algebra of human concept learning, Diffusion maps, spectral clustering and reaction coordinates of dynamical systems, Spectral independent component analysis, Diversification for better classification trees, Interpolation of Lipschitz functions, Gene function classification using NCI-60 cell line gene expression profiles, Fast string matching by using probabilities: on an optimal mismatch variant of Horspool's algorithm, The sign statistic, one-way layouts and mixture models, Kernel smoothers: an overview of curve estimators for the first graduate course in nonparametric statistics, Convex kernel underestimation of functions with multiple local minima, Modeling individual differences using Dirichlet processes, Bayesian nonparametric model selection and model testing, Model selection by normalized maximum likelihood, Goodness-of-fit and confidence intervals of approximate models, Image segmentation by using the localized subspace iteration algorithm, Mathematical programming based heuristics for improving LP-generated classifiers for the multiclass supervised classification problem, Handling missing values in support vector machine classifiers, A tandem clustering process for multimodal datasets, A model for prejudiced learning in noisy environments, Partially adaptive robust estimation of regression models and applications, On the consistency properties of linear and quadratic discriminant analyses, Boosting with early stopping: convergence 
and consistency, Piecewise linear regularized solution paths, Classifiers of support vector machine type with \(\ell_1\) complexity regularization, Test-based classification: A linkage between classification and statistical testing, On generalized semi-infinite optimization of genetic networks, Aligned Rank Transform Techniques for Analysis of Variance and Multiple Comparisons, Theory of Classification: a Survey of Some Recent Advances, Free-Knot Spline Smoothing for Functional Data, A valuation model for cut diamonds, Partially Supervised Learning Using an EM‐Boosting Algorithm, Computational Biology: Toward Deciphering Gene Regulatory Information in Mammalian Genomes, Regularized Estimation in the Accelerated Failure Time Model with High-Dimensional Covariates, Non-parametric modelling of time-varying customer service times at a bank call centre, Assessing the risk situation of network security for active defense, Mathematical contributions to dynamics and optimization of gene-environment networks, Logistic discrimination using robust estimators: An influence function approach, Characterization of Graphs Using Degree Cores, Independence Decomposition in Dynamic Bayesian Networks, Measuring Time Series Predictability Using Support Vector Regression, Variable Selection in Penalized Model‐Based Clustering Via Regularization on Grouped Parameters, ON SEMIPARAMETRIC REGRESSION WITH O'SULLIVAN PENALIZED SPLINES, A comparison of classification models to identify the Fragile X Syndrome, Summer temperature effects on deaths and hospital admissions among the elderly population in two Italian cities, Fluid flow pattern analysis in a trough region: a nonparametric approach, Analysis of growth curve data by using cubic smoothing splines, Logistic Discrimination with Total Variation Regularization, ROC‐Based Utility Function Maximization for Feature Selection and Classification with Applications to High‐Dimensional Protease Data, Functional approaches for predicting land use 
with the temporal evolution of coarse resolution remote sensing data, A COMPARISON OF MIXED MODEL SPLINES FOR CURVE FITTING, ACCELERATED FAILURE TIME MODELS WITH NONLINEAR COVARIATES EFFECTS, Data-driven evolving fuzzy systems using eTS and FLEXFIS: comparative analysis, Using ensemble and metaheuristics learning principles with artificial neural networks to improve due date prediction performance, Noisy Independent Component Analysis as a Method of Rotating the Factor Scores, Mining representative subset based on fuzzy clustering, Analysis of Heat Wave Effects on Health by Using Generalized Additive Model and Bootstrap-Based Model Selection, Factor Analysis as Data Matrix Decomposition: A New Approach for Quasi-Sphering in Noisy ICA, New Bootstrap Applications in Supervised Learning, Self-Modelling Warping Functions, Sparsity and Smoothness Via the Fused Lasso, Aggregating classifiers with ordinal response structure, Feature Informativeness in High-Dimensional Discriminant Analysis, Environmental Statistics—A Personal View, Results concerning the generalized partially linear single-index model, Regularization and Variable Selection Via the Elastic Net, Wavelet classification of high frequency pupillary responses, Generalized Additive Modeling with Implicit Variable Selection by Likelihood‐Based Boosting, Variable Selection for Logistic Regression Using a Prediction‐Focused Information Criterion, Feature‐Specific Penalized Latent Class Analysis for Genomic Data, Motor Unit Number Estimation—A Bayesian Approach, Nonlinear evolution of the Richtmyer–Meshkov instability, Linear Regression Model-Guided Clustering for Training RBF Networks for Regression Problems, Search for relevant sets of variables in a high‐dimensional setup keeping the familywise error rate, A Note on Breiman's Random Forest Data Mining Technique and Conventional Cox Modeling of Survival Statistics: The Case of the Phantom “Induct” Covariate in the Ohio State University Kidney Transplant Database, 
Smoothing Lipschitz functions, A graph-based estimator of the number of clusters, Bus Arrival Time Prediction Using Support Vector Machines, Selection of Binary Variables and Classification by Boosting, High-Dimensional Discriminant Analysis, Churn detection via customer profile modelling, Penalized Item Response Theory Models: Application to Epigenetic Alterations in Bladder Cancer, Time Series Classification Based on Spectral Analysis, Boosted Regression Trees with Errors in Variables, Statistical modelling of functional data, Combining Predictors for Classification Using the Area under the Receiver Operating Characteristic Curve, Adaptive smoothing in kernel discriminant analysis, Generalized Additive Models for Location, Scale and Shape, The Neglog Transformation and Quantile Regression for the Analysis of a Large Credit Scoring Database, LATTICE OPTION PRICING BY MULTIDIMENSIONAL INTERPOLATION, A statistical, self-organizing learning system with validation, Extremely randomized trees, Tight Clustering: A Resampling‐Based Approach for Identifying Stable and Tight Patterns in Data, Modeling Hidden Exposures in Claim Severity Via the Em Algorithm, Assessing the Skill of Yes/No Predictions, A Hierarchical Bayesian Model for Predicting the Functional Consequences of Amino-Acid Polymorphisms, Logistic model trees


Uses Software