Statistical modeling: The two cultures. (With comments and a rejoinder).

Publication:1431204

DOI: 10.1214/ss/1009213726, zbMath: 1059.62505, OpenAlex: W2084341220, Wikidata: Q29011672, Scholia: Q29011672, MaRDI QID: Q1431204

Leo Breiman

Publication date: 27 May 2004

Published in: Statistical Science

Full work available at URL: https://doi.org/10.1214/ss/1009213726



Related Items

The balance property in neural network modelling
What are the Most Important Statistical Ideas of the Past 50 Years?
A Guide to Teaching Data Science
Meeting Student Needs for Multivariate Data Analysis: A Case Study in Teaching an Undergraduate Multivariate Data Analysis Course
Challenges and Opportunities for Statistics and Statistical Education: Looking Back, Looking Forward
Teaching the Next Generation of Statistics Students to “Think With Data”: Special Issue on Statistics and the Undergraduate Curriculum
Mere Renovation is Too Little Too Late: We Need to Rethink our Undergraduate Curriculum from the Ground Up
The Second Course in Statistics: Design and Analysis of Experiments?
A Data Science Course for Undergraduates: Thinking With Data
The Application of Fuzzy Decision Trees in Company Audit Fee Evaluation: A Sensitivity Analysis
Supervised Machine Learning Techniques: An Overview with Applications to Banking
The explanation game: a formal framework for interpretable machine learning
Deep dynamic modeling with just two time points: Can we still allow for individual trajectories?
Orthogonalized Kernel Debiased Machine Learning for Multimodal Data Analysis
Discussion of “A Risk-Based Measure of Time-Varying Prognostic Discrimination for Survival Models,” by C. Jason Liang and Patrick J. Heagerty
On Some Principles of Statistical Inference
Implications of the Data Revolution for Statistics Education
Optimization over decision trees: a case study for the design of stable direct-current electricity networks
Deconfounding and Causal Regularisation for Stability and External Validity
Stable Discovery of Interpretable Subgroups via Calibration in Causal Studies
Missing data imputation in clinical trials using recurrent neural network facilitated by clustering and oversampling
Design Principles for Data Analysis
Rejoinder to “Nonparametric variable importance assessment using machine learning techniques”
Analytical Problem Solving Based on Causal, Correlational and Deductive Models
Some Foundational Aspects of Rough Sets Rendering Its Wide Applicability
Integration of model-based recursive partitioning with bias reduction estimation: a case study assessing the impact of Oliver's four factors on the probability of winning a basketball game
Penalized time-varying model averaging
Fast, Optimal, and Targeted Predictions Using Parameterized Decision Analysis
Understanding complex predictive models with ghost variables
Parsimony as the ultimate regularizer for physics-informed machine learning
The posterior predictive null
AutonoML: Towards an Integrated Framework for Autonomous Machine Learning
Automated Deep Learning: Neural Architecture Search Is Not the End
Random Forest Prediction Intervals
Visualizing the Implicit Model Selection Tradeoff
Geometry and applied statistics
A historical overview of textbook presentations of statistical science
Defining replicability of prediction rules
An evolutionary estimation procedure for generalized semilinear regression trees
Probabilistic prediction for binary treatment choice: with focus on personalized medicine
Behavioral analytics for myopic agents
Spatial performance analysis in basketball with CART, random forest and extremely randomized trees
Some models are useful, but how do we know which ones? Towards a unified Bayesian model taxonomy
Time-varying forecast combination for factor-augmented regressions with smooth structural changes
Bayesian hierarchical stacking: some models are (somewhere) useful
Deep learning in fluid dynamics
Comment
Aggregated functional data model for near-infrared spectroscopy calibration and prediction
Prediction, Estimation, and Attribution
Parameter Identifiability in Statistical Machine Learning: A Review
Comment
Data-Driven Identification of Parametric Partial Differential Equations
Hierarchical modelling in searching for complex patterns: constrained sums of information systems
Boosting for statistical modelling – A non-technical introduction
Exploring and modelling team performances of the Kaggle European Soccer database
What can modern statistics offer imaging neuroscience?
Remembering Leo Breiman
Remembering Leo
Sovereign risk zones in Europe during and after the debt crisis
The growing ubiquity of algorithms in society: implications, impacts and innovations
Rough Sets: From Rudiments to Challenges
Veridical data science
Prediction with missing data via Bayesian Additive Regression Trees
A predictive analytics approach for demand forecasting in the process industry
Portfolio selection in non-stationary markets
Comments on “Data science, big data and statistics”
Prediction, Estimation, and Attribution
Leadership in Statistics: Increasing Our Value and Visibility
The Need for More Emphasis on Prediction: A “Nondenominational” Model-Based Approach
Workshop on statistical approaches for the evaluation of complex computer models
Statistical fraud detection: a review
Comment
What is a statistical model? (With comments and rejoinder).
Reviews
MODEL OF MODELS: A NEW PERSPECTIVE TO DEAL WITH MODEL UNCERTAINTY
Do German economic research institutes publish efficient growth and inflation forecasts? A Bayesian analysis
Least angle regression. (With discussion)
On Tackling Explanation Redundancy in Decision Trees
Stability of continuous value discretisation: an application within rough set theory
Ridge regression and the Lasso: how do they do as finders of significant regressors and their multipliers?
Model averages sharpened into Occam’s razors: Deep learning enhanced by Rényi entropy
A conversation with Tom Louis
Estimating genetic architectures from artificial-selection responses: a random-effect framework
Using Differentiable Programming for Flexible Statistical Modeling
Demystifying Statistical Learning Based on Efficient Influence Functions
A Problem of Distributive Justice, Solved by the Lasso
Classifier technology and the illusion of progress
On the role of statistics in the era of big data: a call for a debate
Data learning from big data
On the role of statistics in the era of big data: a computer science perspective
An Effective Bayesian Neural Network Classifier with a Comparison Study to Support Vector Machine
Mixing partially linear regression models
Estimation and Inference of Heterogeneous Treatment Effects using Random Forests
The unreasonable effectiveness of deep learning in artificial intelligence
Evaluating human behaviour in response to AI recommendations for judgemental forecasting
Variable selection – A review and recommendations for the practicing statistician
Modern Koopman Theory for Dynamical Systems
Rough sets: some extensions
A globally convergent algorithm for Lasso-penalized mixture of linear regression models
Best subset selection, persistence in high-dimensional statistical learning and optimization under \(l_1\) constraint
An update on statistical boosting in biomedicine
A survey on enhanced subspace clustering
The Holdout Randomization Test for Feature Selection in Black Box Models
Collective reserving using individual claims data
Measuring regional effects of model inputs with random Forest
Preference disaggregation and statistical learning for multicriteria decision support: A review
Comment on: “Models as approximations”
Penalized spline support vector classifiers computational issues
Interpretable classifiers using rules and Bayesian analysis: building a better stroke prediction model
Heuristic optimization methods for dynamic panel data model selection: application on the Russian innovative performance
To explain or to predict?
A Bayesian perspective of statistical machine learning for big data
Structures and assumptions: strategies to harness gene \(\times\) gene and gene \(\times\) environment interactions in GWAS
Bayesian synthesis: combining subjective analyses, with an application to ozone data
Support vector machines with applications
Fisher lecture: Dimension reduction in regression
SIRUS: stable and interpretable RUle set for classification
Three-Dimensional Bifurcation Analysis of a Predator-Prey Model with Uncertain Formulation
Modeling and inference for infectious disease dynamics: a likelihood-based approach
Multi-scale support vector algorithms for hot spot detection and modelling
ESTIMATING UNIVARIATE DISTRIBUTIONS VIA RELATIVE ENTROPY MINIMIZATION: CASE STUDIES ON FINANCIAL AND ECONOMIC DATA
Leveraged least trimmed absolute deviations
Unnamed Item
Behavioral modeling in weight loss interventions
Bundling classifiers by bagging trees
Model selection and local geometry
All Models are Wrong, but Many are Useful: Learning a Variable's Importance by Studying an Entire Class of Prediction Models Simultaneously
Predictive Distribution Modeling Using Transformation Forests
Persistence in high-dimensional linear predictor-selection and the virtue of overparametrization
A novel technique of object ranking and classification under ignorance: an application to the corporate failure risk problem
Remembering Leo Breiman
Leo Breiman: An important intellectual and personal force in statistics, my life and that of many others
Navigating random forests and related advances in algorithmic modeling
Adjusted chi-square test for degree-corrected block models
Mathematical models and reality: A constructivist perspective
Bayesian Weibull tree models for survival analysis of clinico-genomic data
Data science, big data and statistics
Comments on “Data science, big data and statistics”
Rejoinder on “Data science, big data and statistics”
Learning causal effect using machine learning with application to China's typhoon
Shewhart’s Idea of Predictability and Modern Statistics
A report on the future of statistics (with comments and rejoinder)
Delta Boosting Machine with Application to General Insurance
Popular raster-based methods of prospectivity modeling and their relationships
On estimation and inference in latent structure random graphs
Agglomerative joint clustering of metabolic data with spike at zero: A Bayesian perspective
The Rise of Statistical Phylogenetics
Multiple choice from competing regression models under multicollinearity based on standardized update
Pitfalls and merits of cointegration-based mortality models
Interpretation of Black-Box Predictive Models
Prediction-based regularization using data augmented regression
Data-driven estimation in equilibrium using inverse optimization
Price probabilities: a class of Bayesian and non-Bayesian prediction rules
Measuring the Stability of Results From Supervised Statistical Learning
ABC–CDE: Toward Approximate Bayesian Computation With Complex High-Dimensional Data and Limited Simulations
Classification tree analysis using TARGET
Empirical characterization of random forest variable importance measures
Interpretable machine learning: fundamental principles and 10 grand challenges
Symmetrical and non-symmetrical variants of three-way correspondence analysis for ordered variables
The foundations of statistical science: a history of textbook presentations
Some challenges for statistics
Comment: Will competition-winning methods for causal inference also succeed in practice?
Data science vs. statistics: two cultures?
Models only say what they're told to say
Applying Deep Reinforcement Learning in Automated Stock Trading
Machine learning versus statistical modeling
Risk prediction with machine learning and regression methods
Evaluating the impact of a grouping variable on job satisfaction drivers
Parsimonious classification via generalized linear mixed models



Cites Work