Nonparametric estimation (62G05) Density estimation (62G07) Nonparametric regression and quantile regression (62G08) Asymptotic properties of nonparametric inference (62G20) Classification and discrimination; cluster analysis (statistical aspects) (62H30) Learning and adaptive systems in artificial intelligence (68T05)
Abstract: We propose generalized random forests, a method for non-parametric statistical estimation based on random forests (Breiman, 2001) that can be used to fit any quantity of interest identified as the solution to a set of local moment equations. Following the literature on local maximum likelihood estimation, our method considers a weighted set of nearby training examples; however, instead of using classical kernel weighting functions that are prone to a strong curse of dimensionality, we use an adaptive weighting function derived from a forest designed to express heterogeneity in the specified quantity of interest. We propose a flexible, computationally efficient algorithm for growing generalized random forests, develop a large sample theory for our method showing that our estimates are consistent and asymptotically Gaussian, and provide an estimator for their asymptotic variance that enables valid confidence intervals. We use our approach to develop new methods for three statistical tasks: non-parametric quantile regression, conditional average partial effect estimation, and heterogeneous treatment effect estimation via instrumental variables. A software implementation, grf for R and C++, is available from CRAN.
Recommendations
Cites work
- scientific article; zbMATH DE number 6378123 (Why is no real title available?)
- scientific article; zbMATH DE number 991833 (Why is no real title available?)
- scientific article; zbMATH DE number 3860199 (Why is no real title available?)
- scientific article; zbMATH DE number 3738700 (Why is no real title available?)
- scientific article; zbMATH DE number 3782216 (Why is no real title available?)
- scientific article; zbMATH DE number 1911984 (Why is no real title available?)
- A Class of Statistics with Asymptotically Normal Distribution
- A Unified Approach to Structural Change Tests Based on ML Scores,FStatistics, and OLS Residuals
- A local generalized method of moments estimator
- A random forest guided tour
- Analysis of a random forests model
- Analyzing bagging
- Applied Econometrics with R
- Asymptotic Statistics
- BART: Bayesian additive regression trees
- Bagging predictors
- Comments on: ``A random forest guided tour
- Consistency of random forests
- Consistency of random forests and other averaging classifiers
- Consistency of random survival forests
- Consistent nonparametric regression. Discussion
- Double/debiased machine learning for treatment and structural parameters
- Econometric analysis of cross section and panel data.
- Estimation and Inference of Heterogeneous Treatment Effects using Random Forests
- Extremely randomized trees
- Generalized M‐fluctuation tests for parameter instability
- Generalized random forests
- Greedy function approximation: A gradient boosting machine.
- Identification and Estimation of Local Average Treatment Effects
- Instrumental Variable Estimation of Nonparametric Models
- Local Likelihood Estimation
- Local Maximum Likelihood Estimation and Inference
- Local Regression and Likelihood
- Nonparametric instrumental regression
- On asymptotically efficient estimation in semiparametric models
- On the layered nearest neighbour estimate, the bagged nearest neighbour estimate and the random forest method in regression and classification
- Panel Data Discrete Choice Models with Lagged Dependent Variables
- Quantifying uncertainty in random forests via confidence intervals and hypothesis tests
- Quantile regression forests
- Random Forests and Adaptive Nearest Neighbors
- Random forests
- Recursive partitioning for heterogeneous causal effects
- Reinforcement learning trees
- Root-N-Consistent Semiparametric Regression
- Semiparametric instrumental variable estimation of treatment response models.
- Some Comments on C P
- Sparse models and methods for optimal instruments with an application to eminent domain
- Standard errors for bagged and random forest estimators
- Testing for the Constancy of Parameters Over Time
- Tests For Constancy Of Model Parameters Over Time
- Tests for Parameter Instability and Structural Change With Unknown Change Point
- The Asymptotic Variance of Semiparametric Estimators
- The Cusum Test with Ols Residuals
- The Influence Curve and Its Role in Robust Estimation
- The Kernel Estimate of a Regression Function in Likelihood-Based Models
- The central role of the propensity score in observational studies for causal effects
- The jackknife estimate of variance
- Tree-based multivariate regression and density estimation with right-censored data
Cited in
(only showing first 100 items - show all)- Generalizing treatment effects with incomplete covariates: identifying assumptions and multiple imputation algorithms
- Interaction forests: identifying and exploiting interpretable quantitative and qualitative interaction effects
- Towards convergence rate analysis of random forests for classification
- Random Forest Adjustment for Approximate Bayesian Computation
- On variance estimation of random forests with Infinite-order U-statistics
- Quantile generalized measures of correlation
- A distance-based test of independence between two multivariate time series
- Local Linear Forests
- SETAR-Tree: a novel and accurate tree algorithm for global time series forecasting
- To do or not to do? Cost-sensitive causal classification with individual treatment effect estimates
- Improving uplift model evaluation on randomized controlled trial data
- Semiparametric estimation for average causal effects using propensity score-based spline
- Comment: Invariance and causal inference
- Rates of convergence for random forests via generalized U-statistics
- A Pliable Lasso
- Robust inference of conditional average treatment effects using dimension reduction
- Minimax optimal rates for Mondrian trees and forests
- A reluctant additive model framework for interpretable nonlinear individualized treatment rules
- Response transformation and profit decomposition for revenue uplift modeling
- Asymptotic properties of high-dimensional random forests
- Orthogonal statistical learning
- Methods for integrating trials and non-experimental data to examine treatment effect heterogeneity
- Quantile regression forests
- Heterogeneous treatment effect-based random forest: HTERF
- Instrument Validity Tests With Causal Forests
- A conditional linear combination test with many weak instruments
- Optimal Nonparametric Inference with Two-Scale Distributional Nearest Neighbors
- Distributional regression forests for probabilistic precipitation forecasting in complex terrain
- Transformation boosting machines
- Bayesian regression tree models for causal inference: regularization, confounding, and heterogeneous effects (with discussion)
- A semiparametric instrumental variable approach to optimal treatment regimes under endogeneity
- scientific article; zbMATH DE number 7626802 (Why is no real title available?)
- Gradient boosting for extreme quantile regression
- Recent advances in statistical methodologies in evaluating program for high-dimensional data
- Learning causal effect using machine learning with application to China's typhoon
- ROC‐guided survival trees and ensembles
- Attention-based random forest and contamination model
- Discussion of Kallus (2020) and Mo, Qi, and Liu (2020): New Objectives for Policy Learning
- Nonparametric C- and D-vine-based quantile regression
- Predictive Distribution Modeling Using Transformation Forests
- Robust nonparametric regression: a review
- scientific article; zbMATH DE number 7370548 (Why is no real title available?)
- Manifold Oblique Random Forests: Towards Closing the Gap on Convolutional Deep Networks
- Sufficient dimension reduction for average causal effect estimation
- Two-stage least squares random forests with an application to Angrist and Evans (1998)
- Estimation and validation of ratio-based conditional average treatment effects using observational data
- Nonunitarizable Representations and Random Forests
- Robust estimation of heterogeneous treatment effects: an algorithm-based approach
- The Effect of Job Loss and Unemployment Insurance on Crime in Brazil
- Forward variable selection for random forest models
- Sharp Sensitivity Analysis for Inverse Propensity Weighting via Quantile Balancing
- grf
- Learning optimal biomarker-guided treatment policy for chronic disorders
- Ranking of average treatment effects with generalized random forests for time-to-event outcomes
- Text Selection
- Comparison of methods that combine multiple randomized trials to estimate heterogeneous treatment effects
- Exploratory subgroup identification in the heterogeneous Cox model: a relatively simple procedure
- Learning and confirming a class of treatment responders in clinical trials
- Detecting heterogeneous treatment effects with instrumental variables and application to the Oregon Health Insurance Experiment
- Dimension Reduction Forests: Local Variable Importance Using Structured Random Forests
- Doubly robust treatment effect estimation with missing attributes
- A Random Forest Approach for Bounded Outcome Variables
- scientific article; zbMATH DE number 7370525 (Why is no real title available?)
- Comparing algorithms for characterizing treatment effect heterogeneity in randomized trials
- Medoid splits for efficient random forests in metric spaces
- Estimation and evaluation of individualized treatment rules following multiple imputation
- Causal mediation analysis with latent subgroups
- A tutorial on individualized treatment effect prediction from randomized trials with a binary endpoint
- Comparing Covariate Prioritization via Matching to Machine Learning Methods for Causal Inference Using Five Empirical Applications
- Designing optimal, data-driven policies from multisite randomized trials
- Using Wasserstein generative adversarial networks for the design of Monte Carlo simulations
- Ordinal trees and random forests: score-free recursive partitioning and improved ensembles
- Time series quantile regression using random forests
- Semiparametric estimation of long-term treatment effects
- Quantile regression by dyadic CART
- HETEROGENEOUS TREATMENT EFFECTS OF NUDGE AND REBATE: CAUSAL MACHINE LEARNING IN A FIELD EXPERIMENT ON ELECTRICITY CONSERVATION
- Model-based random forests for ordinal regression
- Building Trees for Probabilistic Prediction via Scoring Rules
- New forest-based approaches for sufficient dimension reduction
- Heterogeneous causal effects with imperfect compliance: a Bayesian machine learning approach
- An embedded model estimator for non-stationary random functions using multiple secondary variables
- Bayesian additive regression trees with model trees
- Generalized random forests
- Coalescent random forests
- Consistency of survival tree and forest models: splitting bias and correction
- Extremal Random Forests
- Evaluating individualized treatment effect predictions: a model-based perspective on discrimination and calibration assessment
- Modern approaches for evaluating treatment effect heterogeneity from clinical trials and observational data
- Learning when-to-treat policies
- Neural random forests
- Experimental Evaluation of Individualized Treatment Rules
- Regularizing double machine learning in partially linear endogenous models
- Contrast trees and distribution boosting
- Uncertainty quantification for honest regression trees
- Augmented minimax linear estimation
- Random forests
- Critical random forests
- Linear Aggregation in Tree-Based Estimators
- Is there a role for statistics in artificial intelligence?
- Cross-Validation, Risk Estimation, and Model Selection: Comment on a Paper by Rosset and Tibshirani
This page was built for publication: Generalized random forests
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q666599)