Generalized random forests
From MaRDI portal
Nonparametric estimation (62G05) Density estimation (62G07) Nonparametric regression and quantile regression (62G08) Asymptotic properties of nonparametric inference (62G20) Classification and discrimination; cluster analysis (statistical aspects) (62H30) Learning and adaptive systems in artificial intelligence (68T05)
Abstract: We propose generalized random forests, a method for non-parametric statistical estimation based on random forests (Breiman, 2001) that can be used to fit any quantity of interest identified as the solution to a set of local moment equations. Following the literature on local maximum likelihood estimation, our method considers a weighted set of nearby training examples; however, instead of using classical kernel weighting functions that are prone to a strong curse of dimensionality, we use an adaptive weighting function derived from a forest designed to express heterogeneity in the specified quantity of interest. We propose a flexible, computationally efficient algorithm for growing generalized random forests, develop a large sample theory for our method showing that our estimates are consistent and asymptotically Gaussian, and provide an estimator for their asymptotic variance that enables valid confidence intervals. We use our approach to develop new methods for three statistical tasks: non-parametric quantile regression, conditional average partial effect estimation, and heterogeneous treatment effect estimation via instrumental variables. A software implementation, grf for R and C++, is available from CRAN.
Recommendations
Cites work
- scientific article; zbMATH DE number 6378123 (Why is no real title available?)
- scientific article; zbMATH DE number 991833 (Why is no real title available?)
- scientific article; zbMATH DE number 3860199 (Why is no real title available?)
- scientific article; zbMATH DE number 3738700 (Why is no real title available?)
- scientific article; zbMATH DE number 3782216 (Why is no real title available?)
- scientific article; zbMATH DE number 1911984 (Why is no real title available?)
- A Class of Statistics with Asymptotically Normal Distribution
- A Unified Approach to Structural Change Tests Based on ML Scores,FStatistics, and OLS Residuals
- A local generalized method of moments estimator
- A random forest guided tour
- Analysis of a random forests model
- Analyzing bagging
- Applied Econometrics with R
- Asymptotic Statistics
- BART: Bayesian additive regression trees
- Bagging predictors
- Comments on: ``A random forest guided tour
- Consistency of random forests
- Consistency of random forests and other averaging classifiers
- Consistency of random survival forests
- Consistent nonparametric regression. Discussion
- Double/debiased machine learning for treatment and structural parameters
- Econometric analysis of cross section and panel data.
- Estimation and Inference of Heterogeneous Treatment Effects using Random Forests
- Extremely randomized trees
- Generalized M‐fluctuation tests for parameter instability
- Generalized random forests
- Greedy function approximation: A gradient boosting machine.
- Identification and Estimation of Local Average Treatment Effects
- Instrumental Variable Estimation of Nonparametric Models
- Local Likelihood Estimation
- Local Maximum Likelihood Estimation and Inference
- Local Regression and Likelihood
- Nonparametric instrumental regression
- On asymptotically efficient estimation in semiparametric models
- On the layered nearest neighbour estimate, the bagged nearest neighbour estimate and the random forest method in regression and classification
- Panel Data Discrete Choice Models with Lagged Dependent Variables
- Quantifying uncertainty in random forests via confidence intervals and hypothesis tests
- Quantile regression forests
- Random Forests and Adaptive Nearest Neighbors
- Random forests
- Recursive partitioning for heterogeneous causal effects
- Reinforcement learning trees
- Root-N-Consistent Semiparametric Regression
- Semiparametric instrumental variable estimation of treatment response models.
- Some Comments on C P
- Sparse models and methods for optimal instruments with an application to eminent domain
- Standard errors for bagged and random forest estimators
- Testing for the Constancy of Parameters Over Time
- Tests For Constancy Of Model Parameters Over Time
- Tests for Parameter Instability and Structural Change With Unknown Change Point
- The Asymptotic Variance of Semiparametric Estimators
- The Cusum Test with Ols Residuals
- The Influence Curve and Its Role in Robust Estimation
- The Kernel Estimate of a Regression Function in Likelihood-Based Models
- The central role of the propensity score in observational studies for causal effects
- The jackknife estimate of variance
- Tree-based multivariate regression and density estimation with right-censored data
Cited in
(only showing first 100 items - show all)- Towards convergence rate analysis of random forests for classification
- Rates of convergence for random forests via generalized U-statistics
- Bayesian additive regression trees with model trees
- Transformation boosting machines
- Interaction forests: identifying and exploiting interpretable quantitative and qualitative interaction effects
- Dimension Reduction Forests: Local Variable Importance Using Structured Random Forests
- Contrast trees and distribution boosting
- Cross-Validation, Risk Estimation, and Model Selection: Comment on a Paper by Rosset and Tibshirani
- Doubly robust treatment effect estimation with missing attributes
- Random forests
- scientific article; zbMATH DE number 7626802 (Why is no real title available?)
- Semiparametric estimation for average causal effects using propensity score-based spline
- Two-stage least squares random forests with an application to Angrist and Evans (1998)
- Attention-based random forest and contamination model
- Coalescent random forests
- A distance-based test of independence between two multivariate time series
- scientific article; zbMATH DE number 7370525 (Why is no real title available?)
- Nonparametric C- and D-vine-based quantile regression
- Semiparametric estimation of long-term treatment effects
- Predictive Distribution Modeling Using Transformation Forests
- Quantile regression forests
- Minimax optimal rates for Mondrian trees and forests
- Recent advances in statistical methodologies in evaluating program for high-dimensional data
- Detecting heterogeneous treatment effects with instrumental variables and application to the Oregon Health Insurance Experiment
- Comparing Covariate Prioritization via Matching to Machine Learning Methods for Causal Inference Using Five Empirical Applications
- Linear Aggregation in Tree-Based Estimators
- Learning causal effect using machine learning with application to China's typhoon
- Bayesian regression tree models for causal inference: regularization, confounding, and heterogeneous effects (with discussion)
- Consistency of survival tree and forest models: splitting bias and correction
- Response transformation and profit decomposition for revenue uplift modeling
- Neural random forests
- Asymptotic properties of high-dimensional random forests
- Nonunitarizable Representations and Random Forests
- Critical random forests
- Learning when-to-treat policies
- grf
- Heterogeneous causal effects with imperfect compliance: a Bayesian machine learning approach
- Generalized random forests
- Uncertainty quantification for honest regression trees
- Augmented minimax linear estimation
- Discussion of Kallus (2020) and Mo, Qi, and Liu (2020): New Objectives for Policy Learning
- A Random Forest Approach for Bounded Outcome Variables
- Gradient boosting for extreme quantile regression
- Local Linear Forests
- A Pliable Lasso
- Orthogonal statistical learning
- A semiparametric instrumental variable approach to optimal treatment regimes under endogeneity
- Targeting customers under response-dependent costs
- scientific article; zbMATH DE number 7370548 (Why is no real title available?)
- To do or not to do? Cost-sensitive causal classification with individual treatment effect estimates
- Robust inference of conditional average treatment effects using dimension reduction
- Bounds on the conditional and average treatment effect with unobserved confounding factors
- Distributional regression forests for probabilistic precipitation forecasting in complex terrain
- An embedded model estimator for non-stationary random functions using multiple secondary variables
- Comment: Invariance and causal inference
- Sufficient dimension reduction for average causal effect estimation
- Experimental Evaluation of Individualized Treatment Rules
- Regularizing double machine learning in partially linear endogenous models
- Ordinal trees and random forests: score-free recursive partitioning and improved ensembles
- Comparing algorithms for characterizing treatment effect heterogeneity in randomized trials
- A conditional linear combination test with many weak instruments
- Medoid splits for efficient random forests in metric spaces
- Estimation and evaluation of individualized treatment rules following multiple imputation
- Optimal Nonparametric Inference with Two-Scale Distributional Nearest Neighbors
- Causal mediation analysis with latent subgroups
- A tutorial on individualized treatment effect prediction from randomized trials with a binary endpoint
- Robust estimation of heterogeneous treatment effects: an algorithm-based approach
- The Effect of Job Loss and Unemployment Insurance on Crime in Brazil
- Improving uplift model evaluation on randomized controlled trial data
- Sharp Sensitivity Analysis for Inverse Propensity Weighting via Quantile Balancing
- Extremal Random Forests
- Evaluating individualized treatment effect predictions: a model-based perspective on discrimination and calibration assessment
- Modern approaches for evaluating treatment effect heterogeneity from clinical trials and observational data
- Forward variable selection for random forest models
- On variance estimation of random forests with Infinite-order U-statistics
- Quantile generalized measures of correlation
- Robust nonparametric regression: a review
- Quantile regression by dyadic CART
- Methods for integrating trials and non-experimental data to examine treatment effect heterogeneity
- Learning optimal biomarker-guided treatment policy for chronic disorders
- Ranking of average treatment effects with generalized random forests for time-to-event outcomes
- Text Selection
- Comparison of methods that combine multiple randomized trials to estimate heterogeneous treatment effects
- Exploratory subgroup identification in the heterogeneous Cox model: a relatively simple procedure
- HETEROGENEOUS TREATMENT EFFECTS OF NUDGE AND REBATE: CAUSAL MACHINE LEARNING IN A FIELD EXPERIMENT ON ELECTRICITY CONSERVATION
- Learning and confirming a class of treatment responders in clinical trials
- Heterogeneous treatment effect-based random forest: HTERF
- Is there a role for statistics in artificial intelligence?
- Random Forest Adjustment for Approximate Bayesian Computation
- Time series quantile regression using random forests
- Causal inference methods for combining randomized trials and observational studies: a review
- Mixed-level screening designs based on skew-symmetric conference matrices
- Distributional (Single) Index Models
- Model-based random forests for ordinal regression
- Designing optimal, data-driven policies from multisite randomized trials
- Building Trees for Probabilistic Prediction via Scoring Rules
- Using Wasserstein generative adversarial networks for the design of Monte Carlo simulations
- Manifold Oblique Random Forests: Towards Closing the Gap on Convolutional Deep Networks
- New forest-based approaches for sufficient dimension reduction
- A reluctant additive model framework for interpretable nonlinear individualized treatment rules
This page was built for publication: Generalized random forests
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q666599)