Least-squares independence regression for non-linear causal inference under non-Gaussian noise
From MaRDI portal
Publication:479477
Abstract: The discovery of non-linear causal relationship under additive non-Gaussian noise models has attracted considerable attention recently because of their high flexibility. In this paper, we propose a novel causal inference algorithm called least-squares independence regression (LSIR). LSIR learns the additive noise model through the minimization of an estimator of the squared-loss mutual information between inputs and residuals. A notable advantage of LSIR over existing approaches is that tuning parameters such as the kernel width and the regularization parameter can be naturally optimized by cross-validation, allowing us to avoid overfitting in a data-dependent fashion. Through experiments with real-world datasets, we show that LSIR compares favorably with a state-of-the-art causal inference method.
Recommendations
- Non-linear causal inference using Gaussianity measures
- scientific article
- A linear non-Gaussian acyclic model for causal discovery
- Learning causal graphs of nonlinear structural vector autoregressive model using information theory criteria
- Justifying additive noise model-based causal discovery via algorithmic information theory
Cites work
- scientific article; zbMATH DE number 1266748 (Why is no real title available?)
- scientific article; zbMATH DE number 1332320 (Why is no real title available?)
- scientific article; zbMATH DE number 708500 (Why is no real title available?)
- scientific article; zbMATH DE number 1493045 (Why is no real title available?)
- A Consistent Test for Bivariate Dependence
- A linear non-Gaussian acyclic model for causal discovery
- Algorithmic Learning Theory
- Elements of Information Theory
- Justifying additive noise model-based causal discovery via algorithmic information theory
- Kernel dimension reduction in regression
- Nonlinear programming and variational inequality problems. A unified approach
- On Information and Sufficiency
- On the influence of the kernel on the consistency of support vector machines
- On the limited memory BFGS method for large scale optimization
- Pattern recognition and machine learning.
- Statistical analysis of kernel-based least-squares density-ratio estimation
- Sufficient dimension reduction via squared-loss mutual information estimation
- Theory of Reproducing Kernels
Cited in
(3)
This page was built for publication: Least-squares independence regression for non-linear causal inference under non-Gaussian noise
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q479477)