Linear regression with sparsely permuted data
From MaRDI portal
Abstract: In regression analysis of multivariate data, it is tacitly assumed that response and predictor variables in each observed response-predictor pair correspond to the same entity or unit. In this paper, we consider the situation of "permuted data" in which this basic correspondence has been lost. Several recent papers have considered this situation without further assumptions on the underlying permutation. In applications, the latter is often known to have additional structure that can be leveraged. Specifically, we herein consider the common scenario of "sparsely permuted data" in which only a small fraction of the data is affected by a mismatch between response and predictors. However, an adverse effect already observed for sparsely permuted data is that the least squares estimator, as well as other estimators not accounting for such partial mismatch, is inconsistent. One approach studied in detail herein is to treat permuted data as outliers, which motivates the use of robust regression formulations to estimate the regression parameter. The resulting estimate can subsequently be used to recover the permutation. A notable benefit of the proposed approach is its computational simplicity, given the general lack of procedures for the above problem that are both statistically sound and computationally appealing.
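The two-stage idea in the abstract (a robust fit first, then recovery of the mismatches) can be illustrated with a minimal sketch. It is not the paper's procedure: it assumes a single predictor without intercept, uses a Huber M-estimator fitted by iteratively reweighted least squares as the robust formulation, and replaces the permutation recovery step with a simple rank-based re-matching of the rows flagged as outliers. All function names, the simulated data, and the tuning constants (1.345 for the Huber threshold, 3 robust standard deviations for flagging) are illustrative assumptions.

```python
import random
import statistics


def least_squares(x, y, w=None):
    """Weighted least-squares slope for the no-intercept model y = beta * x."""
    if w is None:
        w = [1.0] * len(x)
    num = sum(wi * xi * yi for wi, xi, yi in zip(w, x, y))
    den = sum(wi * xi * xi for wi, xi in zip(w, x))
    return num / den


def huber_slope(x, y, c=1.345, iters=50):
    """Huber M-estimate of the slope via iteratively reweighted least squares."""
    beta = least_squares(x, y)  # start from ordinary least squares
    scale = 1.0
    for _ in range(iters):
        resid = [yi - beta * xi for xi, yi in zip(x, y)]
        # Robust residual scale: normalized median absolute residual.
        scale = statistics.median(abs(r) for r in resid) / 0.6745
        # Huber weights: 1 inside the threshold, down-weighted outside it.
        w = [1.0 if abs(r) <= c * scale else c * scale / abs(r) for r in resid]
        beta = least_squares(x, y, w)
    return beta, scale


def simulate(n=500, beta=3.0, frac=0.1, sigma=0.1, seed=0):
    """Simulated data (assumption): shuffle the responses of a small subset."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]
    y = [beta * xi + rng.gauss(0.0, sigma) for xi in x]
    idx = rng.sample(range(n), int(frac * n))  # rows to be mismatched
    shuffled = idx[:]
    rng.shuffle(shuffled)
    y_perm = y[:]
    for i, j in zip(idx, shuffled):
        y_perm[i] = y[j]
    moved = {i for i, j in zip(idx, shuffled) if i != j}
    return x, y_perm, moved


def rematch(x, y, beta, flagged):
    """Re-pair flagged responses with flagged fitted values by rank.

    For scalar values, matching in sorted order minimizes the sum of
    squared differences over all permutations of the flagged rows.
    """
    by_fit = sorted(flagged, key=lambda i: beta * x[i])
    by_resp = sorted(flagged, key=lambda i: y[i])
    y_new = y[:]
    for i, j in zip(by_fit, by_resp):
        y_new[i] = y[j]
    return y_new


x, y_perm, moved = simulate()
beta_ls = least_squares(x, y_perm)
beta_rob, scale = huber_slope(x, y_perm)
# Flag rows whose residual under the robust fit is large.
flagged = [i for i in range(len(x))
           if abs(y_perm[i] - beta_rob * x[i]) > 3 * scale]
y_fixed = rematch(x, y_perm, beta_rob, flagged)
print("least squares slope:", beta_ls)
print("Huber slope:", beta_rob)
print("flagged rows:", len(flagged), "truly mismatched:", len(moved))
```

In this simulation the least squares slope is pulled toward zero by the mismatched rows, while the robust fit stays close to the true slope of 3. The re-matching step is only a heuristic: mismatched rows with small residuals escape flagging, so the recovered pairing is not guaranteed to equal the true permutation.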
Recommendations
- Sparse linear regression from perturbed data
- Linear Regression With a Sparse Parameter Vector
- Approximate sparse linear regression
- Scaled sparse linear regression
- Sparsest piecewise-linear regression of one-dimensional data
- scientific article; zbMATH DE number 6276198
- Permutation inference distribution for linear regression and related models
- Linear Regression With Shuffled Data: Statistical and Computational Limits of Permutation Recovery
- Sparse regression: scalable algorithms and empirical performance
Cites work
- scientific article; zbMATH DE number 3644343
- scientific article; zbMATH DE number 4061904
- scientific article; zbMATH DE number 2062645
- scientific article; zbMATH DE number 3297798
- A Bayesian procedure for file linking to analyze end-of-life medical costs
- A file linkage problem of DeGroot and Goel revisited
- A simple proof of the restricted isometry property for random matrices
- A tail inequality for quadratic forms of subgaussian random vectors
- Algorithm 948: DAESA -- a Matlab tool for structural analysis of differential-algebraic equations: software
- An elementary proof of a theorem of Johnson and Lindenstrauss
- Assignment Problems
- Best subset selection via a modern optimization lens
- Concentration inequalities. A nonasymptotic theory of independence
- Corrupted Sensing: Novel Guarantees for Separating Structured Signals
- Decoding by Linear Programming
- Estimation of the correlation coefficient from a broken random sample
- Geometric approach to error-correcting codes and reconstruction of signals
- Geometric inference for general high-dimensional linear inverse problems
- Least quantile regression via modern optimization
- Linear Regression With Shuffled Data: Statistical and Computational Limits of Permutation Recovery
- Matchmaking
- Minimax rates in permutation estimation for feature matching
- Optimal rates of statistical seriation
- Outlier detection using nonconvex penalized regression
- Random Fields and Geometry
- Regression Analysis With Linked Data
- Regression analysis with linked data: problems and possible solutions
- Robust 1-bit Compressed Sensing and Sparse Logistic Regression: A Convex Programming Approach
- Robust Estimation of a Location Parameter
- Robust Lasso With Missing and Grossly Corrupted Observations
- Robust Statistics
- Scaled sparse linear regression
- Simultaneous analysis of Lasso and Dantzig selector
- Some assignment problems arising from multiple target tracking
- Square-root lasso: pivotal recovery of sparse signals via conic programming
- The Generalized Lasso With Non-Linear Observations
- The broken sample problem
- The convex geometry of linear inverse problems
- Unlabeled Sensing With Random Linear Measurements
Cited in (14)
- Optimal detection of the feature matching map in presence of noise and outliers
- Regression with linked datasets subject to linkage error
- A Pseudo-Likelihood Approach to Linear Regression With Partially Shuffled Data
- scientific article; zbMATH DE number 7306893
- Robust regression using probabilistically linked data
- Linear regression with mismatched data: a provably optimal local search algorithm
- Optimal Permutation Recovery in Permuted Monotone Matrix Model
- Matrix recovery from permutations
- Linear regression with partially mismatched data: local search with theoretical guarantees
- Provable training set debugging for linear regression
- Matching a discrete distribution by Poisson matching quantiles estimation
- Sparse linear regression from perturbed data
- Homomorphic sensing of subspace arrangements
- Linear Regression With a Sparse Parameter Vector