Robust estimation in the linear model with asymmetric error distributions (Q1082742)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Robust estimation in the linear model with asymmetric error distributions |
scientific article |
Statements
Robust estimation in the linear model with asymmetric error distributions (English)
0 references
1986
0 references
This article considers robust estimation of a location parameter and robust estimation of a linear regression model when the observations and error terms, respectively, are independent and identically distributed random variables, each with distribution function F. As in Huber's theory of robust estimation F is an unknown member of a specified class of distribution functions \({\mathcal F}.\) Two particular classes of distribution functions are considered, namely \({\mathcal F}_{a_ 0,\epsilon}\) and \({\mathcal F}_{[-a_ 0,b_ 0]}\). F is in \({\mathcal F}_{a_ 0,\epsilon}\) if it is governed on the set \([-a_ 0,a_ 0]\) by the standard normal distribution function contaminated, with probability \(\epsilon\), by an unknown distribution which is symmetric about 0; outside the interval \([-a_ 0,a_ 0]\), F is completely unknown. The parameters \(a_ 0\) \((a_ 0>0)\) and \(\epsilon\) \((0<\epsilon <1)\) are assumed to be known. The class \({\mathcal F}_{[-a_ 0,b_ 0]}\) consists of all distributions which on a fixed set \([-a_ 0,b_ 0]\) are governed by a known distribution, outside the interval \([- a_ 0,b_ 0]\) the distribution is completely arbitrary. In this article a sequence of M-estimators for the location problem is found which is optimal (in the sense of minimizing the asymptotic mean squared error as F ranges over \({\mathcal F}_{a_ 0,\epsilon})\) within the class of all M-estimators. This result is then generalized to the problem of estimating a linear regression model. Finally, the optimal M- estimators among the class of consistent and asymptotically normal M- estimators of the linear model parameters are found under the error distribution models \({\mathcal F}_{a_ 0,\epsilon}\) and \({\mathcal F}_{[- a_ 0,b_ 0]}.\) For the model \({\mathcal F}_{a_ 0,\epsilon}\) the strong assumption that the design matrix is constructed by repeating p fixed linearly independent rows as \(n\to \infty\) is made. The authors suggest that the same results can be achieved under weaker conditions on the design matrix, but with some additional complications in the derivations. However, the authors argue that this approach would result in a decrease in the small-sample efficiency.
0 references
asymmetric error distributions
0 references
robust regression
0 references
linear model
0 references
asymptotic distributions
0 references
consistency
0 references
asymptotically normal
0 references
robust estimation
0 references
location parameter
0 references
standard normal distribution
0 references
minimizing the asymptotic mean squared error
0 references
optimal M-estimators
0 references