High-dimensional robust precision matrix estimation: cellwise corruption under \(\epsilon \)-contamination (Q1753147): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
RedirectionBot (talk | contribs)
Removed claim: reviewed by (P1447): Item:Q959160
Property / reviewed by
 
Property / reviewed by: Q464760 / rank
Normal rank
 

Revision as of 14:33, 21 February 2024

scientific article
Language Label Description Also known as
English
High-dimensional robust precision matrix estimation: cellwise corruption under \(\epsilon \)-contamination
scientific article

    Statements

    High-dimensional robust precision matrix estimation: cellwise corruption under \(\epsilon \)-contamination (English)
    0 references
    0 references
    0 references
    28 May 2018
    0 references
    Consider the following cellwise contamination model. For each \(n\) and \(p\) in \(\{1,2,\dots\}\) and \(k\in \{1,\dots,n\}\), \(X_{k}\) is a real \(p\)-dimensional random vector defined by \[ X_{k}=(I-B_{k})Y_{k}+B_{k}Z_{k}, \] where \(B_{k}\), \(Y_{k}\) and \(Z_{k}\) are independent random fields, \(B_{k}\) is a diagonal \((p\times p)\)-matrix \[ B_{k}=\left[ \begin{matrix} B_{k1} & 0 & \cdots & 0 \\ 0 & B_{k2} & \cdots & 0 \\ \vdots & \vdots & \vdots & \vdots \\ 0 & 0 & \cdots & B_{kp} \end{matrix} \right] , \] where \(B_{k1}\), \(B_{k2}\) and \(B_{kp}\) are independent random variables with \[ P(B_{ki}=1)=\varepsilon_{i}, \quad i\in \{1,\ldots,p\}. \] In this paper it is assumed that \(Y_{k}\thicksim \mathcal{N}(\mu ,\Sigma^{\ast})\), and \(Z_{k}\thicksim \mathcal{H}^{\ast}\) (an unspecified distribution). The objective of the paper is the estimation of \(\Sigma^{\ast}\), and \(\Omega^{\ast}=(\Sigma^{\ast})^{-1}\). If the data are uncontaminated \[ \varepsilon_{i}=0, \quad i\in \{1,\ldots ,p\}, \tag{1} \] then the classical sample covariance estimator \(\widehat{\Sigma}= \left[ \widehat{\Sigma}_{ij}\right]_{1\leq i,j\leq p}\) defined as \[ \widehat{\Sigma}_{ij}=\frac{1}{n-1}\sum_{k=1}^{n}(X_{ki}- \overline{X}_{i})(X_{kj}-\overline{X}_{j}), \quad 1\leq i,j\leq p \] where \[ \overline{X}_{i}=\frac{1}{n}\sum_{k=1}^{n}X_{ki}, \quad 1\leq i\leq p \] is used. It is well known that if (1) is not met then \(\widehat{\Sigma}\) may be very inefficient. Several authors have proposed estimators of the covariance matrix under various ``pure'' model and contamination assumptions. Among them are the so-called GLasso and CLIMA. Recently, good properties of these estimators have been tested from the point of view of the ``breakdown point'' (maximum proportion of contamination that an estimator can admit). The authors of this paper propose new estimators modifying the aforementioned ones incorporating robust estimators of the covariance matrix. The results reported encourage to follow and extend the proposed estimators to other ``pure'' and contaminated models. A detailed and updated review of the ``state of the art'' in the subject is made. It is worth highlighting the list of notations that are used in the paper, which undoubtedly facilitates their reading. Although the theorems are enunciated and demonstrated with detail and rigor, the comments included in the work facilitate the understanding and application of them. I believe that the study of this work will be very useful for mathematicians and statisticians who are interested in the unavoidable topic of robust estimation in statistics.
    0 references
    robust covariance matrix estimation
    0 references
    cellwise contamination
    0 references
    Kendall's tau
    0 references
    Spearman's rho
    0 references
    median absolute deviation
    0 references

    Identifiers