Local projections for high-dimensional outlier detection
From MaRDI portal
Publication:824970
DOI10.1007/S40300-020-00183-5zbMATH Open1484.62064arXiv1708.01550OpenAlexW3047312767MaRDI QIDQ824970FDOQ824970
Authors: Thomas Ortner, P. Filzmoser, Maia Rohm, Sarka Brodinova, Christian Breiteneder
Publication date: 16 December 2021
Published in: Metron (Search for Journal in Brave)
Abstract: In this paper, we propose a novel approach for outlier detection, called local projections, which is based on concepts of Local Outlier Factor (LOF) (Breunig et al., 2000) and RobPCA (Hubert et al., 2005). By using aspects of both methods, our algorithm is robust towards noise variables and is capable of performing outlier detection in multi-group situations. We are further not reliant on a specific underlying data distribution. For each observation of a dataset, we identify a local group of dense nearby observations, which we call a core, based on a modification of the k-nearest neighbours algorithm. By projecting the dataset onto the space spanned by those observations, two aspects are revealed. First, we can analyze the distance from an observation to the center of the core within the projection space in order to provide a measure of quality of description of the observation by the projection. Second, we consider the distance of the observation to the projection space in order to assess the suitability of the core for describing the outlyingness of the observation. These novel interpretations lead to a univariate measure of outlyingness based on aggregations over all local projections, which outperforms LOF and RobPCA as well as other popular methods like PCOut (Filzmoser et al., 2008) and subspace-based outlier detection (Kriegel et al., 2009) in our simulation setups. Experiments in the context of real-word applications employing datasets of various dimensionality demonstrate the advantages of local projections.
Full work available at URL: https://arxiv.org/abs/1708.01550
Recommendations
- High-dimensional outlier detection using random projections
- Projection-based outlier detection in functional data
- Multiple outlier detection in multivariate data using projection pursuit techniques
- Computationally easy outlier detection via projection pursuit with finitely many directions
- Detecting multivariate outliers using projection pursuit with particle swarm optimization
Classification and discrimination; cluster analysis (statistical aspects) (62H30) Estimation in multivariate analysis (62H12)
Cites Work
- Outlier identification in high dimensions
- Fast and robust discriminant analysis
- A survey on unsupervised outlier detection in high‐dimensional numerical data
- Guided Projections for Analyzing the Structure of High-Dimensional Data
- CASOS: a subspace method for anomaly detection in high dimensional astronomical databases
Cited In (1)
Uses Software
This page was built for publication: Local projections for high-dimensional outlier detection
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q824970)