Validating visual clusters in large datasets: fixed point clusters of spectral features. (Q1852887)

From MaRDI portal

Jump to:navigation, search

scientific article

Language	Label	Description	Also known as
English	Validating visual clusters in large datasets: fixed point clusters of spectral features.	scientific article

Statements

scholarly article

0 references

Validating visual clusters in large datasets: fixed point clusters of spectral features. (English)

0 references

Christian Hennig

0 references

Norbert Christlieb

0 references

Computational Statistics and Data Analysis

0 references

publication date

21 January 2003

0 references

Finding clusters in large datasets is a difficult task. Almost all computationally feasible methods are related to \(k\)-means and need a clear partition structure of the data, while most such datasets contain masking outliers and other deviations from the usual models of partitioning cluster analysis. It is possible to look for clusters informally using graphic tools like the grand tour, but the meaning and the validity of such patterns is unclear. In this paper, a three-step-approach is suggested: In the first step, data visualization methods like the grand tour are used to find cluster candidate subsets of the data. In the second step, reproducible clusters are generated from them by means of fixed point clustering, a method to find a single cluster at a time based on the Mahalanobis distance. In the third step, the validity of the clusters is assessed by the use of classification plots. The approach is applied to an astronomical dataset of spectra from the Hamburg/ESO survey.

0 references

zbMATH Keywords

Discriminant coordinates

0 references

Outliers

0 references

Contamination model

0 references

Sky surveys

0 references

Stellar populations

0 references

Data visualization

0 references

Classification plots

0 references

describes a project that uses

0 references

0 references

MaRDI profile type

MaRDI publication profile

0 references

The Grand Tour: A Tool for Viewing Multidimensional Data

0 references

The Masking Breakdown Point of Multivariate Outlier Identification Rules

0 references

Trimmed \(k\)-means: An attempt to robustify quantizers

0 references

How Many Clusters? Which Clustering Method? Answers Via Model-Based Cluster Analysis

0 references

0 references

0 references

0 references

0 references

Finding Groups in Data

0 references

A procedure for the detection of multivariate outliers.

0 references

0 references

0 references

Visual clustering and classification: The Oronsay particle size dataset revisited

0 references

Identifiers

zbMATH Open document ID

0 references

10.1016/S0167-9473(02)00077-4

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

zbMATH DE Number

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1852887

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q1852887&oldid=34330456"