Symmetry in data mining and analysis: a unifying view based on hierarchy
From MaRDI portal
Publication:1048428
DOI10.1134/S0081543809020175zbMATH Open1185.68277arXiv0805.2744OpenAlexW1979366912MaRDI QIDQ1048428FDOQ1048428
Authors: F. Murtagh
Publication date: 12 January 2010
Published in: Proceedings of the Steklov Institute of Mathematics (Search for Journal in Brave)
Abstract: Data analysis and data mining are concerned with unsupervised pattern finding and structure determination in data sets. The data sets themselves are explicitly linked as a form of representation to an observational or otherwise empirical domain of interest. "Structure" has long been understood as symmetry which can take many forms with respect to any transformation, including point, translational, rotational, and many others. Beginning with the role of number theory in expressing data, we show how we can naturally proceed to hierarchical structures. We show how this both encapsulates traditional paradigms in data analysis, and also opens up new perspectives towards issues that are on the order of the day, including data mining of massive, high dimensional, heterogeneous data sets. Linkages with other fields are also discussed including computational logic and symbolic dynamics. The structures in data surveyed here are based on hierarchy, represented as p-adic numbers or an ultrametric topology.
Full work available at URL: https://arxiv.org/abs/0805.2744
Recommendations
Cites Work
- Title not available (Why is that?)
- Title not available (Why is that?)
- Multiscale methods for data on graphs and irregular multidimensional situations
- Hierarchical clustering schemes
- Symbolic Analysis of High-Dimensional Time Series
- Title not available (Why is that?)
- Geometric Representation of High Dimension, Low Sample Size Data
- Neighborliness of randomly projected simplices in high dimensions
- A \(p\)-adic model of DNA sequence and genetic code
- Initializing \(K\)-means batch clustering: A critical evaluation of several techniques
- Mathematical classification and clustering
- The remarkable simplicity of very high dimensional data: application of model-based clustering
- Title not available (Why is that?)
- Hierarchical Clustering of Massive, High Dimensional Data Sets by Exploiting Ultrametric Embedding
- An Order Theoretic Model for Cluster Analysis
- Title not available (Why is that?)
- Title not available (Why is that?)
- Title not available (Why is that?)
- Two-mode clustering methods: astructuredoverview
- Hierarchical trees can be perfectly scaled in one dimension
- Counting dendrograms: A survey
- On ultrametricity, data coding, and computation
- A Survey of Recent Advances in Hierarchical Clustering Algorithms
- Problem Decomposition and Data Reorganization by a Clustering Technique
- Information dynamics in cognitive, psychological, social and anomalous phenomena.
- Wavelet theory as $ p$-adic spectral analysis
- A wavelet theory for local fields and related groups
- \(p\)-adic numbers. An introduction
- Model-Based Compressive Sensing
- Title not available (Why is that?)
- Title not available (Why is that?)
- Gene expression from polynomial dynamics in the 2-adic information space
- Number theory as the ultimate physical theory
- Title not available (Why is that?)
- Mathematics and complex systems
- p-adic space-time and string theory
- Wavelets and spectral analysis of ultrametric pseudodifferential operators
- Title not available (Why is that?)
- p-Adic Strings and Their Applications
- Order Patterns in Time Series
- Title not available (Why is that?)
- A wreath product group approach to signal and image processing. I: Multiresolution analysis
- Espaces ultramétriques
- The Haar wavelet transform of a dendrogram
- Mumford dendrograms and discrete \(p\)-adic symmetries
- An algebraic approach to multiresolution analysis
- Title not available (Why is that?)
- Title not available (Why is that?)
- Title not available (Why is that?)
- Cluster Analysis Based on Posets
- An Ordering Algorithm for Analysis of Data Arrays
Cited In (5)
Uses Software
This page was built for publication: Symmetry in data mining and analysis: a unifying view based on hierarchy
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1048428)