Sparse p-adic data coding for computationally efficient and effective big data analytics

From MaRDI portal
Publication:344041

DOI10.1134/S2070046616030055zbMATH Open1353.94026arXiv1604.06961MaRDI QIDQ344041FDOQ344041


Authors: F. Murtagh Edit this on Wikidata


Publication date: 22 November 2016

Published in: \(p\)-Adic Numbers, Ultrametric Analysis, and Applications (Search for Journal in Brave)

Abstract: We develop the theory and practical implementation of p-adic sparse coding of data. Rather than the standard, sparsifying criterion that uses the L0 pseudo-norm, we use the p-adic norm. We require that the hierarchy or tree be node-ranked, as is standard practice in agglomerative and other hierarchical clustering, but not necessarily with decision trees. In order to structure the data, all computational processing operations are direct reading of the data, or are bounded by a constant number of direct readings of the data, implying linear computational time. Through p-adic sparse data coding, efficient storage results, and for bounded p-adic norm stored data, search and retrieval are constant time operations. Examples show the effectiveness of this new approach to content-driven encoding and displaying of data.


Full work available at URL: https://arxiv.org/abs/1604.06961




Recommendations




Cites Work


Cited In (3)





This page was built for publication: Sparse \(p\)-adic data coding for computationally efficient and effective big data analytics

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q344041)