Clustering by Compression

From MaRDI portal
Publication:3546722

DOI10.1109/TIT.2005.844059zbMath1297.68097WikidataQ56269115 ScholiaQ56269115MaRDI QIDQ3546722

Paul M. B. Vitányi, Rudi L. Cilibrasi

Publication date: 21 December 2008

Published in: IEEE Transactions on Information Theory (Search for Journal in Brave)




Related Items (60)

Exploring programmable self-assembly in non-DNA based molecular computingQuantum information distanceTopographic Mapping of Large Dissimilarity Data SetsKolmogorov Complexity-Based Similarity Measures to Website Classification Problems: Leveraging Normalized Compression DistanceUsing data compressors to construct order tests for homogeneity and component independenceOn universal prediction and Bayesian confirmationA parametrized family of Tversky metrics connecting the Jaccard distance to an analogue of the normalized information distanceSublinear algorithms for approximating string compressibilityComputable model discovery and high-level-programming approximations to algorithmic complexityAlgorithmic complexity bounds on future prediction errorsOpen problems in universal induction \& intelligenceClustering with respect to the information distanceRealism and Texture: Benchmark Problems for Natural ComputationA philosophical treatise of universal inductionApproximating ( k,ℓ )-Median Clustering for Polygonal CurvesHierarchical clustering of text documentsINFORMATION DISTANCE AND ITS APPLICATIONSA fast quartet tree heuristic for hierarchical clusteringSimilarity and denoisingAspects in classification learning -- review of recent developments in learning vector quantizationUsing ideas of Kolmogorov complexity for studying biological textsNotes on sum-tests and independence testsA \textit{really} simple approximation of smallest grammarTemporal clustering of time series via threshold autoregressive models: application to commodity pricesOn Universal Transfer LearningA linearly computable measure of string complexityDetecting life signatures with RNA sequence similarity measuresOn the Complexity and Dimension of Continuous Finite-Dimensional MapsIndefinite Proximity Learning: A ReviewTextual data compression in computational biology: algorithmic techniquesAn All-or-Nothing Flavor to the Church-Turing HypothesisNormalized information-based divergencesEvaluating the Impact of Information Distortion on Normalized Compression DistanceAlgorithmic relative complexitySolovay functions and their applications in algorithmic randomnessApplication of data compression methods to nonparametric estimation of characteristics of discrete-time stochastic processesAn extension of the Burrows-Wheeler transformRanking inter-relationships between clustersInformation-theoretic method for classification of textsAn automatic and parameter-free information-based method for sparse representation in wavelet basesA new combinatorial approach to sequence comparisonCompression-based distance between string data and its application to literary work classification based on authorshipCompression based homogeneity testingDistance measures for biological sequences: some recent approachesSequence distance via parsing complexity: heartbeat signalsUniversal codes as a basis for time series testingNonapproximability of the normalized information distanceImproved metaheuristics for the quartet method of hierarchical clusteringApplication of Kolmogorov complexity and universal codes to identity testing and nonparametric testing of serial independence for time seriesA copula entropy approach to correlation measurement at the country levelAn exact algorithm for the minimum quartet tree cost problemBetween order and chaos: The quest for meaningful informationExpanding the algorithmic information theory frame for applications to Earth observationHydrozip: how hydrological knowledge can be used to improve compression of hydrological dataArtificial sequences and complexity measuresMining Compressing Sequential PatternsSummarizing and understanding large graphsOn universal transfer learningOn the Value of Multiple Read/Write Streams for Data CompressionProbing the quantum–classical boundary with compression software




This page was built for publication: Clustering by Compression