Ckmeans.1d.dp
swMATH15783CRANCkmeans.1d.dpMaRDI QIDQ27655FDOQ27655
Optimal, Fast, and Reproducible Univariate Clustering
Haizhou Wang, Mingzhou Song, Hua Zhong
Last update: 19 August 2023
Copyright license: GNU Lesser General Public License
Software version identifier: 4.3.4, 1.0, 1.1, 2.0, 2.1, 2.2, 2.3, 2.4, 2.5, 3.0, 3.01, 3.02, 3.3.0, 3.3.1, 3.3.3, 3.4.0-1, 3.4.0, 3.4.6-1, 3.4.6-2, 3.4.6-3, 3.4.6-4, 3.4.6-5, 3.4.6-6, 3.4.6, 4.0.0, 4.0.1, 4.2.0, 4.2.1, 4.2.2, 4.3.0, 4.3.2, 4.3.3, 4.3.5
Source code repository: https://github.com/cran/Ckmeans.1d.dp
Fast, optimal, and reproducible weighted univariate clustering by dynamic programming. Four problems are solved, including univariate k-means (Wang & Song 2011) <doi:10.32614/RJ-2011-015> (Song & Zhong 2020) <doi:10.1093/bioinformatics/btaa613>, k-median, k-segments, and multi-channel weighted k-means. Dynamic programming is used to minimize the sum of (weighted) within-cluster distances using respective metrics. Its advantage over heuristic clustering in efficiency and accuracy is pronounced when there are many clusters. Multi-channel weighted k-means groups multiple univariate signals into k clusters. An auxiliary function generates histograms adaptive to patterns in data. This package provides a powerful set of tools for univariate data analysis with guaranteed optimality, efficiency, and reproducibility, useful for peak calling on temporal, spatial, and spectral data.
Cited In (6)
This page was built for software: Ckmeans.1d.dp