Algorithm AS 136: A K-Means Clustering Algorithm

Publication:105416

DOI: 10.2307/2346830, zbMath: 0447.62062, OpenAlex: W1977556410, Wikidata: Q105584859, Scholia: Q105584859, MaRDI QID: Q105416

J. A. Hartigan, M. Anthony Wong

Publication date: 1979

Published in: Applied Statistics

Full work available at URL: https://doi.org/10.2307/2346830

Related Items

The Landscape of Causal Inference: Perspective From Citation Network Analysis, k-POD: A Method for k-Means Clustering of Missing Data, Clustering Effects in Unreplicated Factorial Experiments, Sparse Matrix Computational Techniques in Concept Decomposition Matrix Approximation, Clustering of gamma-ray bursts through kernel principal component analysis, Optimal partitioning for the proportional hazards model, Representative points for location-biased datasets, Discrete facility location in machine learning, PC-GAIN: pseudo-label conditional generative adversarial imputation networks for incomplete data, A power-controlled reliability assessment for multi-class probabilistic classifiers, A literature survey of matrix methods for data science, Bayesian spatial design of optimal deep tube well locations in Matlab, Bangladesh, Subsampling spectral clustering for stochastic block models in large-scale networks, A guarantee rate optimization model for wastewater treatment system design under uncertainty, A Novel Bayesian Functional Spatial Partitioning Method with Application to Prostate Cancer Lesion Detection Using MRI, Distance Metrics and Clustering Methods for Mixed‐type Data, Dynamic clustering of multivariate panel data, Model-based clustering via mixtures of unrestricted skew normal factor analyzers with complete and incomplete data, MLE of jointly constrained mean-covariance of multivariate normal distributions, The cluster correlation-network support vector machine for high-dimensional binary classification, Fast prediction of aquifer thermal energy storage: a multicyclic metamodelling procedure, A change-point detection and clustering method in the recurrent-event context, Deep learning discrete calculus (DLDC): a family of discrete numerical methods by universal approximation for STEM education to frontier research, How to describe the spatial near-far relations among concepts?, DPC-FSC: an approach of fuzzy semantic cells to density peaks clustering, FGC\_SS: 
fast graph clustering method by joint spectral embedding and improved spectral rotation, A novel method for optimizing spectral rotation embedding \(K\)-means with coordinate descent, A fusion learning method to subgroup analysis of Alzheimer's disease, An automated robust algorithm for clustering multivariate data, Zero-inflated time series clustering via ensemble thick-pen transform, Using machine learning to capture heterogeneity in trade agreements, Penguins Go Parallel: A Grammar of Graphics Framework for Generalized Parallel Coordinate Plots, Spatial Linear Regression with Covariate Measurement Errors: Inference and Scalable Computation in a Functional Modeling Approach, Functional data clustering via information maximization, Agglomeration of polygonal grids using graph neural networks with applications to multigrid solvers, Model-based clustering and classification using mixtures of multivariate skewed power exponential distributions, A machine learning-based probabilistic computational framework for uncertainty quantification of actuation of clustered tensegrity structures, A new measure for assessment of clustering based on kernel density estimation, Robust mixture regression modeling based on the normal mean-variance mixture distributions, Features for the 0-1 knapsack problem based on inclusionwise maximal solutions, Moving Up the Cluster Tree with the Gradient Flow, Clustering categorical data: soft rounding \(k\)-modes, Unsupervised learning on U.S. 
weather forecast performance, Polynomial-chaos-based conditional statistics for probabilistic learning with heterogeneous data applied to atomic collisions of helium on graphite substrate, UP-DPC: ultra-scalable parallel density peak clustering, Multi-view clustering with adaptive procrustes on Grassmann manifold, Faraday wave–droplet dynamics: discrete-time analysis, A NEW MULTILEVEL MODELING APPROACH FOR CLUSTERED SURVIVAL DATA, Relax-and-split method for nonconvex inverse problems, Dirichlet process mixture models for unsupervised clustering of symptoms in Parkinson's disease, Unnamed Item, Use of jump process to model mobility in massive multiplayer on-line games, A new algorithm for clustering based on kernel density estimation, Robust mixtures of factor analysis models using the restricted multivariate skew-t distribution, Digital images of multipolar neurons from the human dentate nucleus: topologic and morphometric analysis accompanied with the classification by cluster analysis, Efficient Computation of Option Prices and Greeks by Quasi--Monte Carlo Method with Smoothing and Dimension Reduction, Non-parametric planar shape representation based on adaptive curvature functions, Unnamed Item, Accelerating parallel tempering: Quantile tempering algorithm (QuanTA), Fractionally-supervised classification, Dynamic Tensor Clustering, A Simple Framework for Stability Analysis of State-Dependent Networks of Heterogeneous Agents, Sparse Bayesian Imaging of Solar Flares, Using Chernoff’s Bounding Method for High-Performance Structural Break Detection and Forecast Error Reduction, A parameterless scale-space approach to find meaningful modes in histograms — Application to image and spectrum segmentation, Localization of impact on box mechanical structure by the method of modal parameters extraction combined with K-means clustering, Analyzing and clustering students' application preferences in higher education, EM algorithm for mixture of skew-normal distributions 
fitted to grouped data, Multi-Phase Segmentation Using Modified Complex Cahn-Hilliard Equations, Unnamed Item, Fast estimation of posterior probabilities in change-point analysis through a constrained hidden Markov model, Quantum annealing for combinatorial clustering, Novel adaptive hybrid discontinuous Galerkin algorithms for elliptic problems, Clustering longitudinal profiles using P-splines and mixed effects models applied to time-course gene expression data, A survey of functional principal component analysis, Flexible clustering via extended mixtures of common \(t\)-factor analyzers, Mixtures of common \(t\)-factor analyzers for modeling high-dimensional data with missing values, Mixture of linear experts model for censored data: a novel approach with scale-mixture of normal distributions, Clustering with the average silhouette width, Generalized \(k\)-means in GLMs with applications to the outbreak of COVID-19 in the United States, Bayesian cluster analysis: point estimation and credible balls (with discussion), Mathematical modeling and efficient optimization methods for the distance-dependent rearrangement clustering problem, Biclustering models for two-mode ordinal data, A two-stage solution method for the annual dairy transportation problem, An auto-realignment method in quasi-Monte Carlo for pricing financial derivatives with jump structures, A survey of content-based image retrieval with high-level semantics, A spatio-temporal modeling framework for weather radar image data in tropical southeast Asia, A discrete inter-species cuckoo search for flowshop scheduling problems, Cluster differences scaling with a within-clusters loss component and a fuzzy successive approximation strategy to avoid local minima, Analyzing gene expression time-courses based on multi-resolution shape mixture model, A new heuristic for solving the \(p\)-median problem in the plane, Approximating the objective function's gradient using perceptrons for constrained minimization 
with application in drag reduction, Variable selection for high-dimensional genomic data with censored outcomes using group Lasso prior, An adaptive importance sampling algorithm for Bayesian inversion with multimodal distributions, Spatial data compression via adaptive dispersion clustering, Angle-based models for ranking data, Model-based clustering of probability density functions, Radial basis functions for exploratory data analysis: An iterative majorisation approach for Minkowski distances based on multidimensional scaling, A comparison of the classification capabilities of the 1-dimensional Kohonen neural network with two partitioning and three hierarchical cluster analysis algorithms, Mixtures of common factor analyzers for high-dimensional data with missing information, Integrating robust clustering techniques in S-PLUS, Optimizing word set coverage for multi-event summarization, An indication of unification for different clustering approaches, Clusteranalyse - Überblick und neuere Entwicklungen, Optimal estimators of principal points for minimizing expected mean squared distance, Recovery guarantees for exemplar-based clustering, Bayesian model-based tight clustering for time course data, Bayesian multiscale smoothing of Gaussian noised images, Simulation techniques for the calculus of wrapped compartments, Probabilistic-statistical programs from ``Applied Statistics, Graph clustering, Context prediction of mobile users based on time-inferred pattern networks: a probabilistic approach, Bayesian inference for nonlinear structural time series models, Semi-nonnegative rank for real matrices and its connection to the usual rank, Self-consistency and a generalized principal subspace theorem, A sampling-based approach for probabilistic design with random fields, Optimal partitioning of a data set based on the \(p\)-median model, Structural regularized projection twin support vector machine for data classification, Optimising \(k\)-means clustering results with 
standard software packages, Linear grouping using orthogonal regression, Structural multiple empirical kernel learning, Partition clustering of high dimensional low sample size data based on \(p\)-values, GAP: a graphical environment for matrix visualization and cluster analysis, A differential evolution algorithm for finding the median ranking under the Kemeny axiomatic approach, Cooperative clustering, An incremental nested partition method for data clustering, Identification of surgical practice patterns using evolutionary cluster analysis, Algorithms for clustering clickstream data, Validating vehicle routing zone construction using Monte Carlo simulation, Feature selection and machine learning with mass spectrometry data for distinguishing cancer and non-cancer samples, RHclust, Efficient algorithms using subiterative convergence for Kemeny ranking problem, Enhancing principal direction divisive clustering, A hybrid macroscopic-based model for traffic flow in road networks, Multiphase image segmentation and modulation recovery based on shape and topological sensitivity, Robust mixture modeling based on scale mixtures of skew-normal distributions, Iterative sliced inverse regression for segmentation of ultrasound and MR images, Model-based clustering of time series in group-specific functional subspaces, A constrained \(k\)-means clustering algorithm for classifying spatial units, Mixtures of multivariate power exponential distributions, Model-based clustering of multivariate ordinal data relying on a stochastic binary search algorithm, Variable selection for model-based clustering using the integrated complete-data likelihood, Asymptotic regularity of subdivisions of Euclidean domains by iterated PCA and iterated 2-means, A semiparametric method for clustering mixed data, PK-means: A new algorithm for gene clustering, Flexible Nonhomogeneous Markov Models for Panel Observed Data, Feature screening in large scale cluster analysis, A toolbox for \(K\)-centroids 
cluster analysis, Isometric sliced inverse regression for nonlinear manifold learning, K-hyperline clustering learning for sparse component analysis, The importance of the scales in heterogeneous robust clustering, CLUES: a non-parametric clustering method based on local shrinking, AS 136, Assessing agreement of clustering methods with gene expression microarray data, Exemplar-based clustering via simulated annealing, Principal point classification: applications to differentiating drug and placebo responses in longitudinal studies, A graph b-coloring framework for data clustering, Binary whale optimization algorithm and binary moth flame optimization with clustering algorithms for clinical breast cancer diagnoses, An ensemble feature ranking algorithm for clustering analysis, A global algorithm to estimate the expectations of the components of an observed univariate mixture, Semi-supervised cross-entropy clustering with information bottleneck constraint, A Bayesian sparse finite mixture model for clustering data from a heterogeneous population, Clustering qualitative data based on binary equivalence relations: neighborhood search heuristics for the clique partitioning problem, A novel fuzzy clustering algorithm using observation weighting and context information for reverberant blind speech separation, Asymptotic properties of univariate sample k-means clusters, An extension of the \(p\)-median group technology algorithm, Identifying heterogeneous transgenerational DNA methylation sites via clustering in beta regression, Functional cluster analysis via orthonormalized Gaussian basis expansions and its application, A criterion based on the Mahalanobis distance for cluster analysis with subsampling, Fast, linear time hierarchical clustering using the Baire metric, Unsupervised classification of eclipsing binary light curves throughk-medoids clustering, Principal points analysis via p-median problem for binary data, Análisis de los conglomerados de precipitación y sus 
cambios estacionales sobre América Central para el período 1976-2015, A Two-Stage Color Image Segmentation Method Based on Saturation-Value Total Variation, A Permutation Based Procedure for Classification Assessment, Clustering Microarray Data: Theoretical and Practical Issues, Online Learning of Inverted Beta-Liouville HMMs for Anomaly Detection in Crowd Scenes, Parallel Domain Decomposition Strategies for Stochastic Elliptic Equations. Part A: Local Karhunen--Loève Representations, Improving Bayesian Local Spatial Models in Large Datasets, Estimating Multiple Precision Matrices With Cluster Fusion Regularization, Estimating covariate-adjusted measures of diagnostic accuracy based on pooled biomarker assessments, Unnamed Item, Image Segmentation via Fischer-Burmeister Total Variation and Thresholding, Unnamed Item, Nonparametric K-means algorithm with applications in economic and functional data, Survival prediction and variable selection with simultaneous shrinkage and grouping priors, SingleCross-clustering: an algorithm for finding elongated clusters with automatic estimation of outliers and number of clusters, A Relabeling Approach to Handling the Class Imbalance Problem for Logistic Regression, Graph Refinement via Simultaneously Low-Rank and Sparse Approximation, Multivariate functional data modeling with time-varying clustering, Randomized block Kaczmarz methods with \(k\)-means clustering for solving large linear systems, Correlation Clustering with Constrained Cluster Sizes and Extended Weights Bounds, An empirical comparison and characterisation of nine popular clustering methods, Economic fluctuations and fiscal policy in Europe: a political business cycles approach using panel data and clustering (1996--2013), RESAMPLING FOR FUZZY CLUSTERING, Structural nonparallel support vector machine for pattern recognition, Machine learning based refinement strategies for polyhedral grids with applications to virtual element and polyhedral discontinuous Galerkin 
methods, Functional data clustering via hypothesis testing \(k\)-means, Finite mixture biclustering of discrete type multivariate data, The \(k\)-means algorithm for 3D shapes with an application to apparel design, Constrained clustering with a complex cluster structure, Clustering of imbalanced high-dimensional media data, Functional data clustering: a survey, Robust model-based clustering via mixtures of skew-\(t\) distributions with missing information, A novel mixture model using the multivariate normal mean-variance mixture of Birnbaum-Saunders distributions and its application to extrasolar planets, An exact algorithm for semi-supervised minimum sum-of-squares clustering, Adaptive training of local reduced bases for unsteady incompressible Navier-Stokes flows, Machine learning-enabled self-consistent parametrically-upscaled crystal plasticity model for Ni-based superalloys, Eigenvalues of quaternion tensors with applications to color video processing, Deep reinforcement learning for optimal well control in subsurface systems with uncertain geology, Cluster-based generalized multiscale finite element method for elliptic PDEs with random coefficients, Balanced \(k\)-means clustering on an adiabatic quantum computer, A novel clustering approach and prediction of optimal number of clusters: global optimum search with enhanced positioning, Unnamed Item, Efficient homomorphic comparison methods with optimal complexity, Handling Noise and Outliers in Fuzzy Clustering, On the expectation-maximization algorithm for Rice-Rayleigh mixtures with application to noise parameter estimation in magnitude MR datasets, James-Stein shrinkage to improve \(k\)-means cluster analysis, Multi‐stage multivariate modeling of temporal patterns in prescription counts for competing drugs in a therapeutic category, Robust clusterwise linear regression through trimming, SAR image segmentation based on quantum-inspired multiobjective evolutionary clustering algorithm, Latent Features in 
Similarity Judgments: A Nonparametric Bayesian Approach, Web retrieval: Techniques for the aggregation and selection of queries and answers, Spatial distribution preserving-based sparse subspace clustering for hyperspectral image, A Family of Unsupervised Sampling Algorithms, A robust linear grouping algorithm, Building an interpretable fuzzy rule base from data using Orthogonal Least Squares - Application to a depollution problem, Non-convex clustering via proximal alternating linearized minimization method, Robust Linear Clustering, POD and CVT-based reduced-order modeling of Navier-Stokes flows, Bayesian profile regression with an application to the National survey of children's health, Hierarchical Factor Models for Large Spatially Misaligned Data: A Low‐Rank Predictive Process Approach, k-Means Algorithm in Statistical Shape Analysis, Centroidal Voronoi tessellation algorithms for image compression, segmentation, and multichannel restoration, Bootstrapping in a high dimensional but very low-sample size problem, A predictive view of Bayesian clustering, Fast multiscale clustering and manifold identification, Unnamed Item, COMPUTATIONAL INTELLIGENCE METHODS FOR FINANCIAL TIME SERIES MODELING, Evolutionary Rough k-Medoid Clustering, Advanced visualization of self-organizing maps with vector fields, UNSUPERVISED LEARNING BASED DISTRIBUTED DETECTION OF GLOBAL ANOMALIES, Nuclei segmentation for computer-aided diagnosis of breast cancer, Copula analysis of mixture models, Reusable components in decision tree induction algorithms, A comparison of heuristic procedures for minimum within-cluster sums of squares partitioning, Maximum likelihood estimation for multivariate skew normal mixture models, Prequential analysis of complex data with adaptive model reselection, Nonlinear joint latent variable models and integrative tumor subtype discovery, The next‐generation K‐means algorithm, An efficient k‐means‐type algorithm for clustering datasets with incomplete records, 
Clustering Using Objective Functions and Stochastic Search, Latent regression analysis, Heteroscedastic factor mixture analysis, Model misspecification, A case study of using the generalized K-harmonic means method in decision-making processes, A new nonparametric interpoint distance-based measure for assessment of clustering, Simultaneous Registration and Clustering for Multidimensional Functional Data, Improving Spectral Clustering Using the Asymptotic Value of the Normalized Cut, Scalable Bayesian Nonparametric Clustering and Classification, Estimating the Number of Clusters Using Cross-Validation, Ensemble survival trees for identifying subpopulations in personalized medicine, Application of Biostatistics and Bioinformatics Tools to Identify Putative Transcription Factor-Gene Regulatory Network of Ankylosing Spondylitis and Sarcoidosis, Asymptotic properties of bivariate k-means clusters, A tandem clustering process for multimodal datasets, Multi-objective Optimization to Improve Robustness in Networks, A Novel Neural Model With Lateral Interaction for Learning Tasks, Characterization of lung tumor subtypes through gene expression cluster validity assessment, Clustering-Based Model Order Reduction for Nonlinear Network Systems, Unnamed Item, Multivariate response and parsimony for Gaussian cluster-weighted models, Robust clustering of multiply censored data via mixtures of \(t\) factor analyzers, Fast indefinite multi-point (IMP) clustering, High-dimensional clustering via random projections, Numerical studies of MacQueen's \(k\)-means algorithm for computing the centroidal Voronoi tessellations, Visualizing non-hierarchical and hierarchical cluster analysis with clustergrams, Bayesian sparse convex clustering via global-local shrinkage priors, KM-MIC: an improved maximum information coefficient based on K-medoids clustering, A convex hull approach for the reliability-based design optimization of nonlinear transient dynamic problems, An image segmentation 
method based on network clustering model, A soft computing model based on asymmetric Gaussian mixtures and Bayesian inference, The Kohonen self-organizing map method: An assessment, Neighborhood density information in clustering, Benchmarking penalized regression methods in machine learning for single cell RNA sequencing data, Advances in artificial neural networks -- methodological development and application, Clustering using an improved krill herd algorithm, Optimal projection of observations in a Bayesian setting, A new multiphase segmentation method using eigenvectors based on \(K\) real numbers, Experiments on individual strategy updating in iterated snowdrift game under random rematching, Model selection strategies for determining the optimal number of overlapping clusters in additive overlapping partitional clustering, Dimension-reduced clustering of functional data via subspace separation, Frequency and severity estimation of cyber attacks using spatial clustering analysis, Discriminative clustering via extreme learning machine, A Bayesian semiparametric factor analysis model for subtype identification, A hybrid algorithm with cluster analysis in modelling high dimensional data, Dynamic tail dependence clustering of financial time series, On endmember identification in hyperspectral images without pure pixels: a comparison of algorithms, A novel probabilistic clustering model for heterogeneous networks, Seasonal warranty prediction based on recurrent event data, TRANSFORM-ANN for online optimization of complex industrial processes: casting process as case study, Clustering methods for single-cell RNA-sequencing expression data: performance evaluation with varying sample sizes and cell compositions, A semiparametric and location-shift copula-based mixture model, Mini-batch learning of exponential family finite mixture models, Variance-based cluster selection criteria in a \(K\)-means framework for one-mode dissimilarity data, A unified approach to 
functional principal component analysis and functional multiple-set canonical correlation, Cluster analysis of longitudinal profiles with subgroups, Consensus rate-based label propagation for semi-supervised classification, An efficient method for clustered multi-metric learning, Vine copula approximation: a generic method for coping with conditional dependence, Optimal principal points estimators of multivariate distributions of location-scale and location-scale-rotation families, Hyperspectral image unsupervised classification by robust manifold matrix factorization, Fast construction of correcting ensembles for legacy artificial intelligence systems: algorithms and a case study, Multi-period classification: learning sequent classes from temporal domains, Efficient mixture model for clustering of sparse high dimensional binary data, Feature matching and heat flow in centro-affine geometry, Optimized assignment patterns in mobile edge cloud networks, Community detection based on first passage probabilities, Performance of protein-ligand docking with CDK4/6 inhibitors: a case study, Validity indices for clusters of uncertain data objects, Assessing the effective sample size for large spatial datasets: a block likelihood approach, Optimization under uncertainty of a chain of nonlinear resonators using a field representation, A mixed-integer programming approach to GRNN parameter estimation, Topological graph clustering with thin position, An iterated greedy heuristic for a market segmentation problem with multiple attributes, Mixtures of restricted skew-\(t\) factor analyzers with common factor loadings, Distributed consensus-based \(K\)-means algorithm in switching multi-agent networks, Energy-based function to evaluate data stream clustering, A fuzzy edge-weighted centroidal Voronoi tessellation model for image segmentation, Music and timbre segmentation by recursive constrained \(K\)-means clustering, Taxicab correspondence analysis, Multiple ellipses detection 
in noisy environments: A hierarchical approach, Enhanced bisecting \(k\)-means clustering using intermediate cooperation, A parametric \(k\)-means algorithm, Mixture of multivariate \(t\) nonlinear mixed models for multiple longitudinal data with heterogeneity and missing values, Community detection in node-attributed social networks: a survey, Pattern layer reduction for a generalized regression neural network by using a self-organizing map, Machine learning core inflation, Spatiotemporal extended fuzzy C-means clustering algorithm for hotspots detection and prediction, A variational approximations-DIC rubric for parameter estimation and mixture model selection within a family setting, Robust detection of neural spikes using sparse coding based features, Cluster forests, Discriminative clustering on manifold for adaptive transductive classification, Quick-means: accelerating inference for K-means by learning fast transforms, MCEN: a method of simultaneous variable selection and clustering for high-dimensional multinomial regression, Weighted co-association rate-based Laplacian regularized label description for semi-supervised regression, Using an iterative reallocation partitioning algorithm to verify test multidimensionality, Accelerating sequential Monte Carlo with surrogate likelihoods, New measures for comparing matrices and their application to image processing, Sequential approximate optimization for design under uncertainty problems utilizing Kriging metamodeling in augmented input space, MIPLIB 2017: data-driven compilation of the 6th mixed-integer programming library, Mixtures of factor analyzers with covariates for modeling multiply censored dependent variables, Efficient two-scale analysis with thermal residual stresses and strains based on self-consistent clustering analysis, Scalable computational measures for entropic detection of latent relations and their applications to magnetic imaging, Network embedding: taxonomies, frameworks and applications, 
On the behaviour of \(K\)-means clustering of a dissimilarity matrix by means of full multidimensional scaling, An empirical comparison between stochastic and deterministic centroid initialisation for K-means variations, K-bMOM: A robust Lloyd-type clustering algorithm based on bootstrap median-of-means, Wavelet multidimensional scaling analysis of European economic sentiment indicators, Versatile uncertainty quantification of contrastive behaviors for modeling networked anagram games, Dynamics of macroscopic diffusion across meta-populations with top-down and bottom-up approaches: a review, Clustering and representation of time series. Application to dissimilarities based on divergences, A comparative study of time aggregation techniques in relation to power capacity expansion modeling, Model-based clustering of censored data via mixtures of factor analyzers, A new fuzzy clustering algorithm based on multi-objective mathematical programming, A graphical model method for integrating multiple sources of genome-scale data, Optimal bandwidth selection for re-substitution entropy estimation, Combined relevance vector machine technique and subset simulation importance sampling for structural reliability, A Bayesian approach to model individual differences and to partition individuals: case studies in growth and learning curves, Data clustering based on the modified relaxation Cheeger cut model, Consumer price sensitivity in the retail industry: latitude of acceptance with heterogeneous demand