Estimating the number of clusters in a data set via the gap statistic
From MaRDI portal
Publication:65481
DOI10.1111/1467-9868.00293zbMATH Open0979.62046OpenAlexW2071949631MaRDI QIDQ65481FDOQ65481
Trevor Hastie, Robert Tibshirani, Guenther Walther
Publication date: 1 July 2001
Published in: Journal of the Royal Statistical Society. Series B. Statistical Methodology (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1111/1467-9868.00293
Cited In (only showing first 100 items - show all)
- A Unified Framework for Change Point Detection in High-Dimensional Linear Models
- Clustering confidence sets
- Clustering with the average silhouette width
- Estimating robot strengths with application to selection of alliance members in FIRST robotics competitions
- Generalized \(k\)-means in GLMs with applications to the outbreak of COVID-19 in the United States
- Functional distributional clustering using spatio-temporal data
- Validating clusters with the lower bound for sum-of-squares error
- A review on spectral clustering and stochastic block models
- Bayes factors in the presence of population stratification
- Likelihood ratio test for partial sphericity in high and ultra-high dimensions
- The cluster graphical Lasso for improved estimation of Gaussian graphical models
- Multilevel Functional Clustering Analysis
- An integrative pathway-based clinical-genomic model for cancer survival prediction
- Cluster analysis of longitudinal profiles with subgroups
- Vector quantization of amino acids: Analysis of the HIV V3 loop region
- A divisive clustering method for functional data with special consideration of outliers
- \(\gamma\)-SUP: a clustering algorithm for cryo-electron microscopy images of asymmetric particles
- Problems in gene clustering based on gene expression data
- Statistical challenges in functional genomics. (With comments and a rejoinder).
- A novel bagging approach for variable ranking and selection via a mixed importance measure
- High-dimensional variable selection with the plaid mixture model for clustering
- Fast wavelet-based stochastic simulation using training images
- Sparse \(\ell_ {1}\) regularisation of matrix valued models for acoustic source characterisation
- The hierarchical spectral merger algorithm: a new time series clustering procedure
- Spatial associations in global household bicycle ownership
- A regionalisation approach for rainfall based on extremal dependence
- On the nonparametric maximum likelihood estimator for Gaussian location mixture densities with application to Gaussian denoising
- Determining the Number of Clusters Using Multivariate Ranks
- Reducing data dimension for cluster detection
- Subspace clustering of high-dimensional data: a predictive approach
- Online phenotype discovery based on minimum classification error model
- Multimodal Language Acquisition Based on Motor Learning and Interaction
- Novel soft subspace clustering with multi-objective evolutionary approach for high-dimensional data
- Variational learning of a Dirichlet process of generalized Dirichlet distributions for simultaneous clustering and feature selection
- Use of symmetry and stability for data clustering
- Stability-based validation of bicluster solutions
- Sensor fusion for SLAM based on information theory
- A multivariate uniformity test for the case of unknown support
- Exhaustivek-nearest-neighbour subspace clustering
- Spontaneous Clustering via Minimum Gamma-Divergence
- Identification of relevant subtypes via preweighted sparse clustering
- Cross validation in LASSO and its acceleration
- The Kullback information criterion for mixture regression models
- Innovation in the cluster validating techniques
- Estimating and clustering curves in the presence of heteroscedastic errors
- Suboptimal comparison of partitions
- Nonparametric cluster significance testing with reference to a unimodal null distribution
- MCS: A method for finding the number of clusters
- Data filtering for cluster analysis by \(\ell _0\)-norm regularization
- Linearized alternating direction method of multipliers for sparse group and fused Lasso models
- Visual stability analysis for model selection in graded possibilistic clustering
- Agglomerative and divisive hierarchical Bayesian clustering
- Partition of interval-valued observations using regression
- Practical shape analysis and segmentation methods for point cloud models
- A sequential clustering algorithm with applications to gene expression data
- Segmentation uncertainty in multiple change-point models
- Clustering of time series using quantile autocovariances
- Title not available (Why is that?)
- Dynamic Tensor Clustering
- Model-based hierarchical clustering with Bregman divergences and Fishers mixture model: application to depth image analysis
- Optimized profitability of LFP and NMC Li-ion batteries in residential PV applications
- Alpha geodesic distances for clustering of shapes
- Model-based linear clustering
- Self-learning \(K\)-means clustering: a global optimization approach
- Estimating the number of clusters in a ranking data context
- Convex clustering for binary data
- Semiparametric partial common principal component analysis for covariance matrices
- Identifying Functional Connectivity in Large-Scale Neural Ensemble Recordings: A Multiscale Data Mining Approach
- Temporal gap statistic: a new internal index to validate time series clustering
- Temporally consistent tone mapping of images and video using optimal \(K\)-means clustering
- Multiscale blind source separation
- Markov-switching state space models for uncovering musical interpretation
- KM-MIC: an improved maximum information coefficient based on K-medoids clustering
- On the use of quantile regression to deal with heterogeneity: the case of multi-block data
- Cluster-based feedback control of turbulent post-stall separated flows
- The cluster correlation-network support vector machine for high-dimensional binary classification
- Some clustering-based exact distribution-free \(k\)-sample tests applicable to high dimension, low sample size data
- Clustering transformed compositional data usingK-means, with applications in gene expression and bicycle sharing system data
- Network modeling in biology: statistical methods for gene and brain networks
- A confusion index for measuring separation and clustering
- Subject-treatment interactions in crossover trials: performance evaluation of subgrouping methods
- Pattern layer reduction for a generalized regression neural network by using a self-organizing map
- Model-based feature selection and clustering of RNA-seq data for unsupervised subtype discovery
- Improving Spectral Clustering Using the Asymptotic Value of the Normalized Cut
- Determine the number of clusters by data augmentation
- Finding the Event Structure of Neuronal Spike Trains
- Model selection strategies for determining the optimal number of overlapping clusters in additive overlapping partitional clustering
- Poisson Kernel-Based Clustering on the Sphere: Convergence Properties, Identifiability, and a Method of Sampling
- Optimality of spectral clustering in the Gaussian mixture model
- \(K\)-means cloning: adaptive spherical \(K\)-means clustering
- Multiscale clustering for functional data
- Finding groups in structural equation modeling through the partial least squares algorithm
- On the behaviour of \(K\)-means clustering of a dissimilarity matrix by means of full multidimensional scaling
- A graph clustering approach to localization for adaptive covariance tuning in data assimilation based on state-observation mapping
- Deformation analysis in tunnels through curve clustering
- Title not available (Why is that?)
- Overlapping radial basis function interpolants for spectrally accurate approximation of functions of eigenvalues with application to buckling of composite plates
- A new internal index based on density core for clustering validation
- An empirical comparison between stochastic and deterministic centroid initialisation for K-means variations
- K-bMOM: A robust Lloyd-type clustering algorithm based on bootstrap median-of-means
This page was built for publication: Estimating the number of clusters in a data set via the gap statistic
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q65481)