MixSim

From MaRDI portal
Software:19930



swMATH: 7914, CRAN: MixSim, MaRDI QID: Q19930

Simulating Data to Study Performance of Clustering Algorithms

Wei-Chen Chen, Volodymyr Melnykov, Ranjan Maitra

Last update: 5 September 2023

Copyright license: GNU General Public License, version 3.0, GNU General Public License, version 2.0

Software version identifier: 1.1-6, 0.1-01, 0.1-02, 0.1-03, 0.1-04, 1.0-1, 1.0-2, 1.0-3, 1.0-4, 1.0-5, 1.0-7, 1.0-8, 1.0-9, 1.1-1, 1.1-2, 1.1-3, 1.1-4, 1.1-5, 1.1-7

Source code repository: https://github.com/cran/MixSim

This package simulates mixtures of Gaussian distributions with different levels of overlap between mixture components. Pairwise overlap, defined as the sum of two misclassification probabilities, measures the degree of interaction between components and can be used to control the clustering complexity of datasets simulated from mixtures. These datasets can then be used to investigate the performance of clustering and finite mixture modeling algorithms systematically. Other capabilities of 'MixSim' include computing the exact overlap for Gaussian mixtures, simulating Gaussian and non-Gaussian data, simulating outliers and noise variables, calculating various measures of agreement between two partitionings, and constructing parallel distribution plots for the graphical display of finite mixture models.
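To make the definition of pairwise overlap concrete, here is a minimal sketch in Python for the simplest case: two univariate Gaussian components that share one standard deviation. It is not MixSim's API or its algorithm, which covers general multivariate mixtures. The function name `pairwise_overlap` and its parameters are hypothetical. The sketch assumes that a misclassification probability is the chance that a draw from one component has a larger weighted density under the other component.

```python
import math


def normal_cdf(z: float) -> float:
    """Standard normal CDF, computed from the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def pairwise_overlap(pi1, mu1, pi2, mu2, sigma):
    """Overlap of two univariate Gaussian components with a common sigma.

    Hypothetical helper for illustration only (not part of MixSim).
    Returns (omega_21, omega_12, overlap), where omega_21 is the
    probability that a draw from component 1 is assigned to component 2,
    omega_12 is the reverse, and overlap is their sum.
    """
    if sigma <= 0 or mu1 == mu2 or pi1 <= 0 or pi2 <= 0:
        raise ValueError("need sigma > 0, distinct means, positive weights")

    # Decision boundary: the point where pi1 * phi1(x) == pi2 * phi2(x).
    d = mu2 - mu1
    x_star = 0.5 * (mu1 + mu2) - sigma ** 2 * math.log(pi2 / pi1) / d

    # The sign of d says on which side of the boundary each component lies.
    s = 1.0 if d > 0 else -1.0
    omega_21 = normal_cdf(-s * (x_star - mu1) / sigma)
    omega_12 = normal_cdf(s * (x_star - mu2) / sigma)
    return omega_21, omega_12, omega_21 + omega_12


if __name__ == "__main__":
    # Equal weights, means two standard deviations apart.
    print(pairwise_overlap(0.5, 0.0, 0.5, 2.0, 1.0))
    # Moving the means further apart lowers the overlap.
    print(pairwise_overlap(0.5, 0.0, 0.5, 4.0, 1.0))
```

With equal weights the boundary sits at the midpoint of the two means, so both misclassification probabilities are equal. Unequal weights shift the boundary toward the lighter component.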




Related Items (38)

Sparse optimal discriminant clustering
Fully Three-Dimensional Radial Visualization
Unnamed Item
SingleCross-clustering: an algorithm for finding elongated clusters with automatic estimation of outliers and number of clusters
Computational aspects of fitting mixture models via the expectation-maximization algorithm
On the distribution of posterior probabilities in finite mixture models with application in clustering
Assessing trimming methodologies for clustering linear regression data
Semi-supervised model-based clustering with positive and negative constraints
An effective strategy for initializing the EM algorithm in finite mixture models
Probabilistic assessment of model-based clustering
Simulating mixtures of multivariate data with fixed cluster overlap in FSDA library
Unnamed Item
Mini-batch learning of exponential family finite mixture models
Variance-based cluster selection criteria in a \(K\)-means framework for one-mode dissimilarity data
Extending mixtures of factor models using the restricted multivariate skew-normal distribution
Gaussian mixture modeling and model-based clustering under measurement inconsistency
Initializing the EM algorithm in Gaussian mixture models with an unknown number of components
On the expectation-maximization algorithm for Rice-Rayleigh mixtures with application to noise parameter estimation in magnitude MR datasets
On \(K\)-means algorithm with the use of Mahalanobis distances
Convex fuzzy \(k\)-medoids clustering
A note on the formal implementation of the \(K\)-means algorithm with hard positive and negative constraints
Semi-supervised projected model-based clustering
An extension of the \(K\)-means algorithm to clustering skewed data
Merging the components of a finite mixture using posterior probabilities
Inference on the Order of a Normal Mixture
Spatial product partition models
Bootstrapping for Significance of Compact Clusters in Multidimensional Datasets
Root selection in normal mixture models
Finite mixture models and model-based clustering
clustra
MixfMRI
A semiparametric method for clustering mixed data
SAGMM
Manly transformation in finite mixture modeling
An efficient k‐means‐type algorithm for clustering datasets with incomplete records
On the behaviour of \(K\)-means clustering of a dissimilarity matrix by means of full multidimensional scaling
Probability of misclassification in model-based clustering
Unnamed Item

