A robust approach to model-based classification based on trimming and constraints. Semi-supervised learning in presence of outliers and label noise
From MaRDI portal
Publication:2201323
Abstract: In a standard classification framework a set of trustworthy learning data is employed to build a decision rule, with the final aim of classifying unlabelled units belonging to the test set. Therefore, unreliable labelled observations, namely outliers and data with incorrect labels, can strongly undermine the classifier performance, especially if the training size is small. The present work introduces a robust modification to the Model-Based Classification framework, employing impartial trimming and constraints on the ratio between the maximum and the minimum eigenvalue of the group scatter matrices. The proposed method effectively handles noise in both response and explanatory variables, providing reliable classification even when dealing with contaminated datasets. A robust information criterion is proposed for model selection. Experiments on real and simulated data, artificially adulterated, are provided to underline the benefits of the proposed method.
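The two ingredients named in the abstract, impartial trimming and an eigenvalue-ratio constraint, can be illustrated with a rough sketch. This is not the authors' algorithm: the function names (`constrain_eigenvalue_ratio`, `trimming_mask`), the Gaussian group densities, and in particular the crude "raise small eigenvalues to a floor" rule are assumptions made for illustration only. The constrained estimators in the cited literature use a more careful truncation of the eigenvalues.

```python
import numpy as np

def constrain_eigenvalue_ratio(covs, c):
    """Crude illustration: raise small eigenvalues so that, across all
    groups, (largest eigenvalue) / (smallest eigenvalue) <= c.
    Assumption: a simple floor at lam_max / c, not an optimal truncation."""
    decomps = [np.linalg.eigh(S) for S in covs]
    lam_max = max(vals.max() for vals, _ in decomps)
    floor = lam_max / c
    # Rebuild each matrix as V diag(max(vals, floor)) V^T.
    return [(vecs * np.maximum(vals, floor)) @ vecs.T for vals, vecs in decomps]

def gaussian_logpdf(X, mean, cov):
    """Log-density of a multivariate normal evaluated at each row of X."""
    d = X.shape[1]
    diff = X - mean
    maha = np.sum(diff * np.linalg.solve(cov, diff.T).T, axis=1)
    _, logdet = np.linalg.slogdet(cov)
    return -0.5 * (d * np.log(2.0 * np.pi) + logdet + maha)

def trimming_mask(X, y, means, covs, priors, alpha):
    """Impartial trimming for labelled data: score each unit by
    log(prior * density) under its own labelled group, then flag the
    floor(alpha * n) lowest-scoring units as trimmed (mask value False)."""
    n = X.shape[0]
    logdens = np.column_stack(
        [gaussian_logpdf(X, m, S) for m, S in zip(means, covs)]
    )
    scores = np.log(priors)[y] + logdens[np.arange(n), y]
    k = int(np.floor(alpha * n))
    keep = np.ones(n, dtype=bool)
    keep[np.argsort(scores)[:k]] = False
    return keep
```

In an iterative fit, one would typically alternate these steps: re-estimate the group parameters from the units currently kept, apply the eigenvalue constraint, and recompute the trimming mask. A unit with a wrong label tends to receive a low score under the group it is labelled with, which is why trimming can address label noise as well as outliers.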
Recommendations
- Anomaly and novelty detection for robust semi-supervised learning
- Robust supervised classification with mixture models: learning from data with uncertain labels
- Robust variable selection for model-based learning in presence of adulteration
- Learning kernel logistic regression in the presence of class label noise
- A robust discriminant approach and its application
Cites work
- scientific article; zbMATH DE number 3673370
- scientific article; zbMATH DE number 3567782
- scientific article; zbMATH DE number 2034569
- scientific article; zbMATH DE number 837698
- scientific article; zbMATH DE number 845707
- A constrained robust proposal for mixture modeling avoiding spurious solutions
- A decision-theoretic generalization of on-line learning and an application to boosting
- A fast algorithm for robust constrained clustering
- A general trimming approach to robust cluster analysis
- A likelihood-based constrained algorithm for multivariate normal mixture models
- A review of robust clustering methods
- A reweighting approach to robust clustering
- Avoiding spurious local maximizers in mixture modeling
- Best approximations to random variables based on trimming procedures
- Class noise vs. attribute noise: A quantitative study of their impacts
- Density-based silhouette diagnostics for clustering methods
- Estimating common principal components in high dimensions
- Estimating the dimension of a model
- Exploring the number of groups in robust model-based clustering
- Finding the Number of Normal Groups in Model-Based Clustering via Constrained Likelihoods
- High-Breakdown Linear Discriminant Analysis
- Model-Based Clustering, Discriminant Analysis, and Density Estimation
- Model-Based Gaussian and Non-Gaussian Clustering
- Multivariate Clustering Procedures with Variable Metrics
- Noise modelling and evaluating learning from examples
- On the breakdown point of multivariate location estimators based on trimming procedures
- Regularized Gaussian Discriminant Analysis Through Eigenvalue Decomposition
- Robust fitting of mixtures using the trimmed likelihood estimator
- Robust inference for parsimonious model-based clustering
- Robust supervised classification with mixture models: learning from data with uncertain labels
- Support-vector networks
- The EM Algorithm and Extensions, 2E
- The distribution of the likelihood ratio for mixtures of densities from the one-parameter exponential family
- The joint role of trimming and constraints in robust estimation for mixtures of Gaussian factor analyzers
- Trimmed \(k\)-means: An attempt to robustify quantizers
- Using Unlabelled Data to Update Classification Rules with Applications in Food Authenticity Studies
Cited in (7)
- Robust variable selection for model-based learning in presence of adulteration
- Distance-based directional depth classifiers: a robustness study
- Harmless label noise and informative soft-labels in supervised classification
- Anomaly and novelty detection for robust semi-supervised learning
- Robust image classification
- Consistency factor for the MCD estimator at the Student-\(t\) distribution
- Robust supervised classification with mixture models: learning from data with uncertain labels