scutr (Q93101)

From MaRDI portal





Balancing Multiclass Datasets for Classification Tasks
Language Label Description Also known as
default for all languages
No label defined
    English
    scutr
    Balancing Multiclass Datasets for Classification Tasks

      Statements

      0 references
      0.1.2
      24 June 2021
      0 references
      0.2.0
      17 November 2023
      0 references
      0 references
      0 references
      17 November 2023
      0 references
      Imbalanced training datasets impede many popular classifiers. To balance training data, a combination of oversampling minority classes and undersampling majority classes is useful. This package implements the SCUT (SMOTE and Cluster-based Undersampling Technique) algorithm as described in Agrawal et. al. (2015) <doi:10.5220/0005595502260234>. Their paper uses model-based clustering and synthetic oversampling to balance multiclass training datasets, although other resampling methods are provided in this package.
      0 references
      0 references
      0 references
      0 references
      0 references

      Identifiers

      0 references