A novel boundary oversampling algorithm based on neighborhood rough set model: NRSBoundary-SMOTE (Q473988)

From MaRDI portal





scientific article; zbMATH DE number 6372592
Language Label Description Also known as
default for all languages
No label defined
    English
    A novel boundary oversampling algorithm based on neighborhood rough set model: NRSBoundary-SMOTE
    scientific article; zbMATH DE number 6372592

      Statements

      A novel boundary oversampling algorithm based on neighborhood rough set model: NRSBoundary-SMOTE (English)
      0 references
      0 references
      0 references
      24 November 2014
      0 references
      Summary: Rough set theory is a powerful mathematical tool introduced by Pawlak to deal with imprecise, uncertain, and vague information. The Neighborhood-Based Rough Set Model expands the rough set theory; it could divide the dataset into three parts. And the boundary region indicates that the majority class samples and the minority class samples are overlapped. On the basis of what we know about the distribution of original dataset, we only oversample the minority class samples, which are overlapped with the majority class samples, in the boundary region. So, the NRSBoundary-SMOTE can expand the decision space for the minority class; meanwhile, it will shrink the decision space for the majority class. After conducting an experiment on four kinds of classifiers, NRSBoundary-SMOTE has higher accuracy than other methods when C4.5, CART, and KNN are used but it is worse than SMOTE on classifier SVM.
      0 references

      Identifiers