A cost-sensitive ensemble method for class-imbalanced datasets (Q369767)
scientific article; zbMATH DE number 6209205
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined | | |
| English | A cost-sensitive ensemble method for class-imbalanced datasets | scientific article; zbMATH DE number 6209205 | |
Statements
A cost-sensitive ensemble method for class-imbalanced datasets (English)
19 September 2013
Summary: In imbalanced learning, resampling methods modify an imbalanced dataset to form a balanced one, because many base classifiers perform better when trained on balanced data. This paper proposes a cost-sensitive ensemble method, based on cost-sensitive support vector machines (SVMs) and query-by-committee (QBC), for imbalanced data classification. The proposed method first divides the majority-class dataset into several subdatasets according to the proportion of imbalanced samples and trains subclassifiers using the AdaBoost method. It then generates candidate training samples with the QBC active learning method and uses cost-sensitive SVMs to learn from those samples. Experimental results on five class-imbalanced datasets show that the proposed method achieves a higher area under the ROC curve (AUC), F-measure, and G-mean than many existing class-imbalanced learning methods.
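The first step of the summary, dividing the majority class into subdatasets according to the class proportion, might be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `split_majority`, the choice of the rounded imbalance ratio as the number of subdatasets, and the pairing of each majority chunk with the full minority class are all hypothetical details that the summary does not specify.

```python
import random


def split_majority(majority, minority, seed=0):
    """Partition majority samples into chunks of roughly minority size.

    Hypothetical helper: the summary only says the majority class is
    divided "according to the proportion of imbalanced samples".
    """
    if not minority:
        raise ValueError("minority class must not be empty")
    # Assumption: the number of subdatasets is the rounded imbalance ratio.
    n_subsets = max(1, round(len(majority) / len(minority)))
    shuffled = list(majority)
    random.Random(seed).shuffle(shuffled)
    # Strided slicing gives disjoint chunks whose sizes differ by at most one.
    chunks = [shuffled[i::n_subsets] for i in range(n_subsets)]
    # Assumption: each training subset reuses every minority sample.
    return [(chunk, list(minority)) for chunk in chunks]


# Toy data: 90 majority samples and 10 minority samples.
majority = [("neg", i) for i in range(90)]
minority = [("pos", i) for i in range(10)]
subsets = split_majority(majority, minority)
print(len(subsets), [len(maj) for maj, _ in subsets])
```

Each resulting pair is approximately balanced, so a subclassifier (AdaBoost in the summary) could be trained on every pair separately before the QBC and cost-sensitive SVM stages.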
cost-sensitive ensemble method
cost-sensitive support vector machine
query-by-committee
imbalanced learning
imbalanced dataset
AdaBoost