Abstract
Class imbalance learning tackles supervised learning problems where some classes have significantly more examples than others. Most of the existing research focused only on binary-class cases. In this paper, we study multiclass imbalance problems and propose a dynamic sampling method (DyS) for multilayer perceptrons (MLP). In DyS, for each epoch of the training process, every example is fed to the current MLP and then the probability of it being selected for training the MLP is estimated. DyS dynamically selects informative data to train the MLP. In order to evaluate DyS and understand its strength and weakness, comprehensive experimental studies have been carried out. Results on 20 multiclass imbalanced data sets show that DyS can outperform the compared methods, including pre-sample methods, active learning methods, cost-sensitive methods, and boosting-type methods. © 2012 IEEE.
Original language | English |
---|---|
Article number | 6449324 |
Pages (from-to) | 647-660 |
Number of pages | 14 |
Journal | IEEE Transactions on Neural Networks and Learning Systems |
Volume | 24 |
Issue number | 4 |
Early online date | 4 Feb 2013 |
DOIs | |
Publication status | Published - Apr 2013 |
Externally published | Yes |
Funding
This work was supported in part by the 973 Program of China under Grant 2011CB707006, the National Natural Science Foundation of China under Grant 61175065, Grant U0835002, and Grant 61028009, the National Natural Science Foundation of Anhui Province under Grant 1108085J16, and the European Union Seventh Framework Programme under Grant 247619 and Grant 270428.
Keywords
- Cost-sensitive learning
- dynamic sampling
- multiclass imbalance learning
- multilayer perceptrons