Abstract
Feature selection based on fuzzy rough sets is an effective approach to selecting a compact feature subset that optimally predicts a given decision label. Despite being studied extensively, most existing methods of fuzzy rough set based feature selection are restricted to processing the whole dataset in batch, which is often costly or even intractable for large datasets. To improve time efficiency, we investigate the incremental perspective for fuzzy rough set based feature selection, assuming data can be presented in sample subsets one after another. The key challenge for the incremental perspective is how to add and delete features as sample subsets subsequently arrive. We tackle this challenge with strategies for adding and deleting features based on the relative discernibility relations, which are updated as subsets arrive sequentially. Two incremental algorithms for fuzzy rough set based feature selection are designed based on these strategies. One updates the selected features as each sample subset arrives, and outputs the final feature subset when no sample subset is left. The other updates the relative discernibility relations as subsets arrive, but performs feature selection only when no further subset arrives. Experimental comparisons suggest that our incremental algorithms expedite fuzzy rough set based feature selection without compromising performance.
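The two update schedules described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not define the fuzzy relative discernibility relation, so the sketch assumes a crisp stand-in in which a feature discerns two differently labeled samples when their values differ by more than a hypothetical threshold `eps`. All function names are illustrative.

```python
from itertools import combinations

def discerning_features(x, y, eps):
    """Features whose values differ by more than eps (crisp stand-in, an assumption)."""
    return frozenset(j for j, (a, b) in enumerate(zip(x, y)) if abs(a - b) > eps)

def new_entries(seen, chunk, eps):
    """Discernibility entries for differently labeled pairs involving a new sample."""
    entries = []
    for (xa, la), (xb, lb) in combinations(chunk, 2):  # new vs. new
        if la != lb:
            entries.append(discerning_features(xa, xb, eps))
    for xa, la in chunk:                               # new vs. already seen
        for xb, lb in seen:
            if la != lb:
                entries.append(discerning_features(xa, xb, eps))
    return [e for e in entries if e]  # skip pairs that no feature separates

def add_features(selected, entries):
    """Greedily add features until every entry contains a selected feature."""
    selected = set(selected)
    uncovered = [e for e in entries if not (e & selected)]
    while uncovered:
        counts = {}
        for e in uncovered:
            for j in e:
                counts[j] = counts.get(j, 0) + 1
        best = max(sorted(counts), key=counts.get)  # ties go to the lowest index
        selected.add(best)
        uncovered = [e for e in uncovered if best not in e]
    return selected

def delete_features(selected, entries):
    """Drop each feature whose removal leaves every entry still covered."""
    selected = set(selected)
    for j in sorted(selected):
        rest = selected - {j}
        if all(e & rest for e in entries):
            selected = rest
    return selected

def select_per_chunk(chunks, eps=0.1):
    """Schedule 1: update the selected features as each sample subset arrives."""
    seen, entries, selected = [], [], set()
    for chunk in chunks:
        fresh = new_entries(seen, chunk, eps)
        seen += chunk
        entries += fresh
        # Older entries are already covered, so only fresh ones can force additions.
        selected = delete_features(add_features(selected, fresh), entries)
    return selected

def select_at_end(chunks, eps=0.1):
    """Schedule 2: only update the relation per chunk; select features once at the end."""
    seen, entries = [], []
    for chunk in chunks:
        entries += new_entries(seen, chunk, eps)
        seen += chunk
    return delete_features(add_features(set(), entries), entries)
```

In this sketch both schedules maintain the same set of discernibility entries and differ only in when the add and delete steps run. The sketch stores every entry explicitly, which is enough to show the control flow but says nothing about the efficiency of the published algorithms.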
Original language | English |
---|---|
Pages (from-to) | 1257-1273 |
Number of pages | 17 |
Journal | IEEE Transactions on Fuzzy Systems |
Volume | 26 |
Issue number | 3 |
Early online date | 27 Jun 2017 |
Publication status | Published - Jun 2018 |
Externally published | Yes |
Bibliographical note
This work was supported in part by the NSFC under Grant 71471060, Grant 61170040, Grant 71371063, in part by the JCYJ under Grant 20150324140036825, in part by the Ulster University’s Research Challenge Fund under Grant 70595Q, and in part by Fundamental Research Funds for the Central Universities (2018ZD06).
Keywords
- Attribute reduction
- feature selection
- fuzzy rough sets
- incremental learning
- relative discernibility relation