Abstract
Markov Blanket (MB) is a currently popular approach to feature selection that helps to effectively select correlated features and eliminate redundant features. However, existing MB-based approaches involve complex computations and extensive search. Therefore, we propose a novel concept, Class-specific Approximate Markov Blanket (CSAMB), to solve the above two problems from a class-specific perspective. This concept involves the transformation of decision attributes and features in the specific class using a proposed Rough Set-based Mapping (RSM) method, facilitating the selection results with high classification correlation and low inter-redundancy. The RSM not only preserves the positive, negative and boundary regions of a specific class with respect to a given feature, but also accurately quantifies the relationship between features within that class. Notably, we explore the approximate upper and lower bounds of grouping of correlation features via CSAMB. We then design a CSAMB-based algorithm, and extend it to two variants: CSAMB-min and CSAMB-max using the approximate upper and lower bounds, which demonstrates the performance range of our algorithm. Experiments shows that our algorithms outperform state-of-the-art algorithms regarding accuracy and efficiency, especially for large-scale and high-dimensional datasets.
Original language | English |
---|---|
Article number | 113154 |
Journal | Applied Soft Computing |
Volume | 176 |
Early online date | 23 Apr 2025 |
DOIs | |
Publication status | Published - May 2025 |
Bibliographical note
Publisher Copyright:© 2025 Elsevier B.V.
Funding
This work was supported by the National Natural Science Foundation of China (72271063, 71871069), Guangdong Province Philosophy and Social Science Planning 2024 Annual General Project (GD24CGL45), The Open Research Fund of Guangxi Key Lab of Human-machine Interaction (GXHID2203), National Social Science Foundation of China (24BGL052), Guangdong Province Philosophy and Social Science Planning 2022 Discipline Co-construction Funds (GD22XGL27), Natural Science Foundation of Guangdong Province (2025A1515010598).
Keywords
- Feature selection
- Markov Blanket
- Class-specific perspective
- Rough set