进化双层自适应局部特征选择

Translated title of the contribution: Evolutionary bi-level adaptive local feature selection

高麟, 周宇, 鄺得互

Research output: Journal PublicationsJournal Article (refereed)peer-review

Abstract

局部特征选择(LFS)方法将样本空间划分为多个局部区域并为每个区域选择最优特征子集以反应局部异质信息。然而,现有的LFS方法以每个样本为中心划分局部区域并找到最优特征子集,导致优化效率低下且适用场景受限。为了解决这个问题,提出一种进化双层自适应局部特征选择(BiLFS)算法。LFS问题被建模为双层优化问题,特征子集和待优化局部区域是该问题的两个决策变量。在问题的上层,使用非支配排序遗传算法-II求解被选择的局部区域的最优特征子集,区域纯度和被选择特征比率是目标函数。在问题的下层,根据上层求解的最优特征子集,首先使用局部区域聚类分析得到区域内的中心样本,然后通过局部区域融合来消除非必要区域并更新必要区域的种群。在11个UCI数据集上的测试表明,相较于基于进化算法的非自适应LFS方法,BiLFS的平均分类准确率达到前者的98.48%,且平均所需计算用时仅为前者9.51%,运算效率得到大幅提升达到基于线性规划的LFS方法的水准。对迭代过程中BiLFS算法选择的用于优化的局部区域进行可视化分析,结果表明,BiLFS选择必要局部区域具有稳定性和可靠性。

Abstract: Local feature selection (LFS) methods partition the sample space into multiple local regions and select the optimal feature subset for each region to reflect local heterogeneous information. However, existing LFS methods partition local regions around each sample and find the optimal feature subset, resulting in low optimization efficiency and limited applicability. To address this issue, a novel Evolutionary Bi-level Adaptive Local Feature Selection (BiLFS) algorithm is proposed. The LFS problem is formulated as a bi-level optimization problem, with feature subsets and locally optimized regions as the decision variables. At the upper level, Non-Dominated Sorting Genetic Algorithm-II is employed to find the optimal feature subset for the selected local regions, with region purity and selected feature ratio as the objective functions. At the lower level, based on the upper-level solution, local region clustering analysis is used to obtain center samples within each region, followed by local region fusion to eliminate unnecessary regions and update the population of necessary regions. Experimental results on 11 UCI datasets demonstrate that compared to non-adaptive LFS methods based on evolutionary algorithms, BiLFS achieves an average classification accuracy of 98.48%, with an average computation time of only 9.51%, significantly improving computational efficiency to the level of Linear Programming-based LFS methods. Visual analysis of the locally optimized regions selected by the BiLFS algorithm during the iteration process indicates the stability and reliability of selecting necessary local regions.
Translated title of the contributionEvolutionary bi-level adaptive local feature selection
Original languageChinese (Simplified)
Journal计算机应用
DOIs
Publication statusE-pub ahead of print - 17 Apr 2024

Keywords

  • feature selection
  • bi-level optimization
  • genetic algorithm
  • multi-objective
  • optimization
  • clustering
  • 特征选择
  • 双层优化
  • 遗传算法
  • 多目标优化
  • 聚类

Fingerprint

Dive into the research topics of 'Evolutionary bi-level adaptive local feature selection'. Together they form a unique fingerprint.

Cite this