Unsupervised feature selection using clustering ensembles and population based incremental learning algorithm

Yi HONG, Sam KWONG, Yuchou CHANG, Qingsheng REN

Research output: Journal PublicationsJournal Article (refereed)peer-review

137 Citations (Scopus)

Abstract

This paper describes a novel feature selection algorithm for unsupervised clustering, that combines the clustering ensembles method and the population based incremental learning algorithm. The main idea of the proposed unsupervised feature selection algorithm is to search for a subset of all features such that the clustering algorithm trained on this feature subset can achieve the most similar clustering solution to the one obtained by an ensemble learning algorithm. In particular, a clustering solution is firstly achieved by a clustering ensembles method, then the population based incremental learning algorithm is adopted to find the feature subset that best fits the obtained clustering solution. One advantage of the proposed unsupervised feature selection algorithm is that it is dimensionality-unbiased. In addition, the proposed unsupervised feature selection algorithm leverages the consensus across multiple clustering solutions. Experimental results on several real data sets demonstrate that the proposed unsupervised feature selection algorithm is often able to obtain a better feature subset when compared with other existing unsupervised feature selection algorithms. © 2008 Elsevier Ltd. All rights reserved.
Original languageEnglish
Pages (from-to)2742-2756
JournalPattern Recognition
Volume41
Issue number9
DOIs
Publication statusPublished - Sept 2008
Externally publishedYes

Funding

The work was partially supported by a grant from the Research Grants Council of Hong Kong Special Administrative Region, China Project No. 9041236/CityU 114707. The authors would like to thank the comments and suggestions from the reviewers.

Keywords

  • Clustering ensembles
  • Dimensionality unbiased
  • Population based incremental learning algorithm
  • Unsupervised feature selection

Fingerprint

Dive into the research topics of 'Unsupervised feature selection using clustering ensembles and population based incremental learning algorithm'. Together they form a unique fingerprint.

Cite this