Diversity creation methods: A survey and categorisation

Gavin BROWN, Jeremy WYATT, Rachel HARRIS, Xin YAO

Research output: Journal Publications, Journal Article (refereed), peer-reviewed

802 Citations (Scopus)

Abstract

Ensemble approaches to classification and regression have attracted a great deal of interest in recent years. These methods can be shown both theoretically and empirically to outperform single predictors on a wide range of tasks. One of the elements required for accurate prediction when using an ensemble is recognised to be error "diversity". However, the exact meaning of this concept is not clear from the literature, particularly for classification tasks. In this paper we first review the varied attempts to provide a formal explanation of error diversity, including several heuristic and qualitative explanations in the literature. For completeness of discussion we include not only the classification literature but also some excerpts of the rather more mature regression literature, which we believe can still provide some insights. We proceed to survey the various techniques used for creating diverse ensembles, and categorise them, forming a preliminary taxonomy of diversity creation methods. As part of this taxonomy we introduce the idea of implicit and explicit diversity creation methods, and three dimensions along which these may be applied. Finally we propose some new directions that may prove fruitful in understanding classification error diversity. © 2004 Elsevier B.V. All rights reserved.
Original language: English
Pages (from-to): 5-20
Number of pages: 16
Journal: Information Fusion
Volume: 6
Issue number: 1
Early online date: 29 May 2004
Publication status: Published - Mar 2005
Externally published: Yes

Bibliographical note

R. Harris would like to acknowledge the support of a BTExact and EPSRC CASE studentship.

Keywords

  • Diversity
  • Ensemble
  • Neural networks
  • Survey
  • Taxonomy
