Gaussian prior based adaptive synthetic sampling with non-linear sample space for imbalanced learning

Tianlun ZHANG, Yang LI, Xizhao WANG*

*Corresponding author for this work

Research output: Journal PublicationsJournal Article (refereed)peer-review

11 Citations (Scopus)

Abstract

In the presence of skewed category distribution, most learning algorithms fail to provide favorable performance on the representation about data characteristics. Thus learning from imbalanced data is a crucial challenge in the field of data engineering and knowledge discovery. In this work, we proposed an imbalanced learning method to generate minority samples for the compensation of class distribution skews. Different from existing synthetic over-sampling techniques, the data generation is conducted within the hyperplane rather than on the hyperline, thus the proposed method breaks down the ties imposed by the linear interpolation. In addition, this proposed method minimizes the sampling uncertain and risk by integrating a prior knowledge about the minority class instances. Moreover, a multi-objective optimization combined with error bound model develops this proposed method into an adaptive imbalanced learning. Extensive experiments have been performed on imbalanced issues, and the experimental results demonstrate that this method can improve the performance of different classification algorithms.

Original languageEnglish
Article number105231
JournalKnowledge-Based Systems
Volume191
Early online date18 Nov 2019
DOIs
Publication statusPublished - 5 Mar 2020
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2019 Elsevier B.V.

Keywords

  • Adaptive method
  • Classification algorithm
  • Error bound model
  • Gaussian mixture model
  • Imbalanced learning

Fingerprint

Dive into the research topics of 'Gaussian prior based adaptive synthetic sampling with non-linear sample space for imbalanced learning'. Together they form a unique fingerprint.

Cite this