TY - GEN
T1 - Rectified Encoder Network for High-Dimensional Imbalanced Learning
AU - ZHENG, Tao
AU - CHEN, Wei-Jie
AU - TSANG, Ivor
AU - YAO, Xin
PY - 2019
Y1 - 2019
N2 - Many existing works have studied the learning on imbalanced data, however, it is still very challenging to handle high-dimensional imbalanced data. One key challenge of learning on imbalanced data is that most learning models usually have a bias towards the majority and its performance will deteriorate in the presence of underrepresented data and severe class distribution skews. One solution is to synthesize the minority data to balance the class distribution, but it may lead to more overlapping, especially in the high-dimensional setting. To alleviate the above challenges, in this paper, we present a novel Rectified Encoder Network (REN) for high-dimensional imbalanced learning tasks. The main contribution is that: (1) To deal with high-dimensionality, REN encodes high-dimensional imbalanced data into low dimensional latent codes as a latent representation. (2) To obtain a discriminative representation, we introduce a Rectifier to match the latent codes with our proposed Predefined Codes, which disentangles the overlapping among classes. (3) During rectification, in the Predefined Latent Distribution, we can efficiently identify and generate informative samples to maintain the balance of class distribution, so that the minority classes will not be neglected. The experimental results on several high-dimensional and image imbalanced data sets indicate that our REN obtains good representation code for classification and visualize the reason why REN gets better performance in high-dimensional imbalanced learning. © 2019, Springer Nature Switzerland AG.
AB - Many existing works have studied the learning on imbalanced data, however, it is still very challenging to handle high-dimensional imbalanced data. One key challenge of learning on imbalanced data is that most learning models usually have a bias towards the majority and its performance will deteriorate in the presence of underrepresented data and severe class distribution skews. One solution is to synthesize the minority data to balance the class distribution, but it may lead to more overlapping, especially in the high-dimensional setting. To alleviate the above challenges, in this paper, we present a novel Rectified Encoder Network (REN) for high-dimensional imbalanced learning tasks. The main contribution is that: (1) To deal with high-dimensionality, REN encodes high-dimensional imbalanced data into low dimensional latent codes as a latent representation. (2) To obtain a discriminative representation, we introduce a Rectifier to match the latent codes with our proposed Predefined Codes, which disentangles the overlapping among classes. (3) During rectification, in the Predefined Latent Distribution, we can efficiently identify and generate informative samples to maintain the balance of class distribution, so that the minority classes will not be neglected. The experimental results on several high-dimensional and image imbalanced data sets indicate that our REN obtains good representation code for classification and visualize the reason why REN gets better performance in high-dimensional imbalanced learning. © 2019, Springer Nature Switzerland AG.
KW - Characteristics extraction
KW - High-dimensionality
KW - Imbalanced learning
KW - Representation learning
UR - http://www.scopus.com/inward/record.url?scp=85072866437&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-29911-8_53
DO - 10.1007/978-3-030-29911-8_53
M3 - Conference paper (refereed)
SN - 9783030299101
T3 - Lecture Notes in Computer Science
SP - 684
EP - 697
BT - PRICAI 2019: Trends in Artificial Intelligence : 16th Pacific Rim International Conference on Artificial Intelligence, Cuvu, Yanuca Island, Fiji, August 26–30, 2019, Proceedings, Part II
A2 - NAYAK, Abhaya C.
A2 - SHARMA, Alok
PB - Springer
T2 - 16th Pacific Rim International Conference on Artificial Intelligence
Y2 - 26 August 2019 through 30 August 2019
ER -