Abstract
Anomaly detection (AD) is typically regarded as an unsupervised learning task, where the training data either do not contain any anomalous samples or contain only a few unlabeled anomalous samples. In fact, in many real scenarios such as fault diagnosis and disease detection, a small number of anomalous samples labeled by domain experts are often available during the training phase, which makes semi-supervised AD (SAD) more appealing, though the related study is quite limited. Existing semi-supervised AD methods directly add optimization terms of anomalous samples to the optimization objective of unsupervised AD (UAD), where the effects of the limited labeled anomalous data on the optimization process become trivial and they cannot fully contribute to the detection task. To cover the shortage, in this work, we propose a novel semi-supervised AD method to fully use the limited labeled anomalous data and further to boost detection performance. The proposed method learns a nonlinear transformation to project normal data into a compact target distribution and simultaneously to project exposed anomalous samples into another target distribution, where the two target distributions do not overlap each other. The goal is difficult to achieve because of the scarcity of anomalous samples. To address this problem, we propose to generate a large number of intermediate samples interpolating between normal and anomalous data and project them into a third target distribution lying between the aforementioned two target distributions. Empirical results on multiple benchmarks with varying domains demonstrate the superiority of our method over existing supervised and semi-supervised AD methods.
| Original language | English |
|---|---|
| Pages (from-to) | 17966-17977 |
| Number of pages | 12 |
| Journal | IEEE Transactions on Neural Networks and Learning Systems |
| Volume | 36 |
| Issue number | 10 |
| Early online date | 4 Jul 2025 |
| DOIs | |
| Publication status | Published - Oct 2025 |
Bibliographical note
Publisher Copyright:© 2025 IEEE.
Funding
This work was supported in part by the General Program of Natural Science Foundation of Guangdong Province under Grant 2024A1515011771, in part by the National Science Fund for Distinguished Young Scholars of China under Grant 62225303, in part by the Research Impact Fund by RGC of Hong Kong under Project 130272, in part by the InnoHK Initiative of the Government of the HKSAR for the Laboratory for AI-Powered Financial Technologies, and in part by the Math and Application Project through the National Key Research and Development Program under Grant 2021YFA1003504.
Keywords
- Anomaly detection (AD)
- deep learning
- fault detection
- semi-supervised learning