Abstract
We present a probabilistic semi-supervised learning (SSL) framework based on sparse graph structure learning. Different from existing SSL methods with either a predefined weighted graph heuristically constructed from the input data or a learned graph based on the locally linear embedding assumption, the proposed SSL model is capable of learning a sparse weighted graph from the unlabeled high-dimensional data and a small amount of labeled data, as well as dealing with the noise of the input data. Our representation of the weighted graph is indirectly derived from a unified model of density estimation and pairwise distance preservation in terms of various distance measurements, where latent embeddings are assumed to be random variables following an unknown density function to be learned, and pairwise distances are then calculated as the expectations over the density for the model robustness to the data noise. Moreover, the labeled data based on the same distance representations are leveraged to guide the estimated density for better class separation and sparse graph structure learning. A simple inference approach for the embeddings of unlabeled data based on point estimation and kernel representation is presented. Extensive experiments on various data sets show promising results in the setting of SSL compared with many existing methods and significant improvements on small amounts of labeled data.
Original language | English |
---|---|
Article number | 9063663 |
Pages (from-to) | 853-867 |
Number of pages | 15 |
Journal | IEEE Transactions on Neural Networks and Learning Systems |
Volume | 32 |
Issue number | 2 |
DOIs | |
Publication status | Published - Feb 2021 |
Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2012 IEEE.
Funding
The work of Raymond Chan was supported in part by the Hong Kong Research Grants Council (HKRGC) under Grant CUHK14306316 and Grant CUHK14301718, in part by the City University of Hong Kong (CityU) under Grant 9380101, and in part by the Collaborative Research Fund (CRF) under Grant C1007-15G and Grant AoE/M-05/12. The work of Tieyong Zeng was supported in part by the National Natural Science Foundation of China (NSFC) under Grant 11671002, in part by The Chinese University of Hong Kong (CUHK) start-up under Grant CUHK DAG 4053342, in part by the Research Grants Council (RGC) under Grant 14300219, and in part by the NSFC/RGC under Grant N_CUHK 415/19.
Keywords
- Graph structure learning
- kernel learning
- latent variable model
- semi-supervised learning (SSL)