Abstract
This article explores the problem of semisupervised affinity matrix learning, that is, learning an affinity matrix of data samples under the supervision of a small number of pairwise constraints (PCs). By observing that both the matrix encoding PCs, called pairwise constraint matrix (PCM) and the empirically constructed affinity matrix (EAM), express the similarity between samples, we assume that both of them are generated from a latent affinity matrix (LAM) that can depict the ideal pairwise relation between samples. Specifically, the PCM can be thought of as a partial observation of the LAM, while the EAM is a fully observed one but corrupted with noise/outliers. To this end, we innovatively cast the semisupervised affinity matrix learning as the recovery of the LAM guided by the PCM and EAM, which is technically formulated as a convex optimization problem. We also provide an efficient algorithm for solving the resulting model numerically. Extensive experiments on benchmark datasets demonstrate the significant superiority of our method over state-of-the-art ones when used for constrained clustering and dimensionality reduction. The code is publicly available at https://github.com/jyh-learning/LAM.
Original language | English |
---|---|
Pages (from-to) | 7919-7930 |
Journal | IEEE Transactions on Cybernetics |
Volume | 52 |
Issue number | 8 |
Early online date | 8 Jan 2021 |
DOIs | |
Publication status | Published - Aug 2022 |
Externally published | Yes |
Bibliographical note
This article was recommended by Associate Editor S. Cruces.Funding
This work was supported in part by the Key Project of Science and Technology Innovation 2030 through the Ministry of Science and Technology of China under Grant 2018AAA0101301; in part by the Natural Science Foundation of China under Grant 61871342, Grant 61772344, and Grant 61672443; in part by the Hong Kong RGC General Research Funds under Grant 9042820 (CityU 11219019), Grant 9042816 (CityU 11209819), and Grant 9048123 (CityU 21211518); and in part by the Southeast University Start-Up Grant for New Faculty under Grant 1109007129.
Keywords
- Clustering
- graph learning
- semisupervised learning