Abstract
Symmetric nonnegative matrix factorization (SNMF) has demonstrated to be a powerful method for data clustering. However, SNMF is mathematically formulated as a non-convex optimization problem, making it sensitive to the initialization of variables. Inspired by ensemble clustering that aims to seek a better clustering result from a set of clustering results, we propose self-supervised SNMF (S3NMF), which is capable of boosting clustering performance progressively by taking advantage of the sensitivity to initialization characteristic of SNMF, without relying on any additional information. Specifically, we first perform SNMF repeatedly with a random positive matrix for initialization each time, leading to multiple decomposed matrices. Then, we rank the quality of the resulting matrices with adaptively learned weights, from which a new similarity matrix that is expected to be more discriminative is reconstructed for SNMF again. These two steps are iterated until the stopping criterion/maximum number of iterations is achieved. We mathematically formulate S3NMF as a constrained optimization problem, and provide an alternative optimization algorithm to solve it with the theoretical convergence guaranteed. Extensive experimental results on 10 commonly used benchmark datasets demonstrate the significant advantage of our S3NMF over 14 state-of-the-art methods in terms of 5 quantitative metrics. The source code is publicly available at https://github.com/jyh-learning/SSSNMF.
Original language | English |
---|---|
Pages (from-to) | 4526-4537 |
Number of pages | 12 |
Journal | IEEE Transactions on Circuits and Systems for Video Technology |
Volume | 32 |
Issue number | 7 |
Early online date | 18 Nov 2021 |
DOIs | |
Publication status | Published - Jul 2022 |
Externally published | Yes |
Funding
This work was supported in part by the Hong Kong Research Grants Council under Grant CityU 11219019, Grant 11202320, and Grant 11218121; in part by the National Natural Science Foundation of China under Grant 62106044; in part by the Natural Science Foundation of Jiangsu Province under Grant BK20210221; and in part by the Jiangsu Provincial Double-Innovation Doctor Program under Grant JSSCBS20210083. The work of Hui Liu was supported by the Hong Kong University Grants Committee under the Institutional Development Scheme Research Infrastructure Grant UGC/IDS11/19.
Keywords
- clustering
- dimensionality reduction
- Symmetric nonnegative matrix factorization