Abstract
We focus on developing a novel scalable graph-based semi-supervised learning (SSL) method for input data consisting of a small amount of labeled data and a large amount of unlabeled data. Due to the lack of labeled data and the availability of large-scale unlabeled data, existing SSL methods usually either encounter suboptimal performance because of an improper graph constructed from input data or are impractical due to the high-computational complexity of solving large-scale optimization problems. In this paper, we propose to address both problems by constructing a novel graph of input data for graph-based SSL methods. A density-based approach is proposed to learn a latent graph from input data. Based on the latent graph, a novel graph construction approach is proposed to construct the graph of input data by an efficient formula. With this formula, two transductive graph-based SSL methods are devised with the computational complexity linear in the number of input data points. Extensive experiments on synthetic data and real datasets demonstrate that the proposed methods not only are scalable for large-scale data, but also achieve good classification performance, especially for an extremely small number of labeled data.
Original language | English |
---|---|
Title of host publication | Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2022, Proceedings |
Editors | Massih-Reza AMINI, Stéphane CANU, Asja FISCHER, Tias GUNS, Petra KRALJ NOVAK, Grigorios TSOUMAKAS |
Publisher | Springer, Cham |
Pages | 367-383 |
Number of pages | 17 |
ISBN (Electronic) | 9783031264122 |
ISBN (Print) | 9783031264115 |
DOIs | |
Publication status | Published - 2023 |
Externally published | Yes |
Event | 22nd Joint European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2022 - Grenoble, France Duration: 19 Sept 2022 → 23 Sept 2022 |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Volume | 13716 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 22nd Joint European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2022 |
---|---|
Country/Territory | France |
City | Grenoble |
Period | 19/09/22 → 23/09/22 |
Bibliographical note
Publisher Copyright:© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Keywords
- Graph structure learning
- Graph-based semi-supervised learning
- Large-scale learning