An Efficient and Versatile Variational Method for High-Dimensional Data Classification

Xiaohao CAI, Raymond H. CHAN, Xiaoyu XIE, Tieyong ZENG

Research output: Journal PublicationsJournal Article (refereed)peer-review

Abstract

High-dimensional data classification is a fundamental task in machine learning and imaging science. In this paper, we propose an efficient and versatile multi-class semi-supervised classification method for classifying high-dimensional data and unstructured point clouds. To begin with, a warm initialization is generated by using a fuzzy classification method such as the standard support vector machine or random labeling. Then an unconstraint convex variational model is proposed to purify and smooth the initialization, followed by a step which is to project the smoothed partition obtained previously to a binary partition. These steps can be repeated, with the latest result as a new initialization, to keep improving the classification quality. We show that the convex model of the smoothing step has a unique solution and can be solved by a specifically designed primal–dual algorithm whose convergence is guaranteed. We test our method and compare it with the state-of-the-art methods on several benchmark data sets. Thorough experimental results demonstrate that our method is superior in both the classification accuracy and computation speed for high-dimensional data and point clouds.
Original languageEnglish
Article number81
JournalJournal of Scientific Computing
Volume100
Issue number3
Early online date1 Aug 2024
DOIs
Publication statusPublished - 1 Sept 2024

Bibliographical note

Publisher Copyright:
© The Author(s) 2024.

Funding

This work of R. Chan is partially supported by HKRGC Grants No. CityU12500915, CityU14306316, HKRGC CRF Grant C1007-15 G, and HKRGC AoE Grant AoE/M-05/12. This work of T. Zeng is partially supported by the National Natural Science Foundation of China under Grant 11671002, CUHK start-up and CUHK DAG 4053296, 4053342. We thank Prof. Xue-Cheng Tai, Dr Ke Yin, Dr Egil Bae and Prof. Ekaterina Merkurjev for providing the codes of their methods [1, 2].

Keywords

  • Graph Laplacian
  • Point cloud classification
  • Semi-supervised clustering
  • Variational methods

Fingerprint

Dive into the research topics of 'An Efficient and Versatile Variational Method for High-Dimensional Data Classification'. Together they form a unique fingerprint.

Cite this