Skip to main navigation Skip to search Skip to main content

Copula Guided Parallel Gibbs Sampling for Nonparametric and Coherent Topic Discovery (Extended Abstract)

  • Lihui LIN
  • , Yanghui RAO
  • , Haoran XIE
  • , Raymond Y. K. LAU
  • , Jian YIN
  • , Fu Lee WANG
  • , Qing LI

Research output: Book Chapters | Papers in Conference ProceedingsConference (Extended Abstracts)peer-review

Abstract

In terms of the generative process, the Gamma-Gamma-Poisson Process (G2PP) is equivalent to the nonparametric topic model of Hierarchical Dirichlet Process (HDP). Considering the high computational cost of estimating parameters in HDP, a parallel G2PP was developed to generate topics efficiently via multi-threading. Unfortunately, the above model needs to predefine the number of topics. To address this issue, we first propose a Topic Self-Adaptive Model (TSAM) for nonparametric and parallel topic discovery. In TSAM, a monitor-executor mechanism is developed to manage the global topic information using a hierarchical structure of threads. Based on the apparatus of copulas, we further extend our TSAM to TSAMcop for coherent topic modeling by exploiting a copula guided parallel Gibbs sampling algorithm. Extensive experiments validate the effectiveness of both TSAM and TSAMcop.
Original languageEnglish
Title of host publicationProceedings - 2023 IEEE 39th International Conference on Data Engineering, ICDE 2023
Pages3823-3824
Number of pages2
ISBN (Electronic)9798350322279
DOIs
Publication statusPublished - 26 Jul 2023
Event2023 IEEE 39th International Conference on Data Engineering (ICDE) - Anaheim, United States
Duration: 3 Apr 20237 Apr 2023
https://icde2023.ics.uci.edu/

Publication series

NameProceedings - International Conference on Data Engineering
Volume2023-April
ISSN (Print)1084-4627

Conference

Conference2023 IEEE 39th International Conference on Data Engineering (ICDE)
Country/TerritoryUnited States
CityAnaheim
Period3/04/237/04/23
Internet address

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

Funding

This work was supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (UGC/FDS16/E01/19), the National Natural Science Foundation of China (61972426), the Direct Grant (DR23B2) and the Faculty Research Grant (DB23A3) of Lingnan University, Hong Kong, a grant from the Research Grants Council of the HKSAR, China (Project: CityU 11507219), and a grant from the City University of Hong Kong SRG (Project: 7005780).

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 9 - Industry, Innovation, and Infrastructure
    SDG 9 Industry, Innovation, and Infrastructure

Keywords

  • copulas
  • parallel gibbs sampling
  • topic modelling

Fingerprint

Dive into the research topics of 'Copula Guided Parallel Gibbs Sampling for Nonparametric and Coherent Topic Discovery (Extended Abstract)'. Together they form a unique fingerprint.

Cite this