Leveraging ChatGPT-based augmentation and contrastive learning for Chinese MOOC sentiment analysis

Research output: Journal PublicationsJournal Article (refereed)peer-review

1 Citation (Scopus)

Abstract

This study addresses the unique challenges of sentiment analysis in Chinese massive open online course (MOOC) reviews, where pedagogically embedded language, intra-sentence sentiment shifts, and class imbalance complicate classification tasks. To tackle these domain-specific issues, we integrated ChatGPT-based data augmentation with contrastive learning within a Bidirectional Encoder Representations from Transformers (BERT)–Chinese framework. We evaluated ChatGPT-based augmentation (GPTaug), similar word replacement, and random word deletion under a dual-loss setup that combines supervised cross-entropy and InfoNCE (information noise-constrastive estimation) contrastive learning, focusing on how they enhance model performance across sentiment categories. The results revealed that the integration of contrastive learning with data augmentation strategies substantially improved sentiment classification in Chinese MOOC reviews. Especially, GPTaug demonstrated robust and balanced performance across polarity categories, particularly enhancing the detection of underrepresented neutral sentiments. These findings suggest that generative augmentation, when aligned with contrastive objectives, mitigates data sparsity and semantic ambiguity in educational sentiment analysis.

Original languageEnglish
Pages (from-to)64-72
Number of pages9
JournalIEEE Intelligent Systems
Volume40
Issue number6
Early online date11 Aug 2025
DOIs
Publication statusPublished - Nov 2025

Bibliographical note

Publisher Copyright:
© 2001-2011 IEEE.

Funding

This study was supported by the National Natural Science Foundation of China (Grant 62307010), Philosophy and Social Science Planning Project of Guangdong Province of China (Grant GD24XJY17), Research Grants Council of the Hong Kong Special Administrative Region, China (Grant R1015-23), Faculty Research Grant (SDS24A8), the Direct Grant of Lingnan University, Hong Kong, (DR25E8), and the 2023 Nanjing International/ Hong Kong, Macao, and Taiwan Science and Technology CooperationProgram(jointresearch)(Grant202308010).

Fingerprint

Dive into the research topics of 'Leveraging ChatGPT-based augmentation and contrastive learning for Chinese MOOC sentiment analysis'. Together they form a unique fingerprint.

Cite this