A hybrid model for opinion mining based on domain sentiment dictionary

Yi CAI*, Kai YANG, Dongping HUANG, Zikai ZHOU, Xue LEI, Haoran XIE, Tak Lam WONG

*Corresponding author for this work

Research output: Journal PublicationsJournal Article (refereed)peer-review

48 Citations (Scopus)

Abstract

Sentiment classification is an application of sentiment analysis, which is a popular research field in NLP. It can classify documents into different categories according to their sentiments. For a sentiment classification task, the first step is to extract sentimental features from documents, and then classify them using some classifiers. In the first step, a traditional way to extract sentimental features is to apply sentiment dictionaries. However, sentiment words may have different sentiment tendencies in different contexts, and traditional sentiment dictionaries does not consider this situation where wrong sentiment tendencies may be selected for sentiment words. In our research, we find that sentiment words will not have diverse meanings when they associate with the nearby aspects and entities in documents. Then, we propose a three layers sentiment dictionary, which can associate sentiment words with the corresponding entities and aspects together to reduce their multiple meanings. In the second step of the sentiment classification task, many classification models, such as SVM, GBDT, can be used to classify documents according to the extracted sentiment words. However, different classifiers have different weaknesses. A Stacking-based hybrid model is applied to combine SVM and GBDT together to overcome their weaknesses and reach higher performance. This hybrid model contains two layers, and the output of the first layer will become the input of the second layer. The first layer will generate different classification results according to different classifiers, while the second layer will automatically learn how to select a probable one as the final result. The experimental results show that our hybrid model outperforms the baseline single models.

Original languageEnglish
Pages (from-to)2131-2142
Number of pages12
JournalInternational Journal of Machine Learning and Cybernetics
Volume10
Issue number8
Early online date12 Dec 2017
DOIs
Publication statusPublished - 1 Aug 2019
Externally publishedYes

Funding

This work is supported by the Fundamental Research Funds for the Central Universities, SCUT (No. 2017ZD048), Tiptop Scientific and Technical Innovative Youth Talents of Guangdong special support program (No. 2015TQ01X633), Science and Technology Planning Project of Guangdong Province, China (No. 2017B050506004), Science and Technology Program of Guangzhou (International Science & Technology Cooperation Program No. 201704030076), and the Internal Research Grant (RG 66/2016-2017) and the Funding Support to ECS Proposal (RG 23/2017-2018R) of The Education University of Hong Kong.

Keywords

  • Hybrid model
  • Natural language processing
  • Opinion mining

Fingerprint

Dive into the research topics of 'A hybrid model for opinion mining based on domain sentiment dictionary'. Together they form a unique fingerprint.

Cite this