Semi-supervised learning for sentiment classification with ensemble multi-classifier approach

Aribowo, Agus Sasmito and Basiron, Halizah and Abd Yusof, Noor Fazilla (2022) Semi-supervised learning for sentiment classification with ensemble multi-classifier approach. International Journal of Advances in Intelligent Informatics, 8 (3). pp. 349-361. ISSN 2442-6571

[img] Text
AGUS-2022.PDF

Download (477kB)

Abstract

Supervised sentiment analysis ideally uses a fully labeled data set for modeling. However, this ideal condition requires a struggle in the label annotation process. Semi-supervised learning (SSL) has emerged as a promising method to avoid time-consuming and expensive data labeling without reducing model performance. However, the research on SSL is still limited and its performance needs to be improved. Thus, this study aims to create a new SSL-Model for sentiment analysis. The Ensemble Classifier SSL model for sentiment classification is introduced. The research went through pre-processing, vectorization, and feature extraction using TF-IDF and n-grams. Support Vector Machine (SVM) or Random Forest for tokenization was used to separate unigram, bigram, and trigram in model generation. Then, the outputs of these models were combined using stacking ensemble approach. Accuracy and F1-score were used for the evaluation. IMDB datasets and US Airlines were used to test the new SSL models. The conclusion is that the sentiment annotation accuracy is highly dependent on the suitability of the dataset with the machine learning algorithm. In IMDB dataset, which consists of two classes, it is better to use SVM. In the US Airlines consisting of three classes, SVM is better at improving the model performance against the baseline, but RF is better at achieving the baseline performance even though it fails to maintain the model performance.

Item Type: Article
Uncontrolled Keywords: Ensemble Multi-classifier, Semi-supervised, Sentiment analysis, SVM, Random forest
Divisions: Faculty of Information and Communication Technology
Depositing User: mr eiisaa ahyead
Date Deposited: 22 Mar 2023 16:56
Last Modified: 22 Mar 2023 16:56
URI: http://eprints.utem.edu.my/id/eprint/26587
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item