Subjectivity analysis of an enhanced feature set for code-switching text

Basiron, Halizah and Kasmuri, Emaliana (2024) Subjectivity analysis of an enhanced feature set for code-switching text. International Journal Of Advanced Computer Science And Applications(Ijacsa),Volume 9 Issue 6,2018., 15 (9). pp. 450-460. ISSN 2158-107X

[img] Text
00979231020241027201215.pdf
Available under License Creative Commons Attribution.

Download (824kB)

Abstract

The phenomenon of code-switching has posed a new challenge to the linguistic computing area. Conventionally, the computer will process monolingual text or multilingual text. However, code-switching is different from this kind of text. Two or more languages are used to construct a piece of code-switching text, particularly a code-switching sentence. It is challenging for the computer to process a piece of code-switching text with languages that exist simultaneously. The challenge is more intense for the computer in subjectivity analysis, where the computer should distinguish subjective from objective code-switching text. This paper proposed three feature sets for subjectivity analysis on Malay-English code-switching text: Embedded Code-Switching Feature Sets, Unified Code-Switching Feature Sets, and Stylistic Feature Sets. These feature sets were enhanced from the monolingual feature set of subjectivity analysis. Experiments were conducted using the data harvested from Malay-English blogs. These data were labelled as either subjective or objective. Two machine learning classifiers – the Support Vector Machine (SVM) and Naive-Bayes, were used to evaluate the classification performance of the proposed feature sets. The experiments were carried out on individual feature sets and the combination of them. The results show the classification performance from combining the unified and stylistic feature sets surpassed other proposed feature sets at 59% accuracy. Therefore, it is concluded that the combination of unified and stylistic feature sets is necessary for the subjectivity analysis of Malay-English code-switching text.

Item Type: Article
Uncontrolled Keywords: Subjectivity analysis, Code-switching, Enhanced feature sets, Malay-English text
Divisions: Faculty of Information and Communication Technology
Depositing User: Norfaradilla Idayu Ab. Ghafar
Date Deposited: 11 Dec 2025 02:47
Last Modified: 11 Dec 2025 02:47
URI: http://eprints.utem.edu.my/id/eprint/29200
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item