Evaluating CAMeL-BERT for Sentiment Analysis of Customer Satisfaction with STC (Saudi Telecom Company) Services

No Thumbnail Available

Date

2024-08-15

Journal Title

Journal ISSN

Volume Title

Publisher

The University of Sussex

Abstract

In the age of informatics platforms such as Twitter (X) plays a crucial role for measuring public sentiment, especially in both private and public sectors. This study explores the application of machine learning, particularly deep learning, to perform sentiment analysis on tweets about Saudi Telecom Company (STC) services in Saudi Arabia. A comparative analysis was conducted between pre-trained sentiment analysis models in English and in Arabic to assess their effectiveness in classifying sentiments. In addition, the study highlights a challenge in existing Arabic models, which are based on English model architectures but trained on varied datasets, such as Modern Standard Arabic and Classical Arabic (Al-Fus’ha). These models often lack the capability to handle the diverse Arabic dialects commonly used on social media. To overcome this issue, the study involved fine-tuning a pre-trained Arabic model using a dataset of tweets related to STC services, specifically focusing on the Saudi dialect. Data was collected from Twitter (X), focusing on mentions of the Saudi Telecom Company (STC). Both English and Arabic models were applied to this data, and their performance in sentiment analysis was evaluated. The fine-tuned Arabic model (CAMeL-BERT) demonstrated improved accuracy and a better understanding of local dialects compared to its initial version. The results highlight the importance of model adaptation for specific languages and contexts and underline the potential of CAMeL-BERT in sentiment analysis for Arabic-language content. The findings offer practical implications for enhancing customer service and engagement through more accurate sentiment analysis of social media content in the service providers sector.

Description

This thesis explores the use of machine learning, particularly deep learning, to analyze public sentiment about Saudi Telecom Company (STC) services on Twitter (X). A comparative study was conducted between pre-trained sentiment analysis models in English and Arabic, highlighting the challenges Arabic models face due to their reliance on diverse datasets like Modern Standard Arabic and Classical Arabic, which may not capture dialectal variations. The study addresses this by fine-tuning the CAMeL-BERT model on tweets specific to the Saudi dialect. Results show that the fine-tuned model offers improved accuracy in sentiment classification, demonstrating the importance of adapting models to specific dialects and contexts. This research has practical applications for enhancing customer service through better sentiment analysis of social media content.

Keywords

sentiment, data science, CAMeL-BERT, NLP, machine learning, AI, Saudi dialect, Customer Satisfaction, BERT

Citation

Endorsement

Review

Supplemented By

Referenced By

Copyright owned by the Saudi Digital Library (SDL) © 2024