Evaluating CAMeL-BERT for Sentiment Analysis of Customer Satisfaction with STC (Saudi Telecom Company) Services
dc.contributor.advisor | Pay, Jack | |
dc.contributor.author | Alotaibi, Fahad | |
dc.date.accessioned | 2024-10-29T11:15:38Z | |
dc.date.issued | 2024-08-15 | |
dc.description | This thesis explores the use of machine learning, particularly deep learning, to analyze public sentiment about Saudi Telecom Company (STC) services on Twitter (X). A comparative study was conducted between pre-trained sentiment analysis models in English and Arabic, highlighting the challenges Arabic models face due to their reliance on diverse datasets like Modern Standard Arabic and Classical Arabic, which may not capture dialectal variations. The study addresses this by fine-tuning the CAMeL-BERT model on tweets specific to the Saudi dialect. Results show that the fine-tuned model offers improved accuracy in sentiment classification, demonstrating the importance of adapting models to specific dialects and contexts. This research has practical applications for enhancing customer service through better sentiment analysis of social media content. | |
dc.description.abstract | In the age of informatics platforms such as Twitter (X) plays a crucial role for measuring public sentiment, especially in both private and public sectors. This study explores the application of machine learning, particularly deep learning, to perform sentiment analysis on tweets about Saudi Telecom Company (STC) services in Saudi Arabia. A comparative analysis was conducted between pre-trained sentiment analysis models in English and in Arabic to assess their effectiveness in classifying sentiments. In addition, the study highlights a challenge in existing Arabic models, which are based on English model architectures but trained on varied datasets, such as Modern Standard Arabic and Classical Arabic (Al-Fus’ha). These models often lack the capability to handle the diverse Arabic dialects commonly used on social media. To overcome this issue, the study involved fine-tuning a pre-trained Arabic model using a dataset of tweets related to STC services, specifically focusing on the Saudi dialect. Data was collected from Twitter (X), focusing on mentions of the Saudi Telecom Company (STC). Both English and Arabic models were applied to this data, and their performance in sentiment analysis was evaluated. The fine-tuned Arabic model (CAMeL-BERT) demonstrated improved accuracy and a better understanding of local dialects compared to its initial version. The results highlight the importance of model adaptation for specific languages and contexts and underline the potential of CAMeL-BERT in sentiment analysis for Arabic-language content. The findings offer practical implications for enhancing customer service and engagement through more accurate sentiment analysis of social media content in the service providers sector. | |
dc.format.extent | 37 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14154/73360 | |
dc.language.iso | en | |
dc.publisher | The University of Sussex | |
dc.subject | sentiment | |
dc.subject | data science | |
dc.subject | CAMeL-BERT | |
dc.subject | NLP | |
dc.subject | machine learning | |
dc.subject | AI | |
dc.subject | Saudi dialect | |
dc.subject | Customer Satisfaction | |
dc.subject | BERT | |
dc.title | Evaluating CAMeL-BERT for Sentiment Analysis of Customer Satisfaction with STC (Saudi Telecom Company) Services | |
dc.type | Thesis | |
sdl.degree.department | School of Mathematical and Physical Sciences | |
sdl.degree.discipline | Data Science | |
sdl.degree.grantor | The University of Sussex | |
sdl.degree.name | MSc in Data Science |