Developing a Generative AI Model to Enhance Sentiment Analysis for the Saudi Dialect

dc.contributor.advisorZhuang, Yu
dc.contributor.authorAftan, Sulaiman
dc.date.accessioned2024-12-05T07:53:01Z
dc.date.issued2024-12
dc.description.abstractSentiment Analysis (SA) is a fundamental task in Natural Language Processing (NLP) with broad applications across various real-world domains. While Arabic is a globally significant language with several well-developed NLP models for its standard form, achieving high performance in sentiment analysis for the Saudi Dialect (SD) remains challenging. A key factor contributing to this difficulty is inadequate SD datasets for training of NLP models. This study introduces a novel method for adapting a high-resource language model to a closely related but low-resource dialect by combining moderate effort in SD data collection with generative AI to address this problem of inadequacy in SD datasets. Then, AraBERT was fine-tuned using a combination of collected SD data and additional SD data generated by GPT. The results demonstrate a significant improvement in SD sentiment analysis performance compared to the AraBERT model, which is fine-tuned with only collected SD datasets. This approach highlights an efficient approach to generating high-quality datasets for fine-tuning a model trained on a high-resource language to perform well in a low-resource dialect. Leveraging generative AI enables reduced effort in data collection, making our approach a promising avenue for future research in low-resource NLP tasks.
dc.format.extent82
dc.identifier.urihttps://hdl.handle.net/20.500.14154/74020
dc.language.isoen_US
dc.publisherTexas Tech University
dc.subjectGenerative AI
dc.subjectSentiment Analysis
dc.subjectSaudi Dialect
dc.subjectNLP
dc.titleDeveloping a Generative AI Model to Enhance Sentiment Analysis for the Saudi Dialect
dc.typeThesis
sdl.degree.departmentComputer Science
sdl.degree.disciplineComputer Science
sdl.degree.grantorTexas Tech University
sdl.degree.nameDOCTOR OF PHILOSOPHY

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
SACM-Dissertation.pdf
Size:
1.5 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.61 KB
Format:
Item-specific license agreed to upon submission
Description:

Copyright owned by the Saudi Digital Library (SDL) © 2024