Optimizing Hate Text Detection using Custom NLP Techniques and an Adapted DeBERTa-based Machine Learning Model

dc.contributor.advisorAlYamani, Abdulghani
dc.contributor.authorAljabbar, Abdullah
dc.date.accessioned2025-11-05T15:22:46Z
dc.date.issued2025
dc.description.abstractThe rapid expansion of social media has transformed online communication, providing platforms for public debate and community engagement. However, this openness has also facilitated the spread of harmful content, particularly hate speech, which poses significant risks to individual well-being, social cohesion, and digital trust. Detecting such content remains a major challenge due to the subtle, context-dependent, and evolving nature of hateful expressions. Traditional machine learning models, though useful as early baselines, often fail to capture linguistic nuance and contextual depth. Recent advances in natural language processing (NLP), particularly Transformer-based architectures, have significantly improved text classification tasks by enabling context-sensitive embeddings. This research investigates the effectiveness of DeBERTa (Decoding-enhanced BERT with Disentangled Attention) for hate speech detection. The study employs a systematic methodology consisting of four stages: data preparation and preprocessing, exploratory data analysis, model development, and evaluation. A curated dataset of 2,041 social media posts, derived from a larger corpus, was pre-processed to remove noise, normalise text, and correct class imbalance. The DeBERTa-v3-large model was fine-tuned using cross-entropy loss and AdamW optimisation. Performance was assessed with accuracy, precision, recall, F1-score, ROC, and PR curves, while error analysis and confusion matrices were used to identify common misclassifications. The findings demonstrate that DeBERTa can effectively capture indirect meaning and grammar connections. Additionally, outperforming traditional approaches and offering robust classification of hate and non- hate content. The study contributes to both NLP research and the wider cybersecurity domain by supporting the development of more reliable automated moderation tools that promote safer digital environments.
dc.format.extent42
dc.identifier.citationAli, H. (2025). Effectiveness of a cognitive behavioral program to reduce some psychological disorders among Saudi dependents on psychoactive substances. Journal of Psychological Studies,
dc.identifier.issnapa 7
dc.identifier.urihttps://hdl.handle.net/20.500.14154/76877
dc.language.isoen
dc.publisherSaudi Digital Library
dc.subjectHate Speech
dc.subjectDeBERTa
dc.subjectNLP
dc.subjectTransformers
dc.subjectSocial Media
dc.subjectCybersecurity
dc.subjectClassification
dc.titleOptimizing Hate Text Detection using Custom NLP Techniques and an Adapted DeBERTa-based Machine Learning Model
dc.title.alternativeHow can recycled waste hair from Hajj and Umrah in Saudi Arabia be employed in the beauty industry
dc.typeThesis
sdl.degree.departmentComputing
sdl.degree.disciplineCybersecurity
sdl.degree.grantorDe Montfort
sdl.degree.nameMaster of Cybersecurity

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
SACM-Dissertation.pdf
Size:
1018.1 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.61 KB
Format:
Item-specific license agreed to upon submission
Description:

Copyright owned by the Saudi Digital Library (SDL) © 2026