Optimizing Hate Text Detection using Custom NLP Techniques and an Adapted DeBERTa-based Machine Learning Model

Aljabbar, Abdullah

Optimizing Hate Text Detection using Custom NLP Techniques and an Adapted DeBERTa-based Machine Learning Model

dc.contributor.advisor	AlYamani, Abdulghani
dc.contributor.author	Aljabbar, Abdullah
dc.date.accessioned	2025-11-05T15:22:46Z
dc.date.issued	2025
dc.description.abstract	The rapid expansion of social media has transformed online communication, providing platforms for public debate and community engagement. However, this openness has also facilitated the spread of harmful content, particularly hate speech, which poses significant risks to individual well-being, social cohesion, and digital trust. Detecting such content remains a major challenge due to the subtle, context-dependent, and evolving nature of hateful expressions. Traditional machine learning models, though useful as early baselines, often fail to capture linguistic nuance and contextual depth. Recent advances in natural language processing (NLP), particularly Transformer-based architectures, have significantly improved text classification tasks by enabling context-sensitive embeddings. This research investigates the effectiveness of DeBERTa (Decoding-enhanced BERT with Disentangled Attention) for hate speech detection. The study employs a systematic methodology consisting of four stages: data preparation and preprocessing, exploratory data analysis, model development, and evaluation. A curated dataset of 2,041 social media posts, derived from a larger corpus, was pre-processed to remove noise, normalise text, and correct class imbalance. The DeBERTa-v3-large model was fine-tuned using cross-entropy loss and AdamW optimisation. Performance was assessed with accuracy, precision, recall, F1-score, ROC, and PR curves, while error analysis and confusion matrices were used to identify common misclassifications. The findings demonstrate that DeBERTa can effectively capture indirect meaning and grammar connections. Additionally, outperforming traditional approaches and offering robust classification of hate and non- hate content. The study contributes to both NLP research and the wider cybersecurity domain by supporting the development of more reliable automated moderation tools that promote safer digital environments.
dc.format.extent	42
dc.identifier.citation	Ali, H. (2025). Effectiveness of a cognitive behavioral program to reduce some psychological disorders among Saudi dependents on psychoactive substances. Journal of Psychological Studies,
dc.identifier.issn	apa 7
dc.identifier.uri	https://hdl.handle.net/20.500.14154/76877
dc.language.iso	en
dc.publisher	Saudi Digital Library
dc.subject	Hate Speech
dc.subject	DeBERTa
dc.subject	NLP
dc.subject	Transformers
dc.subject	Social Media
dc.subject	Cybersecurity
dc.subject	Classification
dc.title	Optimizing Hate Text Detection using Custom NLP Techniques and an Adapted DeBERTa-based Machine Learning Model
dc.title.alternative	How can recycled waste hair from Hajj and Umrah in Saudi Arabia be employed in the beauty industry
dc.type	Thesis
sdl.degree.department	Computing
sdl.degree.discipline	Cybersecurity
sdl.degree.grantor	De Montfort
sdl.degree.name	Master of Cybersecurity

Files

Original bundle

Now showing 1 - 1 of 1

Name:: SACM-Dissertation.pdf
Size:: 1018.1 KB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.61 KB
Format:: Item-specific license agreed to upon submission
Description:

Download

Collections

SACM - United Kingdom