ADVANCED LARGE LANGUAGE MODEL APPROACHES AND NATURAL LANGUAGE PROCESSING TECHNIQUES TO IMPROVE HATE SPEECH DETECTION USING AI

Almohaimeed, Saad

ADVANCED LARGE LANGUAGE MODEL APPROACHES AND NATURAL LANGUAGE PROCESSING TECHNIQUES TO IMPROVE HATE SPEECH DETECTION USING AI

dc.contributor.advisor	Boloni, Ladislau
dc.contributor.author	Almohaimeed, Saad
dc.date.accessioned	2025-05-12T06:53:43Z
dc.date.issued	2025
dc.description.abstract	The proliferation of hate speech on social networks can create a significant negative social effect, making the development of AI-based classifiers that can identify and characterize different types of hateful speech in messages highly important for stakeholders. While this is a highly challenging problem, recent advances in language models promise to advance the state of the art such that even subtle and indirect forms of hate speech can be detected. In this dissertation we present a series of contributions that improve different aspects of hate speech classification. We developed THOS, a hate speech dataset consisting of 8.3k tweets. Compared to previous datasets, THOS contains fine-grained labels that identify not only whether a tweet is offensive or hateful, but also the target of the hate. Using this dataset, we studied the degree to which finer grained classification of tweets is feasible. In the follow-up work, we focus on the difficult problem of implicit hate speech, where hate is conveyed through subtle verbal constructs and allusions, without the use of explicitly offensive terms. We evaluate the efficacy of lexicon-based methods, transfer learning, and advanced LLMs such as GPT-4 on this problem. We found that the proposed techniques can boost the detection performance of implicit hate, although even advanced models often struggle with certain interpretations. In our third contribution, we introduce the Closest Positive Cluster (CPC) auxiliary loss, which improves the generalizability of classifiers across a wide range of datasets, resulting in enhanced performance for both explicit and implicit hate speech scenarios. Finally, given the scarcity of implicit hate speech datasets and the abundance of explicit hate datasets, we proposed an approach to generalize explicit hate datasets for the classification of implicit hate speech. Additionally, the proposed approach addresses noisy label correction issues commonly found in crowd-sourced datasets. Our method comprises three key components: influential sample identification, reannotation, and augmentation. We show that the approach improves generalization across datasets and enhances implicit hate classification.
dc.format.extent	97
dc.identifier.uri	https://hdl.handle.net/20.500.14154/75360
dc.language.iso	en_US
dc.publisher	University of Central Florida
dc.subject	artificial intelligence
dc.subject	natural language processing
dc.subject	llm
dc.subject	large language model
dc.subject	hate speech detection
dc.subject	implicit hate
dc.subject	hate speech classifiers
dc.subject	ai
dc.subject	nlp
dc.subject	LLM
dc.title	ADVANCED LARGE LANGUAGE MODEL APPROACHES AND NATURAL LANGUAGE PROCESSING TECHNIQUES TO IMPROVE HATE SPEECH DETECTION USING AI
dc.type	Thesis
sdl.degree.department	Computer Science
sdl.degree.discipline	Computer Science
sdl.degree.grantor	University of Central Florida
sdl.degree.name	Doctor of Philosophy

Files

Original bundle

Now showing 1 - 1 of 1

Name:: SACM-Dissertation.pdf
Size:: 3.92 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.61 KB
Format:: Item-specific license agreed to upon submission
Description:

Download

Collections

SACM - United States of America