ADVANCED LARGE LANGUAGE MODEL APPROACHES AND NATURAL LANGUAGE PROCESSING TECHNIQUES TO IMPROVE HATE SPEECH DETECTION USING AI

dc.contributor.advisorBoloni, Ladislau
dc.contributor.authorAlmohaimeed, Saad
dc.date.accessioned2025-05-12T06:53:43Z
dc.date.issued2025
dc.description.abstractThe proliferation of hate speech on social networks can create a significant negative social effect, making the development of AI-based classifiers that can identify and characterize different types of hateful speech in messages highly important for stakeholders. While this is a highly challenging problem, recent advances in language models promise to advance the state of the art such that even subtle and indirect forms of hate speech can be detected. In this dissertation we present a series of contributions that improve different aspects of hate speech classification. We developed THOS, a hate speech dataset consisting of 8.3k tweets. Compared to previous datasets, THOS contains fine-grained labels that identify not only whether a tweet is offensive or hateful, but also the target of the hate. Using this dataset, we studied the degree to which finer grained classification of tweets is feasible. In the follow-up work, we focus on the difficult problem of implicit hate speech, where hate is conveyed through subtle verbal constructs and allusions, without the use of explicitly offensive terms. We evaluate the efficacy of lexicon-based methods, transfer learning, and advanced LLMs such as GPT-4 on this problem. We found that the proposed techniques can boost the detection performance of implicit hate, although even advanced models often struggle with certain interpretations. In our third contribution, we introduce the Closest Positive Cluster (CPC) auxiliary loss, which improves the generalizability of classifiers across a wide range of datasets, resulting in enhanced performance for both explicit and implicit hate speech scenarios. Finally, given the scarcity of implicit hate speech datasets and the abundance of explicit hate datasets, we proposed an approach to generalize explicit hate datasets for the classification of implicit hate speech. Additionally, the proposed approach addresses noisy label correction issues commonly found in crowd-sourced datasets. Our method comprises three key components: influential sample identification, reannotation, and augmentation. We show that the approach improves generalization across datasets and enhances implicit hate classification.
dc.format.extent97
dc.identifier.urihttps://hdl.handle.net/20.500.14154/75360
dc.language.isoen_US
dc.publisherUniversity of Central Florida
dc.subjectartificial intelligence
dc.subjectnatural language processing
dc.subjectllm
dc.subjectlarge language model
dc.subjecthate speech detection
dc.subjectimplicit hate
dc.subjecthate speech classifiers
dc.subjectai
dc.subjectnlp
dc.subjectLLM
dc.titleADVANCED LARGE LANGUAGE MODEL APPROACHES AND NATURAL LANGUAGE PROCESSING TECHNIQUES TO IMPROVE HATE SPEECH DETECTION USING AI
dc.typeThesis
sdl.degree.departmentComputer Science
sdl.degree.disciplineComputer Science
sdl.degree.grantorUniversity of Central Florida
sdl.degree.nameDoctor of Philosophy

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
SACM-Dissertation.pdf
Size:
3.92 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.61 KB
Format:
Item-specific license agreed to upon submission
Description:

Copyright owned by the Saudi Digital Library (SDL) © 2025