Cross Dataset Fairness Evaluation of Transformer Based Sentiment Models

Zuiran, Sara

Cross Dataset Fairness Evaluation of Transformer Based Sentiment Models

dc.contributor.advisor	Bhattacharyya, Siddhartha
dc.contributor.author	Zuiran, Sara
dc.date.accessioned	2025-07-22T06:56:47Z
dc.date.issued	2025-05-10
dc.description.abstract	With the growing exploration of Natural Language Processing (NLP) systems in decision-making environments, it is essential to evaluate technical and ethical aspects of the dataset and the NLP model to improve fairness. To assess fairness, the thesis examines demographic imbalances in sentiment classification models by evaluating transformer-based models fine-tuned on the Stanford Sentiment Treebank version 2 dataset (SST-2) against the demographically annotated Comprehensive Assessment of Language Model dataset (CALM). This work identifies performance disparities in sentiment prediction across demographic groups by examining sensitive attributes such as gender and race. The study evaluates both the RoBERTa and MentalBERT transformer models using a complete set of fairness metrics consisting of Statistical Parity Difference (SPD), Equal Opportunity Difference (EOD), False Positive Rates (FPR), False Negative Rates (FNR), Jensen-Shannon Divergence (JSD), and Wasserstein Distance (WD). The analysis examines both group-vs-rest and pairwise subgroup comparisons, including gender and ethnicity. Results show that applying adversarial mitigation reduced fairness disparities across demographic subgroups, with the most notable improvements observed for non-binary and Asian users. The observed disparities emphasize the challenge of reducing performance gaps across demographic subgroups in sentiment classification tasks. The thesis introduces a practical framework for evaluating demographic dis- disparities, extends fairness analysis, and assesses the impact of mitigation techniques in cross-dataset sentiment classification. This research proposes a framework that demonstrates a path toward creating inclusive NLP systems and establishes the groundwork for upcoming ethical Artificial Intelligence (AI) studies.
dc.format.extent	167
dc.identifier.uri	https://hdl.handle.net/20.500.14154/75931
dc.language.iso	en_US
dc.publisher	Saudi Digital Library
dc.subject	Natural Language Processing
dc.subject	Sentiment Analysis
dc.subject	Machine Learning
dc.subject	Fairness in AI
dc.subject	Bias Mitigation
dc.subject	Transformer Models
dc.subject	Demographic Bias
dc.subject	Social Bias in NLP
dc.subject	CALM dataset
dc.subject	SST-2 dataset
dc.title	Cross Dataset Fairness Evaluation of Transformer Based Sentiment Models
dc.type	Thesis
sdl.degree.department	Electrical Engineering and Computer Science
sdl.degree.discipline	Software Engineering
sdl.degree.grantor	Florida Institute of Technology
sdl.degree.name	Master of Science in Software Engineering

Files

Original bundle

Now showing 1 - 1 of 1

Name:: SACM-Dissertation.pdf
Size:: 4.73 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.61 KB
Format:: Item-specific license agreed to upon submission
Description:

Download

Collections

SACM - United States of America