AI GENERATED TEXT VS. HUMAN GENERATED TEXT

Hadi, Nedaa

dc.contributor.advisor	Misri, Kazhan
dc.contributor.author	Hadi, Nedaa
dc.date.accessioned	2024-11-11T07:07:52Z
dc.date.issued	2024-09
dc.description.abstract	The ability to distinguish between AI-generated and human-generated texts is becoming increasingly critical as AI technologies advance. This dissertation explores the development and evaluation of various machine learning models to accurately classify text as either AI-generated or human-generated. The research aims to identify the most effective classification techniques and preprocessing methods to enhance model performance and generalization across different text datasets. A range of machine learning and deep learning models, including Support Vector Machine (SVM), Random Forest, Logistic Regression, Decision Tree, BERT, and LSTM, were employed to evaluate their effectiveness in distinguishing between the two types of texts. The study utilized a balanced and representative dataset through data sampling and augmentation techniques. Key preprocessing steps were implemented to refine the input data, and hyperparameter tuning was conducted to optimize model performance. The generalization capabilities of the models were further tested on additional datasets with varying text characteristics. The findings revealed that SVM and Random Forest models achieved the highest accuracy and reliability in classifying texts, demonstrating strong performance across multiple evaluation metrics. In contrast, deep learning models like BERT and LSTM were less effective under the given conditions, suggesting a need for more extensive datasets and computational resources to leverage their full potential. These results highlight the strengths and limitations of different approaches to text classification, providing a foundation for future research to enhance AI detection in diverse applications.
dc.format.extent	46
dc.identifier.uri	https://hdl.handle.net/20.500.14154/73548
dc.language.iso	en_US
dc.publisher	University of East Anglia
dc.subject	Artificial intelligence
dc.subject	AI
dc.subject	Data Mining
dc.subject	Text Classification
dc.subject	Human Text
dc.subject	AI Text
dc.title	AI GENERATED TEXT VS. HUMAN GENERATED TEXT
dc.type	Thesis
sdl.degree.department	Computer Science
sdl.degree.discipline	Data Science
sdl.degree.grantor	University of East Anglia
sdl.degree.name	Master of Science

Files

Original bundle

Name: SACM-Dissertation.pdf
Size: 5.51 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.61 KB
Format: Item-specific license agreed to upon submission

Collections

SACM - United Kingdom