AI GENERATED TEXT VS. HUMAN GENERATED TEXT

dc.contributor.advisor: Misri, Kazhan
dc.contributor.author: Hadi, Nedaa
dc.date.accessioned: 2024-11-11T07:07:52Z
dc.date.issued: 2024-09
dc.description.abstract: The ability to distinguish between AI-generated and human-generated texts is becoming increasingly critical as AI technologies advance. This dissertation explores the development and evaluation of various machine learning models to accurately classify text as either AI-generated or human-generated. The research aims to identify the most effective classification techniques and preprocessing methods to enhance model performance and generalization across different text datasets. A range of machine learning and deep learning models, including Support Vector Machine (SVM), Random Forest, Logistic Regression, Decision Tree, BERT, and LSTM, were employed to evaluate their effectiveness in distinguishing between the two types of texts. The study utilized a balanced and representative dataset through data sampling and augmentation techniques. Key preprocessing steps were implemented to refine the input data, and hyperparameter tuning was conducted to optimize model performance. The generalization capabilities of the models were further tested on additional datasets with varying text characteristics. The findings revealed that SVM and Random Forest models achieved the highest accuracy and reliability in classifying texts, demonstrating strong performance across multiple evaluation metrics. In contrast, deep learning models like BERT and LSTM were less effective under the given conditions, suggesting a need for more extensive datasets and computational resources to leverage their full potential. These results highlight the strengths and limitations of different approaches to text classification, providing a foundation for future research to enhance AI detection in diverse applications.
dc.format.extent: 46
dc.identifier.uri: https://hdl.handle.net/20.500.14154/73548
dc.language.iso: en_US
dc.publisher: University of East Anglia
dc.subject: Artificial intelligence
dc.subject: AI
dc.subject: Data Mining
dc.subject: Text Classification
dc.subject: Human Text
dc.subject: AI Text
dc.title: AI GENERATED TEXT VS. HUMAN GENERATED TEXT
dc.type: Thesis
sdl.degree.department: Computer Science
sdl.degree.discipline: Data Science
sdl.degree.grantor: University of East Anglia
sdl.degree.name: Master of Science

Files

Original bundle

Name: SACM-Dissertation.pdf
Size: 5.51 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.61 KB
Format: Item-specific license agreed to upon submission

Copyright owned by the Saudi Digital Library (SDL) © 2024