A Generative Adversarial Network-Based Approach for Imbalanced Network Traffic Classification Datasets

dc.contributor.advisorFirdaus, Sahran
dc.contributor.authorAlharbi, Amjaad Ayed
dc.date.accessioned2026-03-29T05:47:28Z
dc.date.issued2026
dc.description.abstractClass imbalance is a common issue in datasets, and it remains a critical challenge in network traffic classification (NTC), particularly as the number of applications and protocols continues to expand. This imbalance leads to biased machine learning models that favor majority classes while neglecting critical minority classes, reducing overall classification accuracy. Traditional approaches such as oversampling and undersampling struggle to generate diverse data while preserving essential traffic patterns. Consequently, there is a pressing need for improved solutions that enhance class representation without compromising the integrity of the original dataset. To address this issue, this study introduces Equal-GAN, a novel Generative Adversarial Network (GAN)-based approach designed to mitigate class imbalance in the Unicauca NTC Dataset, designing a tailored GAN architecture, and employing synthetic data generation to balance class distribution while retaining essential traffic characteristics. The study focuses on finding the limitations with current imbalance methods in NTC, creating the Equal-GAN algorithm to generate synthetic data, and testing its effectiveness with a classifier. The methodology involves pre-processing the dataset, designing a tailored GAN architecture, and employing synthetic data generation to balance class distribution while retaining essential traffic characteristics. The effectiveness of the Equal-GAN approach is assessed using statistical metrics and classification performance comparisons. Experimental results demonstrate that Equal-GAN significantly improves class representation and classification accuracy, using Random Forest Classifier, Equal-GAN approach attained an F1-score of 0.99 and an accuracy of 99.55% enhancing classification performance and outperforming traditional methods. These findings underscore the potential of GAN-based solutions to enhance NTC, paving the way for more reliable and fair network traffic analysis.
dc.format.extent82
dc.identifier.urihttps://hdl.handle.net/20.500.14154/78508
dc.language.isoen
dc.publisherSaudi Digital Library
dc.subjectClass Imbalance
dc.subjectNetwork Traffic Classification
dc.subjectGenerative Adversarial Network
dc.subjectSynthetic Data Generation
dc.titleA Generative Adversarial Network-Based Approach for Imbalanced Network Traffic Classification Datasets
dc.typeThesis
sdl.degree.departmentComputer Networks
sdl.degree.disciplineComputer Science
sdl.degree.grantorUniversity of Malaya
sdl.degree.nameMaster of Computer Science

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
SACM-Dissertation.pdf
Size:
2.18 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.61 KB
Format:
Item-specific license agreed to upon submission
Description:

Collections

Copyright owned by the Saudi Digital Library (SDL) © 2026