Exploring Advanced Deep Learning, foundation and Hybrid models for Medical Image Classification

dc.contributor.advisorCarneiro, Gustavo
dc.contributor.authorKutbi, Jad
dc.date.accessioned2024-12-10T06:04:19Z
dc.date.issued2024-09
dc.description.abstractThis dissertation explores the use of advanced deep learning architectures, foundation models, and hybrid models for medical image classification. Medical imaging plays a critical role in the healthcare industry, and deep learning models have demonstrated significant potential in improving the accuracy and efficiency of diagnostic processes. This work focuses on three datasets: RetinaMNIST, BreastMNIST, and FractureMNIST3D from the MedMNISTv2 datasets, each representing different imaging modalities and classification tasks. The significance of this work lies in its comprehensive evaluation of state-of-the-art models, including ResNet, Vision Transformers (ViT), ConvNeXt, and Swin Transformers, and their effectiveness in handling complex medical images. The primary contributions of this research are the implementation and benchmarking of modern architectures on these datasets, as well as the investigation of hyperparameter optimization using Optuna. Pretrained models and hybrid architectures such as CNN-ViT, SwinConvNeXt and CNN-LSTM were explored to enhance performance. Key results demonstrate that models like ConvNeXt-tiny (pretrained) and CLIP achieved high accuracy and AUC scores, particularly in BreastMNIST and RetinaMNIST datasets, setting new performance benchmarks. The combination of Swin and ConvNeXt using feature fusion was shown to improve model robustness, especially when handling multi-class and 3D data.
dc.format.extent48
dc.identifier.urihttps://hdl.handle.net/20.500.14154/74073
dc.language.isoen
dc.publisherUniversity of Surrey
dc.subjectMedical image classification
dc.subjectViT
dc.subjectCLIP
dc.subjectFine-tuning
dc.subjectCNN
dc.subjectLSTM
dc.subjectConvNext
dc.subjectSwin
dc.subjectOptuna
dc.subjectRetinaMNIST
dc.subjectBreastMNIST
dc.subjectFractureMNIST3D
dc.subjectMedMNIST
dc.titleExploring Advanced Deep Learning, foundation and Hybrid models for Medical Image Classification
dc.typeThesis
sdl.degree.departmentSchool of Computer Science and Electrical and Electronic Engineering
sdl.degree.disciplineMedical Image Analysis
sdl.degree.grantorUniversity of Surrey
sdl.degree.nameMaster of Science Computer Vision, Robotics and Machine Learning

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
SACM-Dissertation.pdf
Size:
949.75 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.61 KB
Format:
Item-specific license agreed to upon submission
Description:

Copyright owned by the Saudi Digital Library (SDL) © 2024