Exploring Advanced Deep Learning, foundation and Hybrid models for Medical Image Classification
dc.contributor.advisor | Carneiro, Gustavo | |
dc.contributor.author | Kutbi, Jad | |
dc.date.accessioned | 2024-12-10T06:04:19Z | |
dc.date.issued | 2024-09 | |
dc.description.abstract | This dissertation explores the use of advanced deep learning architectures, foundation models, and hybrid models for medical image classification. Medical imaging plays a critical role in healthcare, and deep learning models have shown significant potential to improve the accuracy and efficiency of diagnosis. This work focuses on three datasets from the MedMNIST v2 collection: RetinaMNIST, BreastMNIST, and FractureMNIST3D, each representing a different imaging modality and classification task. Its significance lies in a comprehensive evaluation of state-of-the-art models, including ResNet, Vision Transformers (ViT), ConvNeXt, and Swin Transformers, and of their effectiveness in handling complex medical images. The primary contributions are the implementation and benchmarking of modern architectures on these datasets and an investigation of hyperparameter optimization using Optuna. Pretrained models and hybrid architectures such as CNN-ViT, SwinConvNeXt, and CNN-LSTM were explored to enhance performance. Key results show that models such as pretrained ConvNeXt-tiny and CLIP achieved high accuracy and AUC scores, particularly on the BreastMNIST and RetinaMNIST datasets, setting new performance benchmarks. Combining Swin and ConvNeXt through feature fusion was shown to improve model robustness, especially on multi-class and 3D data. | |
dc.format.extent | 48 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14154/74073 | |
dc.language.iso | en | |
dc.publisher | University of Surrey | |
dc.subject | Medical image classification | |
dc.subject | ViT | |
dc.subject | CLIP | |
dc.subject | Fine-tuning | |
dc.subject | CNN | |
dc.subject | LSTM | |
dc.subject | ConvNeXt | |
dc.subject | Swin | |
dc.subject | Optuna | |
dc.subject | RetinaMNIST | |
dc.subject | BreastMNIST | |
dc.subject | FractureMNIST3D | |
dc.subject | MedMNIST | |
dc.title | Exploring Advanced Deep Learning, foundation and Hybrid models for Medical Image Classification | |
dc.type | Thesis | |
sdl.degree.department | School of Computer Science and Electrical and Electronic Engineering | |
sdl.degree.discipline | Medical Image Analysis | |
sdl.degree.grantor | University of Surrey | |
sdl.degree.name | Master of Science Computer Vision, Robotics and Machine Learning |