SACM - United Kingdom

Permanent URI for this collectionhttps://drepo.sdl.edu.sa/handle/20.500.14154/9667

Browse

Search Results

Now showing 1 - 10 of 43
  • ItemRestricted
    Mitigating Distribution Shift and Label Noise in Deep Neural Network Based Network Intrusion Detection Systems
    (Saudi Digital Library, 2026) Alotaibi, Fahad Mastor T; Maffeis, Sergio
    Deep Neural Network (DNN)-based Network Intrusion Detection Systems (NIDS) have shown strong performance on offline benchmarks under controlled conditions. However, real-world deployment remains challenging. This thesis focuses on two key challenges: (i) distribution shift, in which benign traffic evolves and attackers continuously devise new strategies; and (ii) label noise, arising from imperfect automated labelling of large volumes of network traffic. To address these challenges, the thesis first provides a critical literature review and develops a structured taxonomy of techniques for handling distribution shift in NIDS and label noise in both NIDS and malware datasets. Building on these insights, it introduces three frameworks. Mateen adapts one-class anomaly detection to evolving benign traffic by combining selective labelling with an ensemble-based mechanism that identifies and responds to shifts with minimal manual effort. Rasd extends multi-class NIDS to detect and integrate newly emerging attack classes, substantially reducing labelling costs through strategic selection of a small, informative, and diverse subset. SLB mitigates label noise by partitioning the dataset into clean and noisy sets and iteratively refining both the model and the labels. Each framework is evaluated extensively across multiple datasets and compared with state-of-the-art baselines. Mateen improves the anomaly-detection F1 score by 4.13% under a light-shift scenario (CICIDS2017) and by 72.6% under an extreme-shift scenario (Kitsune). Rasd increases the novel class detection F1 score by 6.83% on CICIDS2017 and by 19.21% on CSE-CIC-IDS2018. SLB reduces the noise rate in CICIDS2017 (with 30% injected random noise) to below 1.2% and outperforms the vanilla baseline by 11.83% in macro F1. This thesis serves as a reference for researchers and practitioners in cyber security and artificial intelligence. Beyond its literature review and taxonomy, it contributes three frameworks that collectively enhance the robustness of DNN-based NIDS, achieving state-of-the-art results on the evaluated benchmarks.
    10 0
  • ItemRestricted
    Predicting Carbon Credit Prices Using Advanced Machine Learning Techniques
    (Saudi Digital Library, 2026) Rayan, Najdi; Wang, Hai
    Accurate forecasting of carbon credit prices supports risk management, investment decisions, and policy assessment in the context of climate action. EU ETS carbon prices exhibit volatility, non-linearity, and non-stationarity, which reduces the effectiveness of traditional forecasting models. This dissertation proposes and evaluates a three-stage hybrid machine learning model for one-day-ahead forecasting of EU Emissions Trading System (EU ETS) carbon prices. The architecture follows a divide-and-conquer strategy. First, Wavelet Packet Decomposition (WPD) decomposes the carbon price signal into multiple frequency components. Second, a Gated Recurrent Unit (GRU) network models temporal dependencies and forecasts the trend component. Third, an Extreme Gradient Boosting (XGBoost) model predicts and corrects the GRU residual errors using wavelet-derived detail components as input features. The model was trained and tested on a dataset covering January 2018 to December 2024. The dataset includes EU ETS carbon prices, Brent crude oil prices, and electricity prices, while the forecasting model is univariate and uses the carbon price series only. On an unseen test set of 510 days, the model achieved a Mean Absolute Percentage Error (MAPE) of 1.66%, a Root Mean Squared Error (RMSE) of 4.86 EUR/ton, and a Mean Absolute Error (MAE) of 4.41 EUR/ton. The results indicate that combining signal decomposition, deep learning, and gradient boosting provides stable forecasting performance for EU ETS carbon prices under realistic evaluation conditions.
    13 0
  • ItemRestricted
    Analysing Large-Scale Attacks in IoT Environments using ML/DL
    (Saudi Digital Library, 2025) Bokhari, Mohammed Ibrahim K; Neetesh, Sexena
    The fine-grained classification of malicious network traffic presents a significant and persistent challenge in cybersecurity, primarily due to the extreme class imbalance inherent in real-world network data. Conventional machine learning approaches, which apply a single, unitary model to the problem, have demonstrated limited success, often failing to effectively identify rare but critical minority attack classes. This dissertation argues that the conventional model paradigm is fundamentally flawed for this problem space and proposes a hierarchical, multi-stage classification framework as a more robust alternative. This research presents a comprehensive, multi-faceted investigation into this problem, using the 34-class CICIoT2023 dataset as a benchmark. The study was conducted across four distinct experimental paths, comparing two ensemble methods (XGBoost and Random Forest) and two class-handling strategies (a "Grouped" approach that manually merges similar classes and an "Ungrouped" approach that tackles all 34 classes directly). Within this structure, we designed and implemented a 4-tier hierarchical framework that employs a "divide and conquer" strategy, using an initial classifier to handle majority traffic and a class-level routing mechanism to delegate ambiguous samples to specialised recovery tiers. An adaptive resampling strategy was deployed within these tiers, concentrating aggressive SMOTE only where required. The empirical results provide a holistic validation of the proposed architecture. The optimal configuration—an Ungrouped, XGBoost-led hierarchical framework—achieved a final accuracy of 0.9228 and Macro-F1 score of 0.7948, a substantial improvement over all other experimental paths and conventional baselines. More significantly, this approach demonstrated a more than 800% increase in the F1-score for some of the under-represented minority classes. The analysis also revealed a key architectural principle: classifier performance is role-dependent, with different ensemble methods excelling in different roles within the hierarchy, highlighting the importance of managing the bias-variance trade-off at a systemic level. Finally, this work provides a rigorous, data-centric analysis that distinguishes between model limitations and the inherent limitations of the dataset, identifying a "dataset-induced ceiling" on performance for 5 of the 34 classes. The primary contribution of this dissertation is, therefore, a methodologically robust and architecturally novel framework, validated through a comprehensive, multi-path experimental design. The principles of hierarchical decomposition and adaptive resource allocation are domain-agnostic and offer a promising direction for future research into extreme imbalance problems.
    10 0
  • ItemRestricted
    Explainable AI for Biometric Brain Modeling: 3D Anatomical Analysis Across Gender and Aging
    (Saudi Digital Library, 2025) Alghamdi, Ryan; Jiang, Richard
    This thesis investigates the use of deep learning models for predicting sex and age from structural MRI data, with a focus on interpretability. Three architectures were trained: a ResNet, a 3D CNN, and an Ensemble model. The Automated Anatomical Labeling (AAL) atlas was used to parcellate the brain into 116 regions, enabling a region based occlusion framework. Two complementary approaches were applied: inverse occlusion, where predictions rely on a single active region, and normal occlusion, where one region is masked while the remainder of the brain is preserved. All models achieved high training accuracies (95–99%), but their performance dropped notably on held out 70/15/15 splits, reflecting overfitting and emphasizing the importance of split evaluation. Inverse occlusion consistently identified plausible neuroanatomical markers, including the cerebellar Crus I, calcarine cortex, precentral and postcentral gyri, precuneus, and inferior temporal lobe. In contrast, normal occlusion produced flat or inconsistent results, suggesting reliance on global artifacts or scanner fingerprints rather than region specific features. These findings show that region only occlusion provides more reliable insights into localized brain structure differences than conventional occlusion. Key limitations include dependence on preprocessing pipelines, restricted dataset size, computational demands, and reliance on atlas based parcellation. Despite these, the framework demonstrates a reproducible method for evaluating the regional basis of deep learning predictions in neuroimaging. Future work should expand dataset coverage, refine preprocessing, and extend occlusion analysis to combinations of regions to capture network level effects. This work contributes a regional explainability framework to improve the interpretability and reliability of deep learning in brain imaging.
    31 0
  • ItemRestricted
    Advances in Artificial Intelligence for Energy Forecasting and Performance Management in Buildings
    (Saudi Digital Library, 2026) Alkhatani, Nasser; Petri, Ioan
    Accurate energy forecasting is essential for intelligent building management, supporting operational optimisation, strategic planning, and demand-side flexibility. However, existing forecasting methods often struggle to remain accurate across multiple time horizons and to generalise across different building types with limited data. This thesis addresses these challenges by developing a modular modelling framework that advances both multi-horizon forecasting and cross-building adaptability. The first contribution is a hybrid forecasting model (SVR → XGBoost → LSTM) designed to deliver stable prediction performance across four horizons: 24 hours, one week, one month, and one year. The hybrid design leverages the complementary strengths of its components SVR for noise reduction, XGBoost for nonlinear feature learning, and LSTM for long-range temporal modelling resulting in improved robustness and generalisation compared with single-model approaches. The second contribution introduces a deep hybrid model (CNN → GRU → LSTM) within a transfer learning framework. Pretrained on multi-building datasets and fine-tuned using limited data from new buildings, this approach enhances cross-domain adaptability while reducing training time and data requirements, demonstrating the practical value of transfer learning for scalable energy forecasting. A third contribution integrates statistical peak detection to support the identification of high- demand events, enabling forecasting outputs to inform grid-interactive building operations. Rigorous evaluation including multi-metric assessment, residual diagnostics, ablation testing, and statistical significance analysis confirms the reliability and robustness of the proposed models. Overall, the thesis provides methodological and empirical advances that strengthen data-driven building energy management. The results show that hybridisation and transfer learning, when carefully designed, can enhance accuracy, stability, and generalisation, offering a scalable pathway toward more efficient and sustainable smart building operations.
    16 0
  • ItemRestricted
    GAN-Enhanced Super-Resolution Pipeline for Robust Word Recognition in Low-Quality Non-English Handwriting.
    (Saudi Digital Library, 2025) Shahbi, Bilqis; Xia, Panqiu
    Executive summary The dissertation tackles a critical issue where current optical character recognition (OCR) technologies fall short: correctly identifying handwritten non-English scripts in poor quality and deteriorated settings. While OCR technologies have matured for printed English and other Latin-based languages, scripts such as Arabic, Devanagari, and Telugu remain underrepresented due to structural complexities, cursive connections, diacritics, ligatures, and the limited availability of annotated datasets. These challenges are amplified by real-world factors such as low-resolution scans, noisy archival documents, and mobile phone captures, where fine details necessary for recognition are lost. The study proposes a two-stage deep learning pipeline that integrates super-resolution with recognition, explicitly designed to address these shortcomings. The first stage of the pipeline utilises Real-ESRGAN, a generative adversarial network specifically optimised for real-world image degradation. Unlike earlier models such as SRCNN, VDSR, and ESRGAN, which often prioritize aesthetics or hallucinate features, Real-ESRGAN reconstructs high-resolution images with sharper strokes, preserved ligatures, and clear diacritics. Its Residual-in-Residual Dense Block (RRDB) architecture combines residual learning and dense connections, enabling robust recovery of fine-grained textual details. By preserving structural fidelity rather than merely visual appeal, Real-ESRGAN ensures that enhanced images retain the critical features necessary for recognition. The second stage utilises a Convolutional Recurrent Neural Network (CRNN) with Connectionist Temporal Classification (CTC) loss function. The CRNN combines convolutional layers for feature extraction, bidirectional LSTM layers for capturing sequential dependencies, and CTC decoding for alignment-free sequence prediction. This integration eliminates the need for explicit segmentation, a complicated task in cursive or densely written scripts. Together, the two stages form a cohesive system in which image enhancement directly supports recognition accuracy. To ensure robustness, the research incorporated extensive dataset preparation and preprocessing. Handwritten datasets for Arabic, Devanagari, and Telugu scripts were selected to reflect structural diversity. Preprocessing included resizing, artificial noise simulation (Gaussian noise, blur, and compression artefacts), and augmentation (rotations, elastic distortions, and brightness adjustments). These techniques increased dataset variability and improved the model's ability to generalize to real-world handwriting scenarios. Evaluation was conducted at both image and recognition levels. Image quality was assessed using the Peak Signal-to-Noise Ratio (PSNR) and the Structural Similarity Index (SSIM). At the same time, recognition performance was measured using the Character Error Rate (CER) and the Word Error Rate (WER). This dual evaluation ensured that improvements in image clarity translated into tangible recognition gains. The results confirm the effectiveness of the proposed pipeline. Real-ESRGAN showed improvements compared to SRCNN, VDSR, and ESRGAN, with higher PSNR and SSIM values across scripts. These gains reflect superior structural fidelity, particularly in preserving Arabic cursive flows, Devanagari's horizontal headstrokes, and Telugu's stacked ligatures. Recognition accuracy also improved compared to baseline low-resolution inputs. Script-specific analysis showed more precise word boundaries in Arabic, sharper conjuncts and diacritics in Devanagari, and more distinct glyph separations in Telugu. When benchmarked against traditional OCR systems, such as Tesseract, the pipeline demonstrated clearer recognition outcomes, indicating the critical role of task-specific super-resolution in OCR, proving that enhancing input quality directly strengthens recognition performance. The dissertation makes contributions across methodological, empirical, theoretical, and practical domains. Methodologically, it demonstrates the value of integrating enhancement and recognition stages in a fine-tuned pipeline, ensuring that improvements in image clarity yield measurable gains in recognition. Empirically, it validates the effectiveness of Real-ESRGAN for handwritten text, showing consistent improvements across structurally diverse scripts. Theoretically, it advances the understanding of script-sensitive OCR, emphasizing the preservation of structural features such as diacritics and ligatures. Practically, the work highlights applications in archival preservation, e-governance, and education. By enabling more accurate digitisation of handwritten records, the system supports inclusive access to information and the preservation of linguistic heritage. The study acknowledges several limitations. The scarcity of diverse annotated datasets constrains the model's generalizability to other scripts such as Amharic or Khmer. The computational expense of training Real-ESRGAN limits its feasibility in low-resource settings. Occasional GAN artefacts, where spurious strokes or distortions appear, pose risks in sensitive applications such as legal documents. Moreover, the pipeline has not been extensively tested on mixed-script texts, common in multilingual societies. These limitations suggest avenues for future work, including developing larger and more diverse datasets, designing lightweight models for real-time and mobile deployment, integrating script identification for mixed-language documents, and incorporating explainable AI for greater transparency in recognition decisions. In conclusion, the dissertation demonstrates that GAN-enhanced super-resolution is not merely a cosmetic tool but an essential step toward robust OCR in non-English handwritten texts. By aligning image enhancement with recognition objectives, the proposed pipeline reduces error rates and generalizes across diverse scripts. Its implications extend beyond technical achievement to cultural preservation, digital inclusion, and the democratization of access to information. At the same time, the identified limitations provide a roadmap for future research, ensuring that multilingual OCR evolves into a truly global and inclusive technology.
    22 0
  • ItemRestricted
    CADM: Creative Accounting Detection Model in Saudi-Listed Companies
    (Saudi Digital Library, 2025) Bineid, Maysoon Mohamed; Beloff, Natalia
    In business, financial statements are the primary source of information for investors and other stakeholders. Despite extensive regulatory efforts, the quality of financial reporting in Saudi Arabia still requires improvement, as prior studies have documented evidence of creative accounting. This practice occurs when managers manipulate accounting figures within the boundaries of the International Financial Reporting Standards to present a more favourable image of the company. Although various fraud detection methods exist, identifying manipulations that are legal yet misleading remains a significant challenge. This research introduces the Creative Accounting Detection Model (CADM), a deep learning (DL)-based approach that employs Long Short-Term Memory (LSTM) networks to identify Saudi-listed companies engaging in creative accounting. Two versions of the model were developed: CADM1, trained on a simulated dataset based on established accounting measures from the literature, and CADM2, trained on a dataset tailored to reflect financial patterns observed in the Saudi market. Both datasets incorporated financial and non-financial features derived from a preliminary survey of Saudi business experts. The models achieved training accuracies of 100% (CADM1) and 95% (CADM2). Both models were then tested on real-world data from the Saudi energy sector (2019–2023). CADM1 classified one company as engaging in creative accounting, whereas CADM2 classified all companies as non-creative but demonstrated greater stability in prediction confidence. To interpret these results, a follow-up qualitative study involving expert interviews confirmed CADM as a promising supplementary tool for auditors, enhancing analytical and oversight capabilities. These findings highlight CADM’s potential to support regulatory oversight, strengthen auditing procedures, and improve investor trust in the transparency of financial statements.
    17 0
  • ItemRestricted
    The Additional Regulatory Challenges Posed by AI In Financial Trading
    (Saudi Digital Library, 2025) Almutairi, Nasser; Alessio, Azzutti
    Algorithmic trading has shifted from rule-based speed to adaptive autonomy, with deep learning and reinforcement learning agents that learn, re-parameterize, and redeploy in near real time, amplifying opacity, correlated behaviours, and flash-crash dynamics. Against this backdrop, the dissertation asks whether existing EU and US legal frameworks can keep pace with new generations of AI trading systems. It adopts a doctrinal and comparative method, reading MiFID II and MAR, the EU AI Act, SEC and CFTC regimes, and global soft law (IOSCO, NIST) through an engineering lens of AI lifecycles and value chains to test functional adequacy. Chapter 1 maps the evolution from deterministic code to self-optimizing agents and locates the shrinking space for real-time human oversight. Chapter 2 reframes technical attributes as risk vectors, such as herding, feedback loops, and brittle liquidity, and illustrates enforcement and stability implications. Chapter 3 exposes human-centric assumptions (intent, explainability, “kill switches”) embedded in current rules and the gaps they create for attribution, auditing, and cross-border supervision. Chapter 4 proposes a hybrid, lifecycle-based model of oversight that combines value-chain accountability, tiered AI-agent licensing, mandatory pre-deployment verification, explainability XAI requirements, cryptographically sealed audit trails, human-in-the-loop controls, continuous monitoring, and sandboxed co-regulation. The contribution is threefold: (1) a technology-aware risk typology linking engineering realities to market integrity outcomes; (2) a comparative map of EU and US regimes that surfaces avenues for regulatory arbitrage; and (3) a practicable governance toolkit that restores traceable accountability without stifling beneficial innovation. Overall, the thesis argues for moving from incremental, disclosure-centric tweaks to proactive, lifecycle governance that embeds accountability at design, deployment, and post-trade, aligning next-generation trading technology with the enduring goals of fair, orderly, and resilient markets.
    14 0
  • ItemRestricted
    Semi-Supervised Approach For Automatic Head Gesture Classification
    (Saudi Digital Library, 2025) Alsharif, Wejdan; Hiroshi, Shimodaira
    This study utilizes a semi-supervised method, particularly self-training, for automatic head gesture recognition using motion caption data. It explores and compares fully supervised deep learning models and self-training pipelines in terms of their perfor- mance and training approaches. The proposed approach achieved an accuracy score of 52% and a macro F1 score of 44% in the cross validation. Results have shown that leveraging self-training as part of the learning process contributes to improved model performance, due to generating pseudo-labeled data that effectively supplements the original labeled dataset, thereby enabling the model to learn from a larger and more diverse set of training examples.
    7 0
  • ItemRestricted
    ENHANCING DATAREPRESENTATION IN DISTRIBUTED MACHINE LEARNING
    (Saudi Digital Library, 2025) Aladwani, Tahani Abed; CHRISTOSANAGNOSTOPOULOS
    Distributed computing devices, ranging from smartphones to edge micro-servers—collectively referred to as clients—are capable of gathering and storing diverse types of data, such as images and voice recordings. This wide array of data sources has the potential to significantly enhance the accuracy and robustness of Deep Learning (DL) models across a variety of tasks. However, this data is intrinsically heterogeneous, due to the differences in users’ preferences, lifestyles, locations, and other factors. Consequently, it necessitates comprehensive preprocessing (e.g., labeling, filtering, relevance assessment, balancing, etc.) to ensure its suitability for the development of effective and reliable models. Therefore, this thesis explores the feasibility of conducting predictive analytics and model inference on edge computing (EC) systems when access to data is limited, and on clients’ devices through federated learning (FL) when direct access to data is entirely restricted. The first part of this thesis focuses on reducing the data transmission rate between clients and EC servers by employing techniques such as data and task caching, identifying data overlaps, and evaluating task popularity. While this strategy can significantly minimize data offloading to the lowest possible level, it does not entirely eliminate dependence on third-party entities. The second part of this thesis eliminates the dependency on third-party entities by implementing FL, where direct access to raw data is not possible. In this context, node and data selection are guided by predictions and model performance. The objective is to identify the most suitable nodes and relevant data for training by clustering nodes based on data characteristics and analyzing the overlap between query boundaries and cluster boundaries. The third part of this thesis introduces a mechanism designed to support classification tasks, such as image classification. These tasks present significant challenges when building models on distributed data, particularly due to issues like label shifting or missing labels across clients. To address these challenges, the proposed method mitigates the impact of imbalances across clients by employing multiple cluster-based meta-models, each tailored to specific label distributions. The fourth part of this thesis introduces a two-phase federated self-learning framework, termed 2PFL, which addresses the challenges of extreme data scarcity and skewness when training classifiers over distributed labeled and unlabeled data. 2PFL demonstrates the capability to achieve high-performance models, even when trained with only 10% to 20% labeled data compared to the available unlabeled data. The conclusion chapter underscores the importance of adaptable learning mechanisms that can respond to the continuous changes in clients’ data volume, requirements, formats, and protection regulations. By incorporating the EC layer, we can alleviate concerns related to data privacy, reduce the volume of data needing offloading, expedite task execution, and facilitate the training of complex models. For scenarios demanding stricter privacy-preserving measures, FL offers a viable solution, enabling multiple clients to collaboratively train models while adhering to user privacy protection, data security, and government regulations. However, due to the indirect access to data inherent in FL, several challenges must be addressed to ensure the development of high-performance models. These challenges include imbalanced data distribution across clients, partially labeled data, and fully unlabeled data, all of which are explored and demonstrated through experimental evaluations.
    7 0

Copyright owned by the Saudi Digital Library (SDL) © 2026