Integrating federated learning and differential privacy in dental X-ray for caries segmentation

No Thumbnail Available

Date

2024-09

Journal Title

Journal ISSN

Volume Title

Publisher

Lancaster university

Abstract

.

Description

This dissertation investigates Federated Learning (FL) as a collaborative AI training technique designed to preserve privacy, with a specific focus on its application in medical imaging, particularly for dental X-ray segmentation in caries detection. Introduced by Google in 2016, FL revolutionized machine learning by enabling model training across multiple devices without the need to transfer raw data, addressing critical privacy concerns in sensitive sectors like healthcare. This study aims to enhance the security and effectiveness of FL by incorporating differential privacy techniques to protect patient data while preserving model accuracy. The research systematically evaluates FL’s performance with varying numbers of clients, examines the impact of unbalanced data distributions, and applies differential privacy using different noise levels to improve data security. The experiments demonstrate that FL can effectively safeguard sensitive information, offering a decentralized alternative to traditional centralized machine learning methods, which carry significant risks due to data centralization. While FL addresses key privacy concerns, the study acknowledges that security issues remain, and the development of sophisticated safeguards against potential attacks is outside the scope of this research. The findings contribute to the understanding of FL’s potential in healthcare, highlighting its ability to balance privacy preservation with collaborative AI training. This research underscores FL’s viability as a privacy-preserving technology, providing valuable insights into optimizing AI models in medical applications without compromising data security. A key outcome of the experiment is that FL with a higher number of clients performs more effectively than models with fewer clients. Additionally, random data distribution outperformed other methods in most of the evaluated metrics.

Keywords

Decentralized Machine Learning, Privacy-preserving AI, Federated Learning (FL), Differential Privacy, Medical Imaging

Citation

Endorsement

Review

Supplemented By

Referenced By

Copyright owned by the Saudi Digital Library (SDL) © 2025