Multi-Class Part Parsing based on Deep Learning

dc.contributor.advisorWu, Jing
dc.contributor.advisorLai, Yu-Kun
dc.contributor.advisorJi, Ze
dc.contributor.authorAlsudays, Njuod
dc.date.accessioned2025-03-11T08:27:58Z
dc.date.issued2024
dc.description.abstractMulti-class part parsing is a dense prediction task that seeks to simultaneously detect multiple objects and the semantic parts within these objects in the scene. This problem is important in providing detailed object understanding but is challenging due to the existence of both class-level and part-level ambiguities. This thesis investigates recent advancements in deep learning to tackle the challenges in multi-class part parsing. First, the AFPSNet network is proposed, which integrates scaled attention and feature fusion to tackle part-level ambiguity and thereby improving parts prediction accuracy. The integration of attention enhances feature representations by focusing on important features, while the feature fusion improves the fusion operation for different scales of features. An object-to-part training strategy is also used to address class-level ambiguity, improving the localisation of parts by exploiting prior knowledge of objects. Building on this foundation, the GRPSNet framework is introduced to further enhance the performance of multi-class part parsing. This framework integrates graph reasoning to capture relationships between parts, thereby improving part segmentation. These captured relationships help to enhance the recognition and localisation of parts. Moreover, the relationships of part boundaries are exploited to further enhance the accuracy of part segmentation. To further refine part segmentation, Multi-Class Boundaries integrated into the AFPSNet network. This integration aims to accurately identify and focus on the spatial boundaries of part classes, thereby enhancing the overall segmentation quality. Experimental results demonstrate the effectiveness of the proposed networks. Various evaluations, including ablation studies and comparisons with existing methods, were conducted on the widely used PASCAL-Part benchmark dataset and the large-scale ADE20K-Part benchmark dataset. These experiments validate the research hypotheses, showing notable improvements in part localisation and segmentation accuracy.
dc.format.extent154
dc.identifier.urihttps://hdl.handle.net/20.500.14154/75012
dc.language.isoen
dc.publisherCardiff University
dc.subjectPart parsing
dc.subjectSemantic segmentation
dc.subjectScaled attention
dc.subjectFeature fusion
dc.subjectGraph reasoning
dc.subjectMulti-class boundaries
dc.subjectDeep learning
dc.titleMulti-Class Part Parsing based on Deep Learning
dc.typeThesis
sdl.degree.departmentSchool of Computer Science & Informatics
sdl.degree.disciplineComputer Vision
sdl.degree.grantorCardiff University
sdl.degree.nameDoctor of Philosophy

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
SACM-Dissertation .pdf
Size:
55.74 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.61 KB
Format:
Item-specific license agreed to upon submission
Description:

Copyright owned by the Saudi Digital Library (SDL) © 2025