Predicting Osteoarthritis in Older Adults Using Literature-Based, Non-Invasive Risk Factors: A Cross-Sectional Analysis of ELSA Wave 9

dc.contributor.advisorYang, Hui
dc.contributor.authorFnais, Tesneem
dc.date.accessioned2025-10-28T15:59:12Z
dc.date.issued2025
dc.description.abstractOsteoarthritis (OA) is a prevalent joint disorder in older adults that is often diagnosed at a later stage, as clinical assessments typically rely on imaging and laboratory tests that are not readily accessible in all settings. This study aimed to develop and evaluate machine learning models that predict OA using non-invasive, self-reported features from Wave 9 of the English Longitudinal Study of Ageing (ELSA). A total of 4,723 participants aged 60 and above were included. An initial set of 32 features was selected based on existing literature and refined through a structured feature selection pipeline, resulting in a final set of 25 features, including joint pain and mobility limitations. Four supervised models -Logistic Regression, Random Forest, XGBoost, and CatBoost- were trained using a stratified train-test split and resampling to address class imbalance. The upsampled logistic regression model achieved the highest sensitivity (0.769) and strong overall performance (AUC = 0.755), while CatBoost showed the highest specificity (0.759) and an AUC of 0.747. A reduced logistic regression model using only the top 15 features retained similar accuracy and AUC. These findings demonstrate that OA can be predicted without imaging or biomarkers. The resulting models, particularly the logistic regression model, offer promise as cost-effective screening tools to support early identification and guide decisions about further clinical assessment. making them well-suited for primary care and digital health settings, especially where resources are limited.
dc.format.extent32
dc.identifier.urihttps://hdl.handle.net/20.500.14154/76745
dc.language.isoen
dc.publisherSaudi Digital Library
dc.subjectosteoarthritis
dc.subjectmachine learning
dc.subjectpredictive modelling
dc.subjectOA
dc.subjectdata
dc.subjectaging population
dc.titlePredicting Osteoarthritis in Older Adults Using Literature-Based, Non-Invasive Risk Factors: A Cross-Sectional Analysis of ELSA Wave 9
dc.typeThesis
sdl.degree.departmentHealth Data Science
sdl.degree.disciplineHealth Data Science
sdl.degree.grantorUniversity of Birmingham
sdl.degree.nameMSc Health Data Science

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
SACM-Dissertation.pdf
Size:
1.22 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.61 KB
Format:
Item-specific license agreed to upon submission
Description:

Copyright owned by the Saudi Digital Library (SDL) © 2026