FINE-GRAINED CYBERBULLYING DETECTION THROUGH BYSTANDER FEATURES IN STEP-WISE DEEP LEARNING MODEL

Alfurayj, Haifa Saleh

FINE-GRAINED CYBERBULLYING DETECTION THROUGH BYSTANDER FEATURES IN STEP-WISE DEEP LEARNING MODEL

Files

SACM-Dissertation.pdf (8 MB)

Date

2025

Authors

Alfurayj, Haifa Saleh

Publisher

UNIVERSITI SAINS MALAYSIA

Abstract

الملخص يُعَدّ الكشف الدقيق عن التنمّر الإلكتروني في شبكات التواصل الاجتماعي أمرًا بالغ الأهمية لضمان السلامة عبر الإنترنت، خصوصًا للأفراد المتأثرين بعواقبه. غالبًا ما تكون لغة التنمّر الإلكتروني ضمنية، مما يتطلب تحليلًا شاملًا للسياق الحواري لتحديد النية وشدة الفعل. ومع ذلك، تتجاهل الدراسات الحالية ومجموعات البيانات المستخدمة الدور المهم للسياق الحواري الذي يقدّمه المتفرجون (أي المستخدمون الآخرون)، الذين قد يلعبون دورًا أساسيًا في تقدير مستوى خطورة أحداث التنمّر الإلكتروني. علاوة على ذلك، تركّز معظم الدراسات حول اكتشاف التنمّر الإلكتروني على التغريدات الفردية فقط، وغالبًا ما تُصاغ المهمة ضمن نظام تصنيف ثنائي مبسّط يُحدَّد فيه ما إذا كان النص يحتوي على محتوى تنمّر إلكتروني أم لا. هذا الأسلوب قد يؤدي إلى الخلط بين العدوان الإلكتروني ومستويات العدوان المختلفة في حالات التنمّر، مما يُضعف من موثوقية بعض هذه الدراسات. لمعالجة هذه المشكلات، تتناول هذه الدراسة مشكلة سوء التصنيف من خلال تعزيز اكتشاف التنمّر الإلكتروني باستخدام تصنيفات دقيقة متعددة المستويات. وتمدّ هذه الدراسة الجهود السابقة في مجال اكتشاف التنمّر الإلكتروني من خلال أخذ المحادثة الكاملة في الخيط بعين الاعتبار، بهدف التقاط أهمية أدوار المتفرجين. ندرس بشكل خاص تأثير المتفرجين لتطوير تحديد أكثر دقة لهجمات التنمّر الإلكتروني. وفي هذا السياق، نقدّم مجموعة بيانات المصمّمة لاكتشاف التنمّر الإلكتروني الدقيق والمعتمد على الخيوط مع تضمين أدوار المتفرجين. تعتمد منهجيتنا على إعادة ضبط نموذج المُدرَّب مسبقًا للتعرّف على السمات المرتبطة بالمتفرجين، وتصنيفهم إلى أدوار مثل المدافعون، المحرّضون، المحايدون، وغيرهم. ولتعزيز اكتشاف التنمّر الإلكتروني الدقيق، نقترح نموذج تصنيف تدريجي مبتكر يدمج مخرجات طبقة الفلتر مع شبكات . ويتضمّن هذا النموذج طبقة حاسمة تُعرف بـ طبقة الفلترة ، والتي – استنادًا إلى تنبؤات ودرجة الثقة ومعدل الخطأ المسموح به – تحدد عتبة عامة تفصل التنبؤات إلى مجموعتين. تأخذ هذه الآلية في الحسبان الترابط بين التسميات وموثوقيتها، مما يقدّم حلًّا مبتكرًا لمشكلة سوء تصنيف التنمّر الإلكتروني. أظهرت التجارب أن إضافة طبقة الفلترة إلى مرحلة التصنيف الأولى فعّالة في الحد من انتشار عدم اليقين، إذ أظهرت النتائج تحسنًا ملحوظًا في جميع المقاييس الموزونة مقارنة بالنموذج الأصلي بدون هذه الطبقة. وتُظهر النتائج الرئيسية أن النموذج المقترح – الذي يتضمن سمات أدوار المتفرجين – قد حقق قيمة F1 بلغت 0.91، ما يُعدّ تحسنًا واضحًا مقارنة بالنموذج الأساسي. لا يكتفي هذا النموذج بالاستفادة من ميزة المتفرجين الجديدة، بل ينجح أيضًا في تمييز مستويات العدوان المختلفة في التنمّر الإلكتروني بدقة أكبر.

Description

ABSTRACT Accurate detection of cyberbullying on Social Network Sites is crucial for online safety, particularly for individuals affected by its consequences. The language of cyberbullying is often implicit, necessitating a comprehensive analysis of the conversational context to determine intention and severity. However, current studies and the used datasets overlook the valuable insights of the conversational context provided by bystanders (i.e., other users), who might play an important role in grading the severity of cyberbullying events. Moreover, most studies on cyberbullying detection primarily focus on standalone Tweets, often framing the task within a simplified binary classification system in which the text is determined either to contain cyberbullying content or not. This approach raises the risk of confusion over the differences between cyberaggression and the varying levels of aggression in cyberbullying instances, potentially undermining the reliability of some of these studies. To ameliorate these issues, this study addresses the issue of misclassification by augmenting cyberbullying detection with fine-grained classes. This work extends previous research effort in the field of cyberbullying detection by considering the whole conversation thread with a view toward capturing the significance of bystander roles. We specifically investigate the influence of bystanders to develop a more precise identification of cyberbullying attacks. In particular, we introduce the CYBY24 dataset for thread-based fine-grained cyberbullying detection with bystander roles. Our approach involves fine-tuning a pre-trained BERT model to identify features associated with bystanders, categorizing them into roles such as defenders, instigators, impartials, and others. To enhance fine-grained cyberbullying detection, we propose a novel step-wise classification model that combines BERT output with BiLSTM networks. A critical aspect of this model is the filter layer, which based on BERT predictions and a confidence score, and a permissible error rate, determines a universal threshold to separate predictions into two subsets. This approach considers the interdependency of labels with reliability, providing an innovative soluxviii tion to the challenge of cyberbullying misclassification. Our experiments demonstrate that incorporating a filter layer to the first classification stage is effective in limiting uncertainty propagation, as the results show a significant increase across all weighted metrics compared to the original model without the filter layer. The key findings of our experiments show that the proposed model, which incorporates bystander role features, achieved an F1-score of 0.91, demonstrating a notable improvement over the baseline model. Our approach not only leverages the newly introduced feature of bystanders but also effectively delineates the detection to a nuanced level, specifically differentiating between various levels of cyberbully aggression.

Keywords

cyberbullying detection, machine learning

URI

https://hdl.handle.net/20.500.14154/77052

Collections

Postgraduate Theses & Dissertations

Full item page

FINE-GRAINED CYBERBULLYING DETECTION THROUGH BYSTANDER FEATURES IN STEP-WISE DEEP LEARNING MODEL

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By