مؤشرات التحليل البعدي للعوامل المؤثرة في مواصفات الاختبار الجيد

الزهراني، محمد رزق الله رزق اللهخياط، وجدان وديع محمد2023-09-142023-09-142023-03-01https://hdl.handle.net/20.500.14154/69165The current study aimed to identify the indicators of the Meta Analysis of the factors affecting the specifications of a good test, in light of the difference in each of: the type of test (achievement - psychological), and the educational stage (elementary - intermediate - secondary - university). To achieve the objectives of the study, the Meta Analysis approach was used. And the researcher reviewed the previous studies in the characteristics of a good test. Afterward, the researcher concluded by maintaining (79) measurement tools that included sufficient data to calculate the size of the effect on the study variables, including: (16) achievement test and (63) psychological test. (250) effect sizes were obtained, using Cohen's equation, including: (45) effect sizes for achievement tests, and (205) effect sizes for psychological tests. The results of the study are as follows: '- The level of effect size of the validation methods according to Cohen's criterion was as follows: the total effect size of the validity of the discriminative comparison amounted to (4.087), which is a high effect size, followed by the intrinsic-validity method, as the total effect size of it reached (0.930), which is an effect size High. Then the construct validity method, where the total effect size was (0.696), which is a medium effect size. Finally, the constructive validity method, where the total effect size was (0.692), which is a medium effect size. - There are statistically significant differences between the mean effect size of the following methods: construct validity, discriminative comparison, and concurrent validity in favor of achievement tests. - There are statistically significant differences between the mean effect size of the intrinsic-validity method in favor of psychological tests. '- There are statistically significant differences between the mean effect size of construct validity and intrinsic-validity methods in favor of the secondary stage, while there are statistically significant differences between the mean effect size of concurrent validity in favor of the university stage, while the differences between the mean effect size of discriminative comparison validity were in favor of the intermediate stage. '- The effect size of the stability methods was as follows: the total effect size of test -retest method was (0.867), which is a high effect size, followed by Cronbach's alpha method, where the total effect size was (0.851), which is a high effect size. Finally, the split-half method, where the total effect size was (0.851), which is also a high effect size. '- There are statistically significant differences between the mean effect size in the stability methods (Cronbach's alpha / split-half / test -retest) in favor of the achievement tests. '- There are statistically significant differences between the mean effect size of the stability methods (Cronbach's alpha and split-half) in favor of the primary stage, while there are statistically significant differences between the mean effect size of test -retest method in favor of the university stage.هدفت الدراسة الحالية إلى التعرف على مؤشرات التحليل البعدي للعوامل المؤثرة في مواصفات الاختبار الجيد، في ضوء اختلاف كل من: نوع الاختبار (تحصيلي – نفسي)، والمرحلة الدراسية (ابتدائي – متوسط - ثانوي – جامعي). ولتحقيق أهداف الدراسة تم استخدام المنهج التحليلي البعدي، حيث تمت مراجعة دراسات سابقة في خصائص الاختبار الجيد، وخلصت الباحثة إلى الإبقاء على (٧٩) أداة قياس اشتملت على بيانات كافية لحساب حجم التأثير في متغيرات الدراسة، منها: (١٦) اختبار تحصيلي و(٦٣) اختبار نفسي. وتم الحصول على (٢٥٠) حجم تأثير باستخدام معادلة كوهين، منها: (٤٥) حجم تأثير للاختبارات التحصيلية، و (٢٠٥) حجم تأثير للاختبارات النفسية. وأظهرت النتائج ما يلي: -بلغ مستوى حجم تأثير أساليب الصدق وفق محك كوهين كالتالي: حجم التأثير الكلي لصدق المقارنة الطرفية بلغ (٤,٠٨٧)، وهو حجم تأثير مرتفع، يليه أسلوب الصدق الذاتي، حيث بلغ حجم التأثير الكلي لها (٠,٩٣٠)، وهو حجم تأثير مرتفع. ثم أسلوب الصدق التلازمي حيث بلغ حجم التأثير الكلي (٠,٦٩٦)، وهو حجم تأثير متوسط. وأخيراً اسلوب الصدق البنائي، حيث بلغ حجم التأثير الكلي (٠,٦٩٢)، وهو حجم تأثير متوسط. -توجد فروق ذات دلالة إحصائية بين متوسطي حجم التأثير لأساليب: الصدق البنائي، المقارنة الطرفية، والصدق التلازمي لصالح الاختبارات التحصيلية. -توجد فروق ذات دلالة إحصائية بين متوسطي حجم التأثير لأسلوب الصدق الذاتي لصالح الاختبارات النفسية. -توجد فروق ذات دلالة إحصائية بين متوسطات حجم التأثير في أساليب الصدق البنائي والصدق الذاتي لصالح المرحلة الثانوية، بينما توجد فروق ذات دلالة احصائية بين متوسطات حجم التأثير للصدق التلازمي لصالح المرحلة الجامعية، أما الفروق بين متوسطات حجم التأثير لصدق المقارنة الطرفية فقد كان لصالح المرحلة المتوسطة. -بلغ مستوى حجم تأثير أساليب الثبات كالتالي: حجم التأثير الكلي لأسلوب إعادة التطبيق بلغ (٠,٨٦٧)، وهو حجم تأثير مرتفع، يليه اسلوب الفا كرونباخ، حيث بلغ حجم التأثير الكلي (٠,٨٥١)، وهو حجم تأثير مرتفع. وأخيراً اسلوب التجزئة النصفية، حيث بلغ حجم التأثير الكلي (٠,٨٥١)، وهو أيضاً حجم تأثير مرتفع. - توجد فروق ذات دلالة إحصائية بين متوسطي حجم التأثير في أساليب الثبات (الفا كرونباخ- التجزئة النصفية- إعادة التطبيق) لصالح الاختبارات التحصيلية. -توجد فروق ذات دلالة إحصائية بين متوسطات حجم التأثير في أساليب الثبات (الفا كرونباخ والتجزئة النصفية) لصالح المرحلة الابتدائية، بينما توجد فروق ذات دلالة احصائية بين متوسطات حجم التأثير لإعادة التطبيق لصالح المرحلة الجامعية.224ar-SAالصدقالثباتالخصائص السيكو متريةالاختبار الجيدالدلالة العلميةحجم التأثيرالتحليل البعديvalidityreliabilitypsychometric propertiesgood testpractical significanceMeta Analysiseffect sizeمؤشرات التحليل البعدي للعوامل المؤثرة في مواصفات الاختبار الجيدIndicators of the meta analysis of factors affecting the specifications of a good testThesis