The Quality of Test Items in the Arabic Grammar Olympiad: A Deep Learning Perspective

Kualitas Soal dalam Olimpiade Tata Bahasa Arab: Perspektif Pembelajaran Mendalam

Authors

  • Faiz Nasyith Hakim Universitas Pendidikan Indonesia
  • Hikmah Maulani Universitas Pendidikan Indonesia, Indonesia
  • Ilham Sofiyan Al-Azhar University, Egypt

DOI:

https://doi.org/10.58194/eloquence.v5i1.3497

Keywords:

Deep Learning, Quality Questions, Olympiad, Arabic Grammar, Taxonomy Bloom

Abstract

Background: Test instruments play an important role in measuring abilities, including in competitive events such as the Arabic Grammar Olympiad; therefore, standardized, valid test items that support learning outcomes are required to map participants' ability profiles accurately. The Indonesian government also emphasizes the concept of Deep Learning, which focuses not only on cognitive aspects but also on achieving graduate profile outcomes.

Purpose: This study aims to analyze Olympiad test items in terms of validity, reliability, difficulty level, discrimination power, distractor effectiveness, cognitive level, and their alignment with the eight Deep Learning graduate profiles of the Merdeka Curriculum.

Method: This study uses a mixed-method approach, consisting of a quantitative ex post facto design and a qualitative content analysis design.

Results and Discussion: The findings indicate that the Olympiad test items demonstrate high reliability; However, only 43% of the items are valid, 87% are categorized as easy in terms of difficulty level, 57% have poor discrimination power, and only 40% of the distractors are effective, and the cognitive level of the items is still dominated by the Lower Order Thinking Skills (LOTS) category (67%). Furthermore, the test items do not fully represent the eight Deep Learning graduate profiles, as only three profiles are reflected: independence (100%), faith and devotion to God Almighty (33%), and critical reasoning (20%).

Conclusions and Implications: This study concludes that the Arabic Grammar Olympiad test items still require improvement to function optimally as instruments for measuring participants’ abilities while supporting the achievement of Deep Learning graduate profiles.

References

[1] A. Harahap, Evaluasi Pembelajaran Berbasis Hots Dalam Kurikulum Merdeka. Indramayu: CV. Adanu Abimata, 2024.

[2] M. Laili, “Ketepatan Kontruksi Butir Pilihan Ganda Bahasa Arab,” ALSUNIYAT J. Penelit. Bahasa, Sastra, dan Budaya Arab, vol. 3, no. 2, pp. 111–124, 2020, doi: https://doi.org/10.17509/alsuniyat.v3i2.25272

[3] D. S. Logayah, M. Ruhimat, R. Arrasyid, and M. R. F. Islamy, “Item analysis of National Geography Olympiad multiple-choice questions (MCQs) in Indonesia,” Cogent Soc. Sci., vol. 10, no. 1, p., 2024, doi: https://doi.org/10.1080/23311886.2024.2354971

[4] M. Taufik and R. D. Susanti, “Pelatihan dan pendampingan olimpiade matematika berbasis strategi heuristik,” vol. 7, no. 1, pp. 1–2, 2023. https://doi.org/10.31764/jmm.v7i1.12226

[5] J. Setiawan, A. Sudrajat, Aman, and D. Kumalasari, “Development of higher order thinking skill assessment instruments in learning Indonesian history,” vol. 10, no. 2, 2021, doi: https://doi.org/10.11591/ijere.v10i2.20796

[6] S. Kadir, S. Sarif, and A. H. N. Fuadi, “Item Analysis of Arabic Thematic Questions to Determine Thinking Level Ability,” ELOQUENCE J. Foreign Lang., vol. 3, no. 1, pp. 26–39, 2024, doi: https://doi.org/10.58194/eloquence.v3i1.1498

[7] N. I. Rashwan, S. R. Aref, O. A. Nayel, and M. H. Rizk, “Postexamination item analysis of undergraduate pediatric multiple-choice questions exam: implications for developing a validated question Bank,” BMC Med. Educ., vol. 24, no. 1, pp. 1–9, 2024, doi: https://doi.org/10.1186/s12909-024-05153-3

[8] M. F. Imaduddin, H. Maulani, and I. H. Taufik, “Test the Validity and Reliability of Arabic Learning Questions,” Arab. J. Arab. Stud., vol. 7, no. 2, pp. 198–207, 2022, doi: https://doi.org/10.24865/ajas.v7i2.523

[9] L. Anderson and D. Krathwohl, Kerangka Landasan Untuk Pembelajaran, Pengajaran, Dan Asesmen Revisi Taksonomi Pendidikan Bloom. Yogyakarta: Pustaka Pelajar, 2015.

[10] Kemendikdasmen RI, “Pembelajaran Mendalam Menuju Pendidikan Bermutu untuk Semua,” 2025.

[11] S. Lestari and F. N. Yusuf, “Aligning Assessment Practices with Learning Objectives : A Case of EFL Classes in Indonesia,” vol. 10, no. May, pp. 145–163, 2025. https://doi.org/10.21093/ijeltal.v10i1.1973

[12] S. Musliha, D. Sudana, and Y. Wirza, “The Analysis of Higher Order Thinking Skills ( HOTs ) in the Test Questions Constructed by English Teachers,” vol. 595, no. Icollite, pp. 610–617, 2021. https://doi.org/10.2991/assehr.k.211119.095

[13] N. Aulia and R. U. Baroroh, “Innovation of Qawaid Nahwiyah Assessment in Arabic Textbooks Class XII Based on HOTS Assessment,” vol. 02, no. 2021, 2024, doi: https://doi.org/10.32332/ijalt.v6i01.8678

[14] Rust, “Learning and Teaching Briefing Papers Series:Theories of learning,” p. 3, 2002.

[15] S. Matazu and E. Julius, “Item Analysis: A Veritable Tool for Effective Assessment in Teaching and Learning,” J. Educ. Pract., vol. 12, no. 21, pp. 22–28, 2021, doi: https://doi.org/10.7176/JEP/12-21-04

[16] S. Bhattacherjee, A. Mukherjee, K. Bhandari, and A. Rout, “Physical Violence Against Doctors: A Content Analysis from Online Indian Newspapers,” Indian J. Community Med., vol. 42, no. 1, pp. 147–50, 2022, doi: 10.4103/ijcm.IJCM.

[17] I. R. N. Fauziah, Syihabudin, and A. Sopian, “Analisis Kualitas Tes Bahasa Arab Berbasis Higher Order Thingking Skill (HOTS),” vol. 10, no. 1, pp. 45–54, 2020. https://doi.org/10.22373/ls.v10i1.7805

[18] M. Nojomi and M. Mahmoudi, “Assessment of multiple-choice questions by item analysis for medical students’ examinations,” Res. Dev. Med. Educ., vol. 11, no. 1, p. 24, 2022, doi: https://doi.org/10.34172/rdme.2022.024

[19] Sahrani, S. Herlina, Sumin, and Hermansyah, “Evaluating The Effectiveness Of Arabic Language Summative Tests,” vol. 9, no. 2, pp. 190–206, 2024. https://doi.org/10.28918/alsinatuna.v9i2.6858

[20] V. N. I. Sari, A. P. Y. Utomo, and Sumarwati, “Kualitas Soal Bahasa Indonesia di SMP Muhammadiyah 1 Pontianak: Analisis Butir Soal,” J. Pendidik. Bhs. dan Sastra Indones., vol. 11, no. 2, pp. 112–119, 2022, [Online]. Available: https://journal.unnes.ac.id/sju/index.php/jpbsi/article/view/24018

[21] R. Istikomah, M. Al Farisi, and H. Maulani, “Classpoint AI : Kualitas Empiris Pembuat Soal Otomatis dalam Assessmen Bahasa Arab Classpoint AI : Empirical Quality of Automatic Item Test Generator in Arabic Foreign Language Assessment,” vol. 8, no. 1, pp. 44–64, 2025, doi: https://doi.org/10.26555/insyirah.v8i1.13163

[22] B. S. Hariati et al., “Analisis Kualitas Butir Soal Pilihan Ganda Elemen Dokumen Berbasis Digital Menggunakan ANATES,” vol. 6, no. 01, pp. 46–58, 2026. https://doi.org/10.57008/jjp.v6i01.1956

[23] I. I. Umiyati and F. Fakhriyah, “Analisis Kualitas Butir Soal Konsep Sistem Gerak Manusia Kelas VI Sekolah Dasar Berdasarkan Validitas , Reliabilitas , Tingkat Kesukaran dan Daya Beda pada siswa . Miskonsepsi sering kali tidak terdeteksi jika butir soal yang digunakan tidak kualitas bank soal di sekolah . Penggunaan teknologi informasi , seperti Microsoft Excel , dan akurat . Syaifudin ( 2023 ) menyebutkan bahwa analisis butir soal secara mandiri oleh guru,” no. November 2025, 2026.

[24] A. A. Rezigalla et al., “Item analysis: the impact of distractor efficiency on the difficulty index and discrimination power of multiple-choice items,” BMC Med. Educ., vol. 24, no. 1, pp. 1–7, 2024, doi: https://doi.org/10.1186/s12909-024-05433-y

[25] K. Masrifah, M. Ahsanuddin, and M. Ainin, “The Effectiveness of HOTS-Based Practice Questions in Enhancing Students ’ Critical Thinking in Arabic Learning,” ALSUNIYAT J. Penelit. Bahasa, Sastra, dan Budaya Arab, vol. 8, no. 2, 2025, doi: https://doi.org/10.17509/alsuniyat.v8i2.87847 ALSUNIYAT.

[26] A. Fudhaili, N. Nurjannah, and A. Arifin, “Integrating Kahoot to Improve Reading Skills through HOTS-Based and Religious Moderation-Oriented Materials,” vol. 8, no. 2, 2025.

[27] A. Fauzy et al., Metodologi Penelitian. 2022.

[28] R. Abubakar, Pengantar metodologi penelitian. Yogyakarta: SUKA-Press UIN Sunan Kalijaga, 2021.

[29] B. I. Sappaile, “Konsep Penelitian Ex-Post Facto,” vol. 1, pp. 105–113, 2010.

[30] A. Veronica, M. Abas, and N. Hidayah, Metodologi penelitian kuantitatif. 2022.

[31] A. Susanto and P. Rahmah, “Cognitive Level Analysis of Problems in The Worksheets of Students ( WS ) Mathematics of Junior High School Analisis Tingkat Kognitif Soal Pada Lembar Kerja Peserta Didik ( LKPD ) Matematika MTs,” vol. 3, no. 1, pp. 75–85, 2021. https://doi.org/10.24252/ajme.v3i1.20941

Downloads

Published

2026-04-15

Issue

Section

Articles