بررسی و تحلیل سؤالات آزمون‌های نهایی درس فارسی پایة دوازدهم علوم انسانی بر اساس طبقه‌‌‌بندی شناختی بلوم

نوع مقاله : مقاله پژوهشی

نویسندگان

1 نویسندة مسئول، استادیار گروه آموزش زبان و ادبیات فارسی، دانشگاه فرهنگیان، تهران، ایران.

2 دانشیار گروه آموزش زبان انگلیسی، دانشگاه فرهنگیان، تهران، ایران.

3 استادیار گروه آموزش زبان انگلیسی، دانشگاه فرهنگیان، تهران، ایران.

4 دانشجوی کارشناسی ارشد زبان و ادبیات فارسی، دانشگاه گیلان، گیلان، ایران.

چکیده

هدف از انجام پژوهش حاضر، شناسایی و تحلیل آزمون‌های درس فارسی پایة دوازدهم علوم انسانی طبق سطوح مختلف طبقه‌بندی بلوم است. این مطالعه با استفاده از روش تحلیل محتوا به بررسی سؤالات آزمون‌های نهایی فارسی در طول سال‌های 1393 تا 1403 پرداخته است. نخست؛ سؤالات بر اساس شش سطح طبقه‌بندی بلوم؛ شامل «دانش»، «درک»، «کاربرد»، «تحلیل»، «ارزیابی» و «ترکیب» دسته‌بندی شدند و فراوانی آن­ها تعیین شد. سپس داده‌ها با استفاده از آزمون کای اسکوئر و تحلیل آماری Fisher’s Exact Test  مورد بحث و مداقّه قرار گرفت. یافته‌ها نشان می‌دهد که در سال­های اولیه، بیشترین سهم سؤالات مربوط به سطوح پایین‌تر شناختی (دانش و درک) بوده و تمرکز عمدة آزمون‌ها بر یادگیری مفاهیم پایه و بازتولید اطلاعات است. در مقابل، سهم سؤالات مرتبط با سطوح بالاتر؛ همچون«ترکیب» و «ارزیابی» بسیار محدود بوده است. این امر بیانگر آن است که آزمون‌های نهایی درس فارسی، در دورة مورد بررسی، کمتر بر تقویت مهارت‌های تحلیلی، ارزیابی انتقادی و خلاقیّت دانش‌آموزان تأکید داشته‌اند. همچنین نتایج پژوهش نشان داد که تغییرات معناداری در توزیع سطوح مختلف طبقه‌بندی بلوم در طول زمان وجود دارد؛ به‌ویژه، سطوح بالاتر طبقه‌بندی بلوم مانند «کاربرد» و «تحلیل» در سال‌های اخیر افزایش یافته‌اند که نشان‌دهندة تغییر در رویکرد ارزیابی و توجه بیشتر به سطوح بالاتر شناختی است. در مقایسه با سال‌های پیش، سؤالات مربوط به سطوح پایین‌تر کاهش یافته است و این روند، حاکی از تغییر در انتظارات آموزشی و ارزیابی است. بنابراین، در سال‌های اخیر، تغییرات قابل توجهی در ساختار و محتوای سؤالات آزمون‌های نهایی فارسی مشاهده می‌شود که به سمت تقویت تفکر انتقادی و مهارت‌های تحلیلی حرکت کرده است.

کلیدواژه‌ها


عنوان مقاله [English]

An Analysis of Final Exam Questions in Grade 12 Persian Language (Humanities Stream) Based on Bloom’s Cognitive Taxonomy

نویسندگان [English]

  • Ghasem Mehravar Giglou 1
  • Abdolhossein Heydari 2
  • Ebrahim Samani 3
  • Amirabbas Hajjami 4
1 Corresponding Author, Assistant Professor, Department of Persian Language and Literature Education, Farhangian University, Tehran, Iran.
2 Department of English Language Teaching, Farhangian University, Tehran, Iran.
3 Department of English Language Teaching, Farhangian University, Tehran, Iran.
4 Master's student in Persian Language and Literature, University of Guilan, Guilan, Iran.
چکیده [English]

The purpose of the present study is to identify and analyze the final examination questions of the Grade 12 Persian language course in the humanities stream according to the different levels of Bloom’s taxonomy. This study, using the content analysis method, examined the questions of the national Persian final exams administered between 2014 and 2024 (1393–1403). First, the questions were categorized based on the six levels of Bloom’s taxonomy—knowledge, comprehension, application, analysis, evaluation, and creation—and their frequencies were determined. Then, the data were analyzed and examined statistically using the Chi-square test and Fisher’s Exact Test. The findings indicate that in the early years, most of the questions were concentrated in the lower cognitive levels (knowledge and comprehension), with the main focus of the exams on basic concept learning and information reproduction. In contrast, the proportion of questions at higher levels such as synthesis and evaluation was very limited. This demonstrates that, during the examined period, the Persian final exams placed less emphasis on developing students’ analytical skills, critical evaluation, and creativity. Furthermore, the results show that there have been significant changes over time in the distribution of Bloom’s cognitive levels. In particular, higher levels such as application and analysis have increased in recent years, indicating a change in the assessment approach and a greater focus on higher cognitive levels. Compared with earlier years, lower-level questions have decreased, reflecting a shift in educational expectations and evaluation practices. Therefore, in recent years, notable changes have been observed in the structure and content of the Persian final exam questions, moving toward strengthening critical thinking and analytical skills.
Extended Abstract:
Introduction
Examinations are essential instruments for evaluating students’ progress and the effectiveness of educational systems. In Iran, the final national exams (“Azmoon-e Nahayi”) play a central role in assessing students’ academic achievements at the end of high school. However, the cognitive depth of these exams — that is, how much they assess different levels of thinking — has often been questioned.
This study aims to evaluate the cognitive levels of the final exam questions in Persian language for 12th-grade humanities students over a ten-year period (2014–2024 / 1393–1403 in the Iranian calendar). The research specifically applies Bloom’s revised taxonomy to classify and analyze the questions according to six cognitive levels: knowledge, comprehension, application, analysis, evaluation, and creation. By doing so, the study seeks to determine the extent to which these exams measure higher-order thinking skills, such as critical analysis, reasoning, and creativity, rather than simple memorization and recall.
Bloom’s taxonomy (1956) and its revised version (Anderson & Krathwohl, 2001) provide a hierarchical model for categorizing cognitive processes in education. The taxonomy’s six levels — from simple recall of facts to creative synthesis and evaluation — offer a structured approach to designing and analyzing educational objectives and assessments.
According to this framework, lower-order skills (knowledge and comprehension) emphasize memory and understanding, while higher-order skills (application, analysis, evaluation, creation) involve reasoning, synthesis, judgment, and innovation.
This research builds on the idea that balanced assessment across all levels is essential for nurturing critical thinking (Facione, 1990; Halpern, 1998) — a skill recognized as fundamental for academic success and informed citizenship. The study assumes that exams heavily weighted toward lower cognitive levels may promote rote learning and limit students’ analytical and creative abilities.
The study addresses four key questions:

What cognitive levels do the national Persian exams for 12th-grade humanities students primarily assess?
Do these exams adequately cover all levels of Bloom’s taxonomy?
Which levels are most and least represented in the exam questions?
What changes or improvements can be made to design exams that better promote higher-order thinking?

Methodology
This study employed a descriptive–analytical design using the method of content analysis to examine the final national examination questions in the Persian language for 12th-grade humanities students in Iran from 2014 to 2024 (1393–1403). The data set included all official exam papers issued by the Ministry of Education, collected through the national assessment archives. Each question was carefully coded and classified according to the six cognitive levels of Bloom’s taxonomy — knowledge, comprehension, application, analysis, evaluation, and creation — in order to identify their frequency and distribution. To ensure accuracy and reliability, two independent raters reviewed and cross-checked the classification results, and discrepancies were resolved through consensus. The categorized data were entered into SPSS version 25 for statistical analysis. Descriptive statistics were used to summarize the frequency and percentage of each cognitive level, while Chi-square and Fisher’s Exact tests were employed to examine the significance of changes across years. This systematic procedure provided both quantitative and qualitative insight into evolving trends in exam design and cognitive emphasis.
 Results
Across the ten-year period, the results show a strong dominance of lower-level cognitive skills. Table 1  shows the results of  842 questions which were analyzed in this study.




Blooms’ Taxonomy levels


Frequency in Percentage




Knowledge


 26.84%




Comprehension


 63.18%




Application


 1.54%




Analysis


 7.84%




Evaluation


 0.36%




Creation


 0.24%




As Table 1 indicates, approximately 90% of all questions belong to the two lowest cognitive levels (knowledge and comprehension). In contrast, questions that require students to apply, analyze, evaluate, or create are extremely limited.
The Chi-square and Fisher’s Exact tests revealed a statistically significant relationship between exam year and cognitive level distribution (p < 0.001), suggesting some evolution over time. Specifically, questions at the “application” and “analysis” levels have modestly increased in recent years, particularly after 1400 (2021–2022). However, this growth remains limited, and the majority of questions still measure basic recall and understanding.
The findings clearly demonstrate a misalignment between educational objectives and assessment design. Despite the widespread advocacy for promoting critical thinking and analytical literacy within national curricula, the Persian final exams continue to prioritize memorization-based knowledge. The prevalence of low-level questions confirms the persistence of a memory-oriented examination culture, which reinforces surface learning rather than deep understanding.
This pattern echoes earlier studies (Philosofinejad et al., 2016; Shohamy, 2013), which found similar tendencies in Iranian and international contexts. The lack of higher-order questions means that students are rarely challenged to interpret, critique, or generate new ideas based on literary and linguistic content.
On the other hand, the gradual rise in “application” and “analysis” questions in later years suggests a slow shift in exam design philosophy, perhaps reflecting the influence of recent educational reforms and professional training for teachers and exam writers. However, these shifts are not yet substantial enough to change the overall assessment landscape.
Conclusion
Over the decade studied, national Persian exams for Grade 12 humanities students have predominantly assessed knowledge and comprehension, with minimal attention to analysis, evaluation, or creation. Although minor improvements are evident in recent years, the general pattern indicates a sustained emphasis on surface learning.
To move toward a more dynamic and cognitively rich assessment system, Iran’s educational policymakers and exam committees must consciously redesign their question frameworks to promote critical thinking, creativity, and problem-solving. Such transformation will help prepare students not just to recall information, but to apply, interpret, and evaluate it in meaningful contexts.
The predominance of lower-order cognitive questions in the national Persian exams has significant implications for the broader educational process. Such an assessment pattern reinforces rote learning, encouraging students to memorize rather than to think, analyze, and apply their knowledge critically. Consequently, learners are deprived of opportunities to engage in deeper forms of understanding, such as interpretation, synthesis, and creative expression. This imbalance not only limits the development of higher-order thinking skills but also diminishes motivation among students who might otherwise excel in analytical and evaluative tasks. Moreover, it narrows the educational outcomes by producing graduates who are adept at recalling information but insufficiently prepared for the intellectual demands of higher education and professional life.
To address these shortcomings, exam designers and curriculum planners need to adopt a more balanced and progressive approach to assessment design. Examinations should increasingly incorporate questions that target complex cognitive processes, including analysis, synthesis, and evaluation. Such reform would align testing practices more closely with the goals of the national curriculum, which aspires to nurture critical and creative thinkers. Furthermore, teacher training programs should emphasize strategies for constructing higher-order exam questions and for assessing reasoning and interpretive skills effectively. Periodic review and content analysis of national exams are also essential to ensure that all levels of Bloom’s taxonomy are adequately represented. By embedding these changes into the structure of exam design, the education system can move beyond memorization-based evaluation and cultivate a learning culture centered on inquiry, reasoning, and intellectual growth.
Conflict of Interest
The authors declare that there is no conflict of interest regarding the publication of this article.

کلیدواژه‌ها [English]

  • Bloom’s taxonomy
  • Persian final exams
  • Content analysis of questions
  • Cognitive assessment
  • Chi-square test
سپاسی، حسین. (1385). بررسی و تحلیل سطوح حیطه شناختی و شاخص­های روان­سنجی امتحان نهایی دروس عربی، حسابان و زیست­شناسی دانش­آموزان دختر پایة سوم متوسطه در سه منطقة متفاوت اقتصادی اجتماعی استان خوزستان، مجلة علوم تربیتی و روا­ن­شناسی، دورة 3، ش4، ص78-57.  https://doi.org/ 10.22055/edus.2007.15981
سیف، علی‌اکبر. (۱۳۹۹). اندازه‌گیری، سنجش و ارزشیابی آموزشی (ویرایش هفتم). تهران: دوران.
علیمراد، زهرا و رزمی، یگانه، رویا. (1403). ارزیابی محتوایی مجموعة «فارسی بیاموزیم» براساس چارچوب طبقه‌بندی تجدیدنظرشدة بلوم، ، پژوهش­نامۀ آموزش زبان فارسی به غیرفارسی­زبانان،  دورة 13، ش 1، ص 130-101. https://doi.org/10.30479/jtpsol.2024.20294.1666
فلسفی­نژاد، محمدرضا، فرخی، نورعلی و  بهرامی، لیلا. (1395). ویژگی­های روان­سنجی امتحانات نهایی سال سوم متوسطه و قابلیت آن­ها در گزینش داوطلبان ورود به دوره­های کارشناسی، فصلنامة اندازه­گیری تربیتی، دوره 7، ش23، ص 76-45. https://doi.org/10.22054/jem.2017.6387.1196
میرآقایی، علی عباس، سپاسی، حسین، مهاجران، بهناز و قلعه­ای، علیرضا. (1394). بررسی و تحلیل شاخص­های روان­سنجی و سطوح حیطه­شناختی سؤالات امتحانات نهایی دروس ریاضیات و علوم پایه سوم راهنمایی شهرستان خرم­آباد، مجلة روان­شناسی مدرسه، دورة 4، ش3، ص118-102.  https://doi.org/10.22098/jsp.2015.358.
نعمتی سرخی، سعیدی، زری و صحرایی، رضامراد. (1401). تحلیل محتوای نرم­افزارگردشگری فرهنگی بر مبنای نظریۀ یادگیری تجدیدنظرشدۀ بلوم، پژوهشنامۀ آموزش زبان فارسی به غیرفارسی­زبانان،  دورة 11، ش 1، ص 302-275. https://doi.org/10.30479/jtpsol.2022.17019.1583.
 
References:
 Alimorad, Z., & Razmi, Y. R. (2024). Content evaluation of the “Learn Persian” series based on the revised Bloom’s taxonomy framework. Persian Language Teaching Research Journal for Non-Persian Speakers, 13(1), 101–130. [In Persian]
Alwali, A. K. (2011). Benefits of using critical thinking in high education. In L. Gómez Chova, I. Candel Torres, & A. López Martínez (Eds.), Proceedings of the 5th International Technology, Education and Development Conference (INTED2011) (pp. 2527–2532). IATED. Available at: https://library.iated.org/view/ALWALI2011BEN . [In Persian]
Anderson, L. W., & Krathwohl, D. R. (Eds.). (2001). A taxonomy for learning, teaching, and assessing: A revision of Bloom's taxonomy of educational objectives. New York: Longman.
Bloom, B. S., Hastings, J. T., & Madaus, G. F. (1971). Handbook on formative and summative evaluation of student learning. McGraw-Hill.
Cheng, L., & Curtis, A. (2004). Washback or backwash: A review of the impact of testing on teaching and learning. In L. Cheng, Y. Watanabe, & A. Curtis (Eds.), *Washback in Language Testing: Research Contexts and Methods* (pp. 3–40). Routledge. https://doi.org/10.4324/9781410609731-9
Eli, A. R. (2011). Critical thinking: A literature review. Defense Acquisition University. https://www.dau.edu/sites/default/files/2023-12/CriticalThinkingReview.pdf
Facione, P. A. (1990). Critical thinking: A statement of expert consensus for purposes of educational assessment and instruction. The California Academic Press. Retrieved from https://www.qcc.cuny.edu/socialSciences/ppecorino/CT-Expert-Report.pdf
 Falsafi-Nejad, M. R., Farrokhi, N., & Bahrami, L. (2016). Psychometric characteristics of final exams of third-grade high school and their applicability in selecting candidates for undergraduate programs. Educational Measurement Quarterly, 6(23), 45–76. [In Persian]
Forehand, M. (2010). Bloom’s taxonomy: Original and revised. In M. Orey (Ed.), Emerging perspectives on learning, teaching, and technology. Retrieved from https://cdn.vanderbilt.edu/vu-wp0/wp-content/uploads/sites/59/2010/06/12092513/BloomsTaxonomy-mary-forehand.pdf
Glass, A. L., Ingate, M., & Sinha, N. (2013). The effect of a final exam on long-term retention. Journal of General Psychology, 140(3), 224–241. https://doi.org/10.1080/00221309.2013.797379
Hackworth, R. M. (2010). Radiation science educators’ perception of obstacles in the use of critical thinking (Master’s thesis, Ohio State University). OhioLINK Electronic Theses and Dissertations Center. Retrieved from http://rave.ohiolink.edu/etdc/view?acc_num=osu1262120623
Halpern, D. F. (1998). Teaching critical thinking for transfer across domains: Dispositions, skills, structure training, and metacognitive monitoring. American Psychologist, 53(4), 449–455.  https://doi.org/10.1037/0003-066X.53.4.449
Huitt, W. (2011). Bloom et al.'s taxonomy of the cognitive domain. Educational Psychology Interactive. Valdosta, GA: Valdosta State University. Retrieved August 31, 2025, from http://www.edpsycinteractive.org/topics/cognition/bloom.html
Kennedy, M., Fisher, M. B., & Ennis, R. H. (1991). Critical Thinking: Literature Review and Needed Research. In L. Idol & B. F. Jones (Eds.), *Educational Values and Cognitive Instruction* (pp. 11–40). Routledge. https://doi.org/10.4324/9781315044392-2
Massa, S. (2014). The development of critical thinking in primary school: The role of teachers’    beliefs. Procedia - Social and Behavioral Sciences, 141, 387–392. https://doi.org/10.1016/j.sbspro.2014.05.068
  MirAghaei, A. A., Sepasi, H., Mohajeran, B., & Qalaei, A. (2015). Analysis of psychometric indices and cognitive domain levels of final exam questions in mathematics and science of third-grade middle school students in Khorramabad. School Psychology Journal, 4(3), 102–118. [In Persian]
  Nemati-Sorkhi, M., Saeedi, Z., & Sahraei, R. M. (2022). Content analysis of cultural tourism software based on the revised Bloom’s learning theory. Persian Language Teaching Research Journal for Non-Persian Speakers, 11(1), 275–302. [In Persian]
Pešić, J. (2011). Sličnosti i razlike u konceptualizovanju kritičkog mišljenja [Similarities and differences in conceptualizing critical thinking]. Psihološka istraživanja, 14(1), 5–23. https://scindeks-clanci.ceon.rs/data/pdf/0352-7379/2011/0352-73791101005P.pdf
Riddell, T. (2007). Critical assumptions: Thinking critically about critical thinking. Journal of Nursing Education, 46(3), 121–126. https://doi.org/10.3928/01484834-20070301-06.
  Seif, A. A. (2020). Measurement, assessment, and educational evaluation (7th ed.). Tehran: Doran Publishing. [In Persian]
  Sepasi, H. (2006). Analysis of cognitive domain levels and psychometric indices of final exams in Arabic, calculus, and biology of third-grade female high school students in three socio-economic regions of Khuzestan Province. Journal of Educational Sciences and Psychology, 3(4), 57–78. [In Persian]
Shohamy, E. (2013). The power of tests: A critical perspective on the uses of language tests (Reprinted). Routledge. https://doi.org/10.4324/9781315837970
Sievertsen, H. H. (2023). Assessments in education. In Oxford Research  Encyclopedia ofEconomicsandFinance.OxfordUniversityPress.https://doi.org/10.1093/acrefore/9780190625979.013.846
Vygotsky, L. S. (1962). Thought and language. Cambridge, MA: MIT Press.
Zaidi, N. L. B., Grob, K. L., Monrad, S. M., Kurtz, J. B., Tai, A., Ahmed, A. Z., Gruppen, L. D., & Santen, S. A. (2018). Pushing critical thinking skills with multiple-choice questions: Does Bloom’s taxonomy work? Academic Medicine, 93(6), 856–859. https://doi.org/10.1097/ACM.0000000000002087 . [In Persian]