• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Articles
  • Large Language Model-Based Automated Item Generation in STEM Assessments: Historical Mapping and a Scoping Review of Empirical Studies
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
June 11, 2026
Doctoral Student at HSE University Reveals Hidden Layout of Ancient Parion
İdil Malgil, a researcher at HSE University, conducted a UAV-based LiDAR survey of the ancient Roman city of Parion in present-day Turkey. The high density of the scans allowed the team to detect subtle terrain features concealed beneath the ground and vegetation. The survey revealed traces of entire neighbourhoods, terraced structures, and walls that had remained invisible during routine excavations and could not be identified through aerial photography. The findings have been published in Ancient Civilizations from Scythia to Siberia.
June 11, 2026
Mathematicians from Nizhny Novgorod and Shanghai Study System Stability
Mathematicians at HSE University–Nizhny Novgorod, in collaboration with colleagues from Tongji University in Shanghai, are investigating the fundamental causes of structural stability in systems and the mechanisms underlying its disruption. In this interview with the HSE News Service, Prof. Olga Pochinka, Head of the International Laboratory of Dynamical Systems and Applications at HSE University–Nizhny Novgorod and leader of the project ‘Qualitative Theory of Systems of Ordinary and Partial Differential Equations,’ discusses the project, which is being implemented as part of HSE University's International Academic Cooperation programme.
June 11, 2026
Neurolinguists Assist in Awake Surgery on 11-Year-Old Patient with Epilepsy
Researchers at the HSE Centre for Language and Brain took part in a rare awake neurosurgical procedure performed on an 11-year-old patient with drug-resistant epilepsy. Working alongside surgeons at the Voyno-Yasenetsky Centre of Specialised Medical Care for Children in Solntsevo, they monitored the resection of a portion of the left temporal lobe, where the epileptic focus had been identified.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Large Language Model-Based Automated Item Generation in STEM Assessments: Historical Mapping and a Scoping Review of Empirical Studies

JOURNAL OF EDUCATIONAL TECHNOLOGY DEVELOPMENT AND EXCHANGE. 2026. Vol. 19. No. 2. P. 141–165.
Omopekunola M.

Educational assessments, from low-stakes classroom tests to high-stakes national examinations, require item pools that are valid, fair, and secure. Automated Item Generation (AIG) aims to efficiently produce large pools of calibrated test items. This paper adopts a two-part design: (1) a brief historical mapping situating LLM-based AIG within the broader AIG trajectory; and (2) a scoping review of empirical studies on LLM-based AIG for STEM assessments, published between January 2022 and January 2026. A structured search of ERIC, Lens and OpenAlex yielded 1,267 records; after deduplication and screening, 7 studies were retained for synthesis. In all studies, LLMs were primarily used to draft stems, keys, distractors, and explanations by instruction-tuned prompting, sometimes enhanced with retrieval and human-in-the-loop review. Empirical evidence on item quality is generally promising. Multiple investigations have documented acceptable expert evaluations and, in a subset of studies, psychometric properties comparable to those of human-authored items. Nevertheless, recurrent limitations have been observed, including factual inaccuracies, construct drift, low calibration of item difficulty, and variable distractor plausibility. Few studies reported robust fairness audits or provided reproducible details, such as complete prompts and decoding settings. In general, LLM-based AIG can substantially increase throughput in STEM item development, but high-stakes deployment requires layered validation protocols (expert review, pilot testing, psychometrics, and bias audits) and governance controls to ensure traceability and item security.

Research target: Education Psychology
Language: English
DOI
Text on another site
Keywords: automated item generationLarge language models (LLM)prompt engineeringHigh-stakes assessmentPsychometric validation
Similar publications
Психология коучинга: методология, теория, практика. Доказательный подход
Чебоксары: ИД «Среда», 2026.
Added: June 12, 2026
Методология современных лонгитюдных исследований чтения
Bakay E., Antipkina I., Kuznetsova M. I., Отечественная и зарубежная педагогика 2026 Т. 1 № 3(115) С. 6–20
The article analyzes the challenges and prospects of longitudinal research in the field of reading. It examines the systemic limitations of classical longitudinal designs: the problem of unobserved variables, causality, and violations of metric invariance, which lead to erroneous or incomplete interpretation of results. The transition to hybrid research designs is substantiated as a promising ...
Added: June 11, 2026
Особенности организационной тревоги при сделках слияния и поглощения
Лицкевич И. А., Evdokimenko A., Журнал клинического и прикладного психоанализа 2020 Т. 1 № 4 С. 157–178
В исследовании представлен клинический подход к изучению сделок слияния и поглощения. Подробно рассмотрены бессознательные процессы проживания сотрудниками горя, которое может негативно влиять на результаты и показатели проведения организационных изменений. В тоже время учет психодинамического подхода к изучению подобных процессов и применение мер по предотвращению негативного влияния могут положительно сказаться на эффективности проводимых процедур слияния и ...
Added: June 10, 2026
Подходы к изучению просоциального поведения и альтруизма
Rean A., Шевченко А. О., Ставцев А. А. et al., Российский девиантологический журнал 2026 Т. 6 № 1 С. 92–104
Abstract Introduction. The development and refinement of psychological and pedagogical strategies for the prevention of aggressive and destructive behavior require an understanding of the mechanisms of helping (prosocial) behavior and altruism as important components of an individual’s social adaptation. Prosocial behavior is viewed as a broader phenomenon encompassing various motivational forms, whereas altruism is interpreted ...
Added: June 9, 2026
Ценностная структура и материальный статус семьи как индикаторы и ресурсы устойчивости молодежи в трудной жизненной ситуации
Rean A., Шевченко А. О., Ставцев А. А. et al., Социальная психология и общество 2025 Т. 16 № 4 С. 49–70
Context and relevance. Values play a significant role in socialization processes and can serve as a psychological resource in difficult life situations under conditions of limited material wellbeing. Theoretical foundations of the study include the value models of Sh. Schwartz and R. Inglehart, as well as the concept of economic socialization. Objective. To identify the ...
Added: June 9, 2026
Video games as stimuli in neuroimaging studies: a minireview
Blank I., Klucharev V., Shestakova A., Frontiers in Human Neuroscience 2026 Vol. 20 Article 1687121
In video games, the participants are active agents who pursue various goals within gaming environments that increasingly resemble real life. As a result, video games are increasingly offering tools for neuroimaging studies aiming to elucidate the neural basis of human perceptual, cognitive, and emotional functions. Here, we review these studies. The first studies used computerized ...
Added: June 6, 2026
Психологические издержки традиционной модели государственного управления (на материале органов прокуратуры)
Ахмедова М. С., Orlov A. B., Вопросы психологии 2025 Т. 71 № 4 С. 41–51
The article reflects problems of functioning of procuratorial bodies in the traditional paradigm of public administration, leading to professional burnout and professional deformation of employees. Modern Russian and foreign science are at the stage of awareness and identification of systemic problems in the modern culture of public administration, but the search for ways to solve them ...
Added: June 5, 2026
Понятие образовательной услуги в российском законодательстве
Линская Ю. В., Вестник Санкт-Петербургского университета. Серия 14. Право 2023 № 4 С. 844–853
The article examines the concept of “educational service” in the context of the public discussion that has arisen about the possibility of its use in relation to education and educational relations. The current situation with the gradual abandonment of the term in the current educational legislation is considered, which corresponds to the provisions of the ...
Added: June 5, 2026
Научно-практический комментарий к Закону об образовании в Российской Федерации
Издательство Санкт-Петербургского университета, 2023.
his book analyzes the application of Federal Law "On Education in the Russian Federation" over a ten-year period. The authors demonstrate the effect of the law's provisions defining new or updated institutions. Particular attention is paid to e-learning and new distance learning technologies, which are being actively implemented and discussed in Russian society due to ...
Added: June 5, 2026
Системы управления высшим образованием в России и Китайской Народной Республике: сравнительно-правовой аспект
Линская Ю. В., Вестник Санкт-Петербургского университета. Серия 14. Право 2025 Т. 16 № 2 С. 439–455
The purpose of this article is to compare the higher education systems in the Russian Federation and the People’s Republic of China (PRC), the specifics of legal regulation and government programs to support higher education. In recent years, the two countries have been developing dynamic cooperation and partnership in almost all spheres of life, including ...
Added: June 5, 2026
МОДЕЛЬ МАГИСТРАТУРЫ НА ОСНОВЕ ЗАДАЧНО-МОДУЛЬНОГО ПОДХОДА И АВТОМАТИЗИРОВАННОЙ ОЦЕНКИ ОБРАЗОВАТЕЛЬНЫХ РЕЗУЛЬТАТОВ
Адамский А. И., Kolachev N., Подболотова М. И. et al., М.: МГПУ, 2026.
Монография раскрывает концептуальный и технологический аспекты задачномодульного подхода к построению образовательной программы магистратуры, опосредованного использованием искусственного интеллекта для автоматизации оценивания компетентности обучающихся. Издание предназначено для руководителей и преподавателей магистратуры, ориентировано на подготовку кадров для сферы образования. Может быть использовано в качестве учебного материала для слушателей курсов повышения квалификации и профессиональной переподготовки. ...
Added: June 5, 2026
Путевые заметки участников экспедиционного выезда группы «Социокультурная психология и антропология» (Республика Бурятия, 2025 год)
Obukhov A., Вершок О., Володина В. et al., Исследователь/Researcher 2025 № 3-4 С. 300–339
The article synthesizes the field notes of participants from the “Sociocultural Psychology and Anthropology” expedition group of Vernadsky School No. 1553 during their trip to Buryatia in the summer of 2025. Research was conducted in the Ulyunkhan Ulus in the upper reaches of the Barguzin River, with the final conference held in Tankhoy on the ...
Added: June 2, 2026
Значимые изменения в улусе Улюнхан, произошедшие за 20 лет: транзитивность традиционной культуры бурят и эвенков
Obukhov A., Маерле М. А., Минаева Е. И., Исследователь/Researcher 2025 № 3-4 С. 282–299
The article presents a preliminary synthesis of materials from an expedition conducted in the Ulyunkhan ulus, an Evenk settlement in the Kurumkansky District of the Republic of Buryatia, in 2025, comparing them with materials from a 2005 expedition. The study focuses on the changes over the past twenty years as perceived by the local residents themselves (Buryats and Evenks). It compares ...
Added: June 2, 2026
Итоги экспедиции «Человек в гармонии с природой: взаимодействие школ Республики Бурятия с особо охраняемыми природными территориями»
Obukhov A., Исследователь/Researcher 2025 № 3-4 С. 83–131
The article describes the experience of preparing and conducting a student research expedition to Buryatia as part of the HSE University “Rediscovering Russia” program, themed “Man in Harmony with Nature: Interaction of Schools in the Republic of Buryatia with Protected Areas”. The expedition was prepared and carried out in cooperation with the D. Banzarov Buryat State ...
Added: June 2, 2026
Феномен устойчивых во времени неформальных разновозрастных сообществ, осуществляющих воспитательную работу с подростками
Obukhov A., Кириллова К. Б., Исследователь/Researcher 2025 № 3-4 С. 26–48
This article investigates the phenomenon of enduring informal multi-age communities contributing to adolescent upbringing. The conditions for the establishment and the factors ensuring the endurance of such communities are identified through the example of three organizations: the “Nadezhda” unit, the Children and Youth Organization “Ostrov Sokrovishch” (Treasure Island), and the “Karavella” (Caravel) unit. As for ...
Added: June 2, 2026
Use Case 5: LLM-driven creation of natural hazard geodatabase from digital mass media
Derkacheva A., Sakirkina M., Kraev G. et al., , in: AI for good innovate for impact report 2025.: Geneva: International Telecommunication Union, 2025. P. 167–169.
Added: May 26, 2026
Об идеологических предвзятостях генеративного ИИ: Российско-украинский конфликт в репрезентации ChatGPT
Baysha O., Trofimov V., Российская школа связей с общественностью 2026 № 40 С. 171–191
A growing number of scholars are warning about the dangers of the reproduction by generative AI of socio-political and ideological biases absorbed by models from the texts on which they were trained. If a given model was trained on Western media texts, it may generate narratives that reproduce West centric views of world events. This ...
Added: April 21, 2026
Сопоставление номенклатур товаров ресторанов и поставщиков с помощью LLM — Case Study для ресторанного холдинга
Jin S., Panfilov P., Сулейкин А. С., Труды Института системного программирования РАН 2025 Т. 37 № 6 С. 163–176
In the modern restaurant business, accurate mapping of product nomenclatures between restaurants and suppliers is a critical task. Effective inventory management and procurement optimization directly impact business profitability. With the increase in suppliers and product variety, traditional mapping methods become less efficient. This study proposes using large language models (LLM) to automate and improve the ...
Added: April 17, 2026
Learning When to Personalize: LLM Based Playlist Generation via Query Taxonomy and Classification
Buzaev F., Пугачёва Д. В., Sukharev I. et al., Transactions of the Association for Computational Linguistics 2026 P. 51–57
Playlist generation based on textual queries using large language models (LLMs) is becoming an important interaction paradigm for music streaming platforms. User queries span a wide spectrum from highly personalized intent to essentially catalog-style requests. Existing systems typically rely on non-personalized retrieval/ranking or apply a fixed level of preference conditioning to every query, which can ...
Added: April 7, 2026
Large Language Models as Political Actors: Cultural Bias and Epistemic Power
Seredkina E., Seletkova G., Mikhailovsky A., Technology and Language 2026 Vol. 7 No. 1 P. 63–79
The rapid diffusion of Large Language Models (LLMs) into socially and politically sensitive domains raises critical questions about the nature and origins of political bias in artificial intelligence. While existing research often treats bias as a technical flaw to be minimized, this article advances a broader philosophical and cultural interpretation of LLM bias as an ...
Added: April 1, 2026
Validating the Russian Adult Prosocialness Behavior Scale: Weighted Factor Analysis, Sex Invariance, and Normative Benchmarks
Mikhaylova Oxana, Bochaver Alexandra, Current Psychology 2026 Vol. 45 Article 712
Prosocial behavior measures validated across diverse cultural contexts remain limited. We validated the Adult Prosocialness Behavior Scale (APBS) in 7,965 Russian adults (52.0% women; Mage = 42.4, SD = 12.1) using sampling weights to approximate population representativeness. Split-sample analyses supported a two-factor structure with correlated Prosocial Actions and Prosocial Feelings dimensions (χ² = 2,754.87, CFI = .978, TLI ...
Added: March 11, 2026
Промпт-инжиниринг как ключевая компетенция в образовании: сущность, особенности и подходы к оцениванию
Davlatova M., Сперанская М. В., Высшее образование в России 2026 Т. 35 № 2 С. 53–73
In the context of the rapid development of Generative Artificial Intelligence (GenAI), prompt engineering is becoming a key competence for effective interaction with large language models in educational settings. However, the lack of a unified understanding of its nature, structure, and assessment tools complicates its integration into educational practice.    The aim of this study is to ...
Added: March 7, 2026
Can Large Language Models Develop High-Stakes Physics Exam Items? A Comprehensive Study of Cognitive and Psychometric Efficacy
Moses Oluoke Omopekunola, Elena Yu. Kardanova, Journal of Science Education and Technology 2026
High-stakes assessment is crucial for evaluating student performance and making significant educational decisions. Traditionally, the development of test items for such examinations has relied on manual development by subject matter experts. However, Automated Item Generation (AIG) using Large Language Models (LLMs) has emerged as a promising alternative, though systematic research on their application in high-stakes ...
Added: January 16, 2026
Многоаспектная оценка методов адаптации токенизатора для больших языковых моделей на русском языке
Андрющенко Г. Д., Godunova M., Иванов В. В. et al., Доклады Российской академии наук. Математика, информатика, процессы управления (ранее - Доклады Академии Наук. Математика) 2025 Т. 527 С. 320–331
Large language models (LLMs) pretrained on English-centered corpora have biases and perform sub-optimally on other natural languages. Adaptation of LLMs vocabulary provides a resource-efficient way to improve the quality of a pretrained model. Previously proposed adaptation techniques focus on performance (accuracy) and size metrics (fertility), ignoring other aspects in comparison, such as inference latency, compute ...
Added: January 15, 2026
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit