Language Exercise Generation: Emulating Cambridge Open Cloze

A. Malafeev

doi:10.4018/IJCSSA

Publications

?

Language Exercise Generation: Emulating Cambridge Open Cloze

International Journal of Conceptual Structures and Smart Applications (IJCSSA). 2014. Vol. 2. No. 2. P. 20–35.

Malafeev A.

This article presents an approach to the automatic generation of open cloze exercises based on arbitrary English text. The exercise format is similar to the open cloze test used in Cambridge English certificate exams (FCE, CAE, CPE). The presented method also makes it possible to adjust the difficulty of the resulting exercises to better suit specific proficiency levels. Three experiments were conducted to evaluate the usefulness of the machine-generated exercises, compare them with authentic Cambridge English tests and study the difficulty-setting capabilities. The experiments showed that the generation method used was quite effective. With some customization, the method can be applied to generating similar exercises for other languages.

Research target: Philology and Linguistics Education Computer Science

Priority areas: IT and mathematics

Keywords: natural language processing преподавание английского языка автоматическая обработка естественного языка автоматическая обработка текста автоматическое создание упражнений тесты open cloze open cloze computer-assisted language learning CALL English as a foreign language EFL language exercise generation adjusting exercise difficulty изучение иностранных языков с помощью компьютера английский язык как иностранный регулировка сложности упражнений

No ‘iota’ type-shifter in Kazym Khanty

Tiutiunnikova V., Mikhailov Stiopa, Golosov F., Proceedings of Sinn und Bedeutung (Германия) 2025 No. 29 P. 1593–1608

In this paper, we present new challenging data from Kazym Khanty (a Uralic language spoken in Western Siberia, Russia): in this articleless language, bare singular and bare dual NPs in argument positions can receive indefinite readings on par with definite ones, contradicting the predictions of the classic neo-Carlsonian approach (Chierchia, 1998; Dayal, 2004). We argue ...

Added: January 30, 2026

Употребление порядковых числительных в разных семантических контекстах (на материале параллельных переводов Нового Завета)

Nasledskova P., Известия РАН. Серия литературы и языка 2025 Т. 84 № 6 С. 88–102

Работа посвящена сравнению употребления порядковых конструкций в разных семантических контекстах в пяти языках: русском, английском, испанском, индонезийском и рутульском. Сравнение проведено на материале параллельных переводов Нового завета. Из шести книг Нового Завета (канонические Евангелия, Деяния апостолов и Откровение Иоанна Богослова) были выбраны стихи, в которых хотя бы в одном из языков выборки употреблены порядковые числительные. ...

Added: January 29, 2026

Метод преобразования речевого сигнала для улучшения разборчивости речи

Savchenko L., Савченко В. В., Радиотехника и электроника 2025 Т. 70 № 8 С. 753–760

The problem of improving speech intelligibility in voice communication systems is considered. The acute issue of speaker recognition when applying known methods for solving this problem is highlighted. To overcome the specified problem, a new method for transforming the speech signal based on an autoregressive model of the vocal tract and the principle of frequency-selective ...

Added: January 29, 2026

Specification Tests for Jump-Diffusion Models Based on the Characteristic Function

Belomestny D., Grobler G. L., Meintanis S. G. et al., International Statistical Review 2026 P. 1–31

Goodness-of-fit tests are suggested for several popular jump-diffusion processes. The suggested test statistics utilise the marginal characteristic function of the model and its L2-type discrepancy from an empirical counterpart. Model parameters are estimated either by minimising the aforementioned L2-type discrepancy or by maximum likelihood. A hybrid estimation method that uses moment estimation is also proposed ...

Added: January 29, 2026

Барьеры реализации трансформации в университетах и факторы их преодоления: взгляд руководителей вузов и экспертов

Krestinin V., Университетское управление: практика и анализ 2025 Т. 29 № 4 С. 129–143

The aim of this article is to analyze the institutional barriers to implementing transformation in universities, as well as the factors enabling their overcoming, through the lens of those leading these processes. To this end, the author conducted 22 in-depth expert interviews with university leaders (primarily rectors and vice-rectors) participating in the “Priority 2030” program, ...

Added: January 29, 2026

Применение технологий ИИ в обучении студентов в рамках дисциплины «Академическое письмо на английском языке»

Gabrielova E., Магия ИННО 2025 Т. 7 № 1 С. 165–172

Artificial intelligence (AI) technologies are rapidly developing and are being widely applied in various fields, including education. The use of AI carries certain risks; however, one cannot completely reject it in student education. The article presents the experience of using AI in teaching English to 34 fourth-year students and 26 post-graduate students within the discipline ...

Added: January 29, 2026

Explorations in Applied Ethnolinguistics: Words, Cultures, and Global Perspectives

Palgrave Macmillan, 2025.

This volume contributes to the growing body of cutting-edge research into the Natural Semantic Metalanguage (NSM) approach in linguistics. It explores the broad range of possible applications enabled by the NSM approach, from linguistic studies of semantics and culture to cross-cultural studies, psychology and childhood education. The volume builds on previous studies, bringing a diversity ...

Added: January 28, 2026

Эпос о Гильгамеше. Перевод Николая Гумилева. Предисловие Е. Маркиной. Введение В. Шилейко.

Markina E., Манн, Иванов и Фербер, 2025.

Аннотация издателя: «Эпос о Гильгамеше» — древнейший памятник мировой литературы, дошедший до нас из глубин шумерской и аккадской цивилизаций. Поэма повествует о приключениях могущественного царя города Урука и его друга Энкиду. Это история о силе и дружбе, гордыне и смирении, страхе смерти и жажде бессмертия. Поэма издается в переводе поэта-акмеиста Николая Гумилева с пояснительной статьей ассириолога и современника поэта Владимира Шилейко, ...

Added: January 28, 2026

An Analysis of Sequential Patterns in Datasets for Evaluation of Sequential Recommendations

Klenitskiy A., Володкевич А. А., Pembek A. et al., ACM Transactions on Recommender Systems 2026

Sequential recommender systems are an important and in-demand area of research. These systems aim to use the order of interactions in a user’s history to predict future interactions. The premise is that the order of interactions and sequential patterns play an essential role. Therefore, it is crucial to use datasets that exhibit a sequential structure ...

Added: January 28, 2026

Исследование отношения учителей к цифровым образовательным технологиям: качественный анализ

Bushina E., Muminova A., Фахретдинова А. П., Население и экономика 2026 Т. 21 № 1 С. 1–21

Digitalization and digital technologies have become an integral part of the education system, radically transforming both teaching methods and the role of teachers. Therefore, examining teachers’ attitudes toward digitalization is essential for enhancing the effectiveness of education and ensuring the successful implementation of technologies in the educational process. Teachers’ attitudes are shaped by various factors, ...

Added: January 27, 2026

Making a Difference: Stories of Inspirational Teachers

Breffney Morgan, О'Каллаган П., Лоулор П., SuperGeneration, 2012.

A compilation of stories of inspirational teachers, compiled by Philip O’Callaghan & Padraig Lawlor, co-authored by Brendan O’Carroll, Eileen Dunne, Breffny Morgan, Rachel Allen, Ruairi Quinn, and Terry Prone. ...

Added: January 27, 2026

Semi-fake indexicals in Russian

Тискин Д. Б., Типология морфосинтаксических параметров 2025 Vol. 8 No. 1 P. 112–129

There are several rival theories of fake indexicals, i.e. bound indexicals (prominently pronouns) whose φ-features do not semantically contribute to focus alternatives (e.g. Only Mary did her homework, John didn’t do his). According to Minimal Pronoun theories (such as Kratzer’s or Wurmbrand’s), bound pronouns are Merged without φ-features and acquire them under binding via agreement-like ...

Added: January 26, 2026

Iterative Ricci-Foster Curvature Flow with GMM-Based Edge Pruning: A Novel Approach to Community Detection

Sorokin K., Beketov M., Онучин А. et al., / arxiv.org. Серия cs.SI "Social and Information Networks ". 2025.

Community detection in complex networks is a fundamental problem, open to new approaches in various scientific settings. We introduce a novel community detection method, based on Ricci flow on graphs. Our technique iteratively updates edge weights (their metric lengths) according to their (combinatorial) Foster version of Ricci curvature computed from effective resistance distance between the ...

Added: January 15, 2026

Implementing Transport Coding in OMNeT++ for Message Delay Reduction

Petrovanov I., Sergeev A., / Series Computer Science "arxiv.org". 2025. No. 2512.18332.

Transport coding reduces message delay in packet-switched networks by introducing controlled redundancy at the transport layer: original packets are encoded into coded packets, and the message is reconstructed after the first successful deliveries, effectively shifting latency from the maximum packet delay to the -th order statistic. We present a concise, reproducible discrete-event implementation of transport coding in OMNeT++, including ...

Added: December 24, 2025

Hessian-based lightweight neural network for brain vessel segmentation on a minimal training dataset

Меньшиков И. А., Бернадотт А. К., Elvimov N. S., / Series arXie "Statistical mechanics". 2025.

Accurate segmentation of blood vessels in brain magnetic resonance angiography (MRA) is essential for successful surgical procedures, such as aneurysm repair or bypass surgery. Currently, annotation is primarily performed through manual segmentation or classical methods, such as the Frangi filter, which often lack sufficient accuracy. Neural networks have emerged as powerful tools for medical image ...

Added: December 1, 2025

Determining the boundary of dynamical chaos in the generalized Chirikov map via machine learning

Чернышов Д. П., Satanin A., Shchur L., / Series arXiv "math". 2025.

We investigate the boundary separating regular and chaotic dynamics in the generalized Chirikov map, an extension of the standard map with phase-shifted secondary kicks. Lyapunov maps were computed across the parameter space (K,K(α, τ)) and used to train a convolutional neural network (ResNet18) for binary classification of dynamical regimes. The model reproduces the known critical ...

Added: November 21, 2025

Эффективный алгоритм торговли на фондовом рынке: ретроспективный анализ, основанный на данных по S&P-500.

Rubchinskiy A., Chubarova D., / Series WP7 "Математические методы анализа решений в экономике, бизнесе и политике". 2025. No. WP7/2025/01.

The article examines one of the most famous examples of socio-economic systems, characterized by significant uncertainty – the S&P-500 stock market, where shares of 500 largest US companies are traded. No assumptions are made about the probabilistic characteristics of the stock market. A flexible algorithm for daily trading has been developed, based on both known fixed data ...

Added: November 9, 2025

Diffusion on language model embeddings for protein sequence generation

Meshchaninov V., Strashnov, P., Shevtsov A. et al., / Cornell University. Серия CoRR, arXiv:2403.03726 "Computing Research Repository,". 2025.

Protein design requires a deep understanding of the inherent complexities of the protein universe. While many efforts lean towards conditional generation or focus on specific families of proteins, the foundational task of unconditional generation remains underexplored and undervalued. Here, we explore this pivotal domain, introducing DiMA, a model that leverages continuous diffusion on embeddings derived ...

Added: October 5, 2025

Smoothie: Smoothing Diffusion on Token Embeddings for Text Generation

Shabalin A., Meshchaninov V., Vetrov D., / Series cs.CL, arXiv:2505.18853 "Computation and Language". 2025.

Diffusion models have achieved state-of-the-art performance in generating images, audio, and video, but their adaptation to text remains challenging due to its discrete nature. Prior approaches either apply Gaussian diffusion in continuous latent spaces, which inherits semantic structure but struggles with token decoding, or operate in categorical simplex space, which respect discreteness but disregard semantic ...

Added: October 5, 2025

Compressed and Smooth Latent Space for Text Diffusion Modeling.

Meshchaninov V., Chimbulatov E., Shabalin A. et al., / Series cs.CL, arXiv:2506.21170 "Computation and Language". 2025.

Autoregressive language models dominate modern text generation, yet their sequential nature introduces fundamental limitations: decoding is slow, and maintaining global coherence remains challenging. Diffusion models offer a promising alternative by enabling parallel generation and flexible control; however, their application to text generation is hindered by the high dimensionality of token-level representations. We introduce COSMOS, a ...

Added: October 5, 2025

New Concept of Teaching English to Students from Non-English Speaking Countries

Rintaningrum R., Kavgić A., Garaeva M. et al., Emerging Science Journal 2023 Vol. 7 No. 6 P. 2202–2215

The objective of this study is to compare and underscore the advantages and disadvantages of learning English in non-English speaking countries to propose the concept of a new English teaching method for students from non-English speaking countries (the case of Russia, Spain, Serbia, and Indonesia). The study used a mixed-methods approach with qualitative analysis of ...

Added: September 29, 2025

A Feature Engineering Framework for Computer Vision Based on Topological Data Analysis

Абрамов А. С., Chernyshev V. L., Mikhaylets E. et al., / Series Social Science Research Network "Social Science Research Network". 2025.

Computer vision is one of the most relevant modern research areas with broad practical applications. However, traditional solutions based on deep learning have signicant limitations and can be misleading. Topological data analysis, on the other hand, is a modern approach to solving similar problems using mathematically deterministic methods of algebraic topology that reduce the risk ...

Added: September 23, 2025

On the construction of frieze patterns from partitions of convex polygons by nonintersecting diagonals

Kochetkov Y., / Series arXiv.org e-print archive "arXiv.math". 2025. No. 07600.

We demonstrate in an elementary way how to construct a frieze pattern of width m-3 from a partition of a convex m-gon by not intersecting diagonals. ...

Added: September 17, 2025

On one property of Catalan numbers

Kochetkov Y., / Series arXiv.org e-print archive "arXiv.math". 2025. No. 20584.

We give a new proof of the following statement: the Catalan number C_n is divisible by n+2, if n is odd and n<> 3k+1. ...

Added: September 9, 2025