Morpheme Segmentation for Russian: Evaluation of Convolutional Neural Network Models

Maltina L.; A. Malafeev

doi:10.1007/978-3-030-39575-9_16

Publications

?

Morpheme Segmentation for Russian: Evaluation of Convolutional Neural Network Models

P. 160–166.

Maltina L., Malafeev A.

This paper is aimed at evaluating the performance of existing models of morphemic analysis for Russian based on convolutional neural networks. The models were trained on a relatively small amount of annotated training data (38,368 words). We tuned the hyperparameters to accommodate the harder task setting, which helped improve the accuracy of the model. In addition to testing 15 different configurations on the available test set, a new sample of 800 words containing roots that are missing in the training sample (e.g. neologisms and recent loan words) was manually created and annotated for morphemic structure (the new dataset is made available to the community). The effectiveness of the models was evaluated on this sample, and it turned out that the performance of the CNN models was much worse on this set (an almost 30% drop in word accuracy). We performed a classification of errors made by the best model both on the standard test set and the new one.

Language: English

Full text

DOI

Keywords: convolutional neural networks model evaluation error analysis morpheme segmentation for Russian words with out-of-vocabulary roots parameter tuning морфемный анализ слов русского языка свёрточные нейронные сети

Publication based on the results of:

Эффективные методы распознавания мультимедийных данных для задач анализа предпочтений пользователей мобильных устройств (2019)

In book

Analysis of Images, Social Networks and Texts. 8th International Conference, AIST 2019, Kazan, Russia, July 17–19, 2019, Revised Selected Papers. Communications in Computer and Information Science

Vol. 1086. , Springer, 2020.

Approximate Calculation of the Generalized Erdélyi-Kober Operator Using a Cubic Spline

Shishkina E., Revista Internacional de Metodos Numericos para Calculo y Diseno en Ingenieria 2025

This article investigates the problem of approximating the generalized Erdélyi-Kober fractional operator (often referred to as the Lowndes operator) using cubic splines. A method based on cubic spline interpolation is proposed for approximating the operator on a non-uniform grid. The convergence rate of the proposed method is proven, and its stability is analyzed. Error bounds are established for functions in ...

Added: March 2, 2026

Ансамбль современных моделей компьютерного зрения для задачи обнаружения дипфейков

Pikul A. S., Безопасность информационных технологий 2024 Т. 31 № 4 С. 116–127

This article explores the potential use of modern computer vision architectures for the task of deepfake detection. The following architectures are considered: EfficientNet, Vision Transformer (ViT), VisionLSTM (ViL), Vision KAN, and Mamba Vision. The novelty of the approach lies in the application and comparison of these architectures, as well as their combination into paired ensembles ...

Added: December 12, 2025

Recognition of Mentally Pronounced Russian Phonemes Using Convolutional Neural Networks and Electroencephalography Data

Seleznev L. E., Chupakhin A. A., Kostenko V. A. et al., Optical Memory and Neural Networks (Information Optics) 2023 Vol. 32 No. 2 P. 73–85

We analyze a classification problem of mentally pronounced Russian phonemes based on data obtained by means of an electroencephalography device. We describe the data collection method as well as the methods of the obtained data processing. To solve the small sample size problem we present the augmentation techniques that use the time stretching and the ...

Added: October 2, 2025

Convolutional Neural Networks Decode Finger Movements in Motor Sequence Learning from MEG Data

Zabolotniy A., Chan R. W., Moiseeva V. et al., Frontiers in Neuroscience 2025 Vol. 19 Article 1623380

We demonstrated the feasibility of finger movement decoding with a tailored Convolutional Neural Network. The performance of our approach was comparable to complex deep learning architectures, while providing faster and interpretable outcome. This algorithmic strategy holds high potential for the investigation of the mechanisms underlying non-invasive neurophysiological recordings in cognitive neuroscience. ...

Added: October 2, 2025

From challenges to solutions: Investigating academic writing errors to enhance curriculum focus

Попова А. О., Stognieva O., Педагогика и психология образования 2025 No. 2 P. 184–198

Studying academic writing equips university students with the skills to effectively communicate their research, arguments, and findings in a clear and structured manner, in the English as a Foreign Language (EFL) context. Within the framework of the educational paradigm of the National Research University Higher School of Economics (HSE), the development of academic writing skills ...

Added: September 27, 2025

Разработка архитектуры классификатора для оценки состояния объектов инфраструктуры с применением нейронных сетей

Moiseev N., Абрамов И. А., Камакин А. Ю., В кн.: Параллельные вычислительные технологии – XIX всероссийская конференция с международным участием, ПаВТ'2025, г. Москва, 8–10 апреля 2025 г. Короткие статьи и описания плакатов.: Челябинск: Издательский центр ЮУрГУ, 2025. С. 301–301.

In recent years, with the advancement of deep learning and neural network methods, their application in geospatial analysis tasks has become particularly relevant. A key challenge in this field is assessing the state of urban infrastructure, including the classification of buildings by their functional purpose (residential, commercial, governmental, industrial). The use of neural networks significantly ...

Added: September 17, 2025

BERT-like Models for Slavic Morpheme Segmentation

Morozov D., Astapenka L., Glazkova A. et al., , in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)Vol. 1: Long papers.: Association for Computational Linguistics, 2025. P. 6795–6815.

Automatic morpheme segmentation algorithms are applicable in various tasks, such as building tokenizers and language education. For Slavic languages, the development of such algorithms is complicated by the rich derivational capabilities of these languages. Previous research has shown that, on average, these algorithms have already reached expert-level quality. However, a key unresolved issue is the ...

Added: July 17, 2025

Automatic Morpheme Segmentation for Russian: Can an Algorithm Replace Experts?

Morozov D., Garipov T., Lyashevskaya O. et al., Journal of Language and Education 2024 Vol. 10 No. 4 P. 71–84

Introduction: Numerous algorithms have been proposed for the task of automatic morpheme segmentation of Russian words. Due to the differences in task formulation and datasets utilized, comparing the quality of these algorithms is challenging. It is unclear whether the errors in the models are due to the ineffectiveness of algorithms themselves or to errors and inconsistencies ...

Added: January 7, 2025

Proceedings Volume 11605, Thirteenth International Conference on Machine Vision

Teplyakov L., Kaymakov K., Shvets E. et al., SPIE, 2021.

Line detection is an important computer vision task traditionally solved by Hough Transform. With the advance of deep learning, however, trainable approaches to line detection became popular. In this paper we propose a lightweight CNN for line detection with an embedded parameter-free Hough layer, which allows the network neurons to have global strip-like receptive fields. ...

Added: November 5, 2024

Lightweight and Elegant Data Reduction Strategies for Training Acceleration of Convolutional Neural Networks

Demidovskij A., Artyom Tugaryov, Aleksei Trutnev et al., Mathematics 2023 Vol. 14 No. 11 Article 3120

Due to industrial demands to handle increasing amounts of training data, lower the cost of computing one model at a time, and lessen the ecological effects of intensive computing resource consumption, the job of speeding the training of deep neural networks becomes exceedingly challenging. Adaptive Online Importance Sampling and IDS are two brand-new methods for ...

Added: September 12, 2023

MobileEmotiFace: Efficient Facial Image Representations in Video-Based Emotion Recognition on Mobile Devices

Demochkina P., Savchenko A., , in: Pattern Recognition. ICPR International Workshops and Challenges. Virtual Event, January 10–15, 2021, Proceedings, Part V.: Springer, 2021. P. 266–274.

In this paper, we address the emotion classification problem in videos using a two-stage approach. At the first stage, deep features are extracted from facial regions detected in each video frame using a MobileNet-based image model. This network has been preliminarily trained to identify the age, gender, and identity of a person, and further fine-tuned ...

Added: April 10, 2022

Application of an Optical Sensor for Fracture and Chips Control on Metal Surfaces

Tuv A., Akatov M., Starostenko V. et al., , in: Proceedings of the 2021 IEEE International Conference "Quality Management, Transport and Information Security, Information Technologies" (IT&QM&IS).: IEEE, 2021. Ch. 6 P. 345–348.

Added: January 25, 2022

ОСОБЕННОСТИ ПРИМЕНЕНИЯ ПРЕДОБУЧЕННЫХ СВЁРТОЧНЫХ НЕЙРОННЫХ СЕТЕЙ К ЗАДАЧАМ СТЕГОАНАЛИЗА ГРАФИЧЕСКИХ ИЗОБРАЖЕНИЙ

Терещенко С. Н., Perov A., Осипов А. Л., Автометрия 2021 № 4 С. 98–105

Исследовано использование свёрточных нейронных сетей в целях анализа контейнера графических изображений на наличие данных, внедрённых методами стеганографии. Показано, что глубокая свёрточная нейронная сеть обучается классифицировать присутствие скрытых данных в графических изображениях, достигая точности по метрике weighted AUC, равной 0,928. Проверена гипотеза об эффективности применения концепции «transfer learning» в сфере стеганографии. Эффективность предложенной технологии продемонстрирована на ...

Added: November 17, 2021

Touching the Limits of a Dataset in Video-Based Facial Expression Recognition

Churaev E., Savchenko A., , in: 2021 International Russian Automation Conference (RusAutoCon).: IEEE, 2021. P. 633–638.

In this paper, we examine the issue of video-based facial emotion recognition algorithms which show excellent performance on some benchmarks, but have much worse accuracy in practical applications. For example, the typical error rate of contemporary deep neural networks on the RAVDESS dataset is less than 5%. We argue that such results are obtained only ...

Added: October 7, 2021

Russian SuperGLUE 1.1: Revising the Lessons not Learned by Russian NLP-models

Fenogenova A., Tikhonova M., Mikhailov V. et al., , in: Computational Linguistics and Intellectual Technologies: Papers from the Annual International Conference “Dialogue” (2021)Issue 20: Основной том.: -, 2021. Ch. 21 P. 235–245.

In the last year, new neural architectures and multilingual pre-trained models have been released for Russian, which led to performance evaluation problems across a range of language understanding tasks. This paper presents Russian SuperGLUE 1.1, an updated benchmark styled after GLUE for Russian NLP models. The new version includes a number of technical, user experience ...

Added: September 5, 2021

Some Features of Sentiment Analysis for Russian Language Posts and Comments from Social Networks

Sidorov Nikita, Slastnikov Sergey, Journal of Physics: Conference Series 2021 Vol. 1740 P. 1–6

Sentiment analysis of different language texts is one of the very popular machine learning tasks. The complexity of its solution depends both on the characteristics of a particular language, and on the length of the evaluated texts. In our work, we consider the task of creating a sentiment analysis software tool for Russian posts and ...

Added: February 2, 2021

Deep convolutional neural networks capabilities for binary classification of polar mesocyclones in satellite mosaics

Криницкий М. А., Verezemskaya P., Гращенков К. В. et al., Atmosphere 2018 Vol. 9 No. 426 P. 1–23

Polar mesocyclones (MCs) are small marine atmospheric vortices. The class of intense MCs, called polar lows, are accompanied by extremely strong surface winds and heat fluxes and thus largely influencing deep ocean water formation in the polar regions. Accurate detection of polar mesocyclones in high-resolution satellite data, while challenging, is a time-consuming task, when performed ...

Added: November 26, 2020

Analysis of Parameters of Optical Linear Displacement Transducer with Open Channel

Akatov Maxim S., Safonov Sergey N., Tuv Alexander L., , in: Proceedings of the 2020 IEEE International Conference "Quality Management, Transport and Information Security, Information Technologies" (IT&QM&IS).: IEEE, 2020. Ch. 6 P. 170–172.

This article describes the structure and principle of operation of the optical linear displacement sensor with open optical channel. Scope of application in industrial equipment for measuring vibration parameters is determined. Analysis of errors caused by external illumination of the optical channel and a change in the reflection coefficient of the object surface is accomplished. ...

Added: November 11, 2020