Text classification with deep learning neural networks

?

Text classification with deep learning neural networks

P. 362–370.

Voronkov Ilia, Amajd M., Kaimuldenov Z.

In this paper, we analyze the use of different neural networks for the
text classification task. The accuracy of the studied text classifiers can be
changed by a small number of previously classified texts. This is important due
to the fact that in many applications of text classification a large number of un-
labeled texts are easily accessible, while the receipt of marked texts is quite a
difficult task. The paper also shows that the convolution neural network can
work better at the level of words, and does not require knowledge of the syntac-
tic or semantic structure of the language. On the other hand, a recurrent neural
network for the level of data representation in the form of a sequence can effec-
tively classify the text. Experimental results obtained for text corpora from two
different sources show that using a vector data representation can also improve
the accuracy of the classification.

Language: English

Text on another site

In book

Actual Problems of System and Software Engineering 2017. Proceedings of the 5th International Conference on Actual Problems of System and Software Engineering Supported by Russian Foundation for Basic Research. Project #17-07-20565 Moscow, Russia, November 14-16, 2017, 408 P.

Vol. 1989. , Aachen: CEUR Workshop Proceedings, 2017.

Дискриминативная лемматизация сокращений в эпоху LLM

Глазкова А. В., Смаль И. В., Lyashevskaya O. et al., Доклады Российской академии наук. Математика, информатика, процессы управления (ранее - Доклады Академии Наук. Математика) 2025 Т. 527 С. 146–155

This paper presents a study on the effectiveness of discriminative methods for abbreviation lemmatization in Russian texts. Unlike generative approaches, discriminative models select the optimal lemma from a fixed set of candidates, eliminating the risk of generating grammatically incorrect word forms. For the first time in Russian language processing, we conduct a comprehensive analysis of ...

Added: March 10, 2026

Transformer-based approaches for lemmatizing abbreviations in Russian texts

Glazkova A., Lyashevskaya O., Morozov D. et al., Journal of Mathematical Sciences 2025 Vol. 546 P. 32–47

This paper addresses the task of lemmatizing abbreviations in the Russian language. Abbreviation lemmatization is particularly challenging, as it involves not only transforming a word into its normal form but also correctly expanding the abbreviation. We explore two approaches to this task, both leveraging large pretrained language models. The first approach is generative, where the ...

Added: March 10, 2026

Ансамбль современных моделей компьютерного зрения для задачи обнаружения дипфейков

Pikul A. S., Безопасность информационных технологий 2024 Т. 31 № 4 С. 116–127

This article explores the potential use of modern computer vision architectures for the task of deepfake detection. The following architectures are considered: EfficientNet, Vision Transformer (ViT), VisionLSTM (ViL), Vision KAN, and Mamba Vision. The novelty of the approach lies in the application and comparison of these architectures, as well as their combination into paired ensembles ...

Added: December 12, 2025

Recognition of Mentally Pronounced Russian Phonemes Using Convolutional Neural Networks and Electroencephalography Data

Seleznev L. E., Chupakhin A. A., Kostenko V. A. et al., Optical Memory and Neural Networks (Information Optics) 2023 Vol. 32 No. 2 P. 73–85

We analyze a classification problem of mentally pronounced Russian phonemes based on data obtained by means of an electroencephalography device. We describe the data collection method as well as the methods of the obtained data processing. To solve the small sample size problem we present the augmentation techniques that use the time stretching and the ...

Added: October 2, 2025

Convolutional Neural Networks Decode Finger Movements in Motor Sequence Learning from MEG Data

Zabolotniy A., Chan R. W., Moiseeva V. et al., Frontiers in Neuroscience 2025 Vol. 19 Article 1623380

We demonstrated the feasibility of finger movement decoding with a tailored Convolutional Neural Network. The performance of our approach was comparable to complex deep learning architectures, while providing faster and interpretable outcome. This algorithmic strategy holds high potential for the investigation of the mechanisms underlying non-invasive neurophysiological recordings in cognitive neuroscience. ...

Added: October 2, 2025

Development of Image Preprocessing Methods for Software Compensation of Refraction Anomalies of an Observer’s Eyes

Алкзир Н., Yarykina n., Nikolaev D. et al., Neuroscience and Behavioral Physiology 2024

Added: April 28, 2025

Approximating and Predicting Energy Consumption of Portable Devices

Ullah T., Siraj A. H., Umer Mukhtar Andrabi et al., , in: 2022 VIII International Conference on Information Technology and Nanotechnology (ITNT).: IEEE, 2022. P. 1–7.

Added: March 20, 2025

Features of Data Collection and Software Tool Architecture for Performing Predictive Analysis of Phenomena Leading to Forest Fires

Hlib Nekrasov, Aleksandr Belov, , in: International IoT, Electronics and Mechatronics Conference, Volume 2. Proceedings of IEMTRONICS 2024. LNEE, volume 1228Vol. 1228.: Springer Publishing Company, 2025. P. 379–395.

Added: January 26, 2025

Automatic Morpheme Segmentation for Russian: Can an Algorithm Replace Experts?

Morozov D., Garipov T., Lyashevskaya O. et al., Journal of Language and Education 2024 Vol. 10 No. 4 P. 71–84

Introduction: Numerous algorithms have been proposed for the task of automatic morpheme segmentation of Russian words. Due to the differences in task formulation and datasets utilized, comparing the quality of these algorithms is challenging. It is unclear whether the errors in the models are due to the ineffectiveness of algorithms themselves or to errors and inconsistencies ...

Added: January 7, 2025

Нейросетевой алгоритм выявления и удаления выбросов в зашумленных наборах данных

Yasnitsky L., Plotnikova E. G., Прикладная информатика 2024 Т. 19 № 5 С. 88–100

Outliers in statistical data, which are the result of erroneously collected information, are often an obstacle to the successful application of machine learning methods in many subject areas. The presence of outliers in training data sets reduces the accuracy of machine learning models, and in some cases, makes the application of these methods impossible. Currently ...

Added: November 29, 2024

Proceedings Volume 11605, Thirteenth International Conference on Machine Vision

Teplyakov L., Kaymakov K., Shvets E. et al., SPIE, 2021.

Line detection is an important computer vision task traditionally solved by Hough Transform. With the advance of deep learning, however, trainable approaches to line detection became popular. In this paper we propose a lightweight CNN for line detection with an embedded parameter-free Hough layer, which allows the network neurons to have global strip-like receptive fields. ...

Added: November 5, 2024

Эмоциональный анализ постов в ВКонтакте: классификатор или регрессор

Kolmogorova A., Калинин А. А., В кн.: Компьютерная лингвистика и интеллектуальные технологии: по материалам международной конференции «Диалог 2022», выпуск 21Вып. 21.: Изд-во РГГУ, 2022. С. 311–322.

The article summarizes the results of two tasks in machine learning paradigm: the task of classification according to the criterion of dominating emotion on the data of social networks posts in Russian and the regression task using the same data. The experiments are conducted on the data set collected from VKontakte social network and consisted of 3879 posts ...

Added: March 18, 2024

Machine learning approach for scientific and technical expertise

A. V. Belov, E. A. Egorova, Bulletin D. Serikbayev East Kazakhstan Technical University 2023 No. 4 P. 92–102

When conducting scientific and technical expertise, it is necessary to analyze the texts of reports on scientific research work. The analysis is carried out in order to determine whether the research being conducted belongs to the class of scientific research and development work in the field of IT. This article discusses the tasks of binary ...

Added: March 9, 2024

Content policy and access limitations on commercial neural networks as an incentive to artivism

Milovidov S., Artnodes 2024 No. 33 P. 1–9

This article employs a case‐study method to investigate the artivism neural network community concentrated on Twitter (since renamed X), which has been ideologically influenced by the content policy and limitations of OpenAI. Today, many young artists using machine learning technologies in their artworks (Midjourney, Stable Diffusion, Kandinsky) note that despite significant progress in the field ...

Added: February 1, 2024

Обзор методов стегоанализа с использованием нейронных сетей

Kosmachev A., Задорожникова А. А., Perov A., В кн.: БОЛЬШИЕ ДАННЫЕ Материалы I Международного форума (Новосибирск, 16–18 ноября 2022 года).: Новосибирск: Новосибирский государственный университет экономики и управления «НИНХ», 2023.

The article discusses the basic concepts and terms used in steganography, substantiates the relevance of the problem of steganalysis, discusses the use of deep neural networks in the tasks of steganalysis on digital images. A comparative analysis and description of the most effective convolutional network architectures for solving the task is performed. ...

Added: January 26, 2024

Classification of Short Scientific Texts

I. K. Kusakin, Fedorets O. V., A. Y. Romanov, Scientific and Technical Information Processing 2023 Vol. 50 No. 3 P. 176–183

This paper discusses modern approaches to natural language processing and the application of machine learning models to the task of classifying short scientific texts in Russian. This study is devoted to the analysis of methods for vectorization of textual information, selection of a model for scientific paper clas- sification, and training of linguistic model BERT ...

Added: November 4, 2023

Применение методов машинного обучения при автоматизации детектирования препятствия движению поезда через железнодорожный переезд

Искандеров Ю. М., Катарушкин Б. Е., Ершов А. А., Информатизация и связь 2020 № 2 С. 46–51

Aim. Currently, when creating intelligent information systems in various fields of practical activity, machine learning methods are used. The article shows the possibilities of using these methods in automating the detection of obstacles in the interest of improving safety and reducing the number of emergencies at level crossings. Materials and methods. The article discusses advanced ...

Added: September 15, 2023