• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • Text classification with deep learning neural networks
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
June 5, 2026
Neural Network Maps as a Method for Constructing Mathematical Models
Scientists from HSE University–Nizhny Novgorod and the Institute of Physics Belgrade, Serbia, are jointly exploring the application of machine learning techniques and neural networks to the study of nonlinear dynamics. Natalya Stankevich, Leading Research Fellow at the Laboratory of Topological Methods in Dynamics of the Faculty of Informatics, Mathematics, and Computer Science at HSE University–Nizhny Novgorod, spoke to the HSE News Service about this international project.
June 5, 2026
‘In the Age of Technology, It Is Interesting to Look into the Past and Think about What We Can Take from It
Polina Tabakova decided to apply for a Philology degree at HSE in Nizhny Novgorod because she grew up in Mari El and did not want to move far away from the Russian forests. In an interview for the Young Scientists of HSE University project, she spoke about the genre of the campus novel, the existential drama of Kolobok, and a blackout version of Eugene Onegin.
June 5, 2026
HSE Scientists Develop Method to Compress Large Language Models Without Losing Quality
Researchers from the AI and Digital Science Institute at the HSE Faculty of Computer Science have developed a new compression method for large language models such as GPT and LLaMA that reduces their size by 25–36% without additional training or significant loss of accuracy. This is the first approach to use mathematical transformations—specifically, rotations of model weights—to make models more amenable to compression with structured matrices. The study results have been published in ACL Findings 2025. The code is available on GitHub.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Text classification with deep learning neural networks

P. 362–370.
Voronkov Ilia, Amajd M., Kaimuldenov Z.

In this paper, we analyze the use of different neural networks for the
text classification task. The accuracy of the studied text classifiers can be
changed by a small number of previously classified texts. This is important due
to the fact that in many applications of text classification a large number of un-
labeled texts are easily accessible, while the receipt of marked texts is quite a
difficult task. The paper also shows that the convolution neural network can
work better at the level of words, and does not require knowledge of the syntac-
tic or semantic structure of the language. On the other hand, a recurrent neural
network for the level of data representation in the form of a sequence can effec-
tively classify the text. Experimental results obtained for text corpora from two
different sources show that using a vector data representation can also improve
the accuracy of the classification.

Language: English
Text on another site
Keywords: convolutional neural networkstext classificationword embeddingsrecurrent neural networkNeural Network

In book

Actual Problems of System and Software Engineering 2017. Proceedings of the 5th International Conference on Actual Problems of System and Software Engineering Supported by Russian Foundation for Basic Research. Project #17-07-20565 Moscow, Russia, November 14-16, 2017, 408 P.
Vol. 1989. , Aachen: CEUR Workshop Proceedings, 2017.
Similar publications
Дискриминативная лемматизация сокращений в эпоху LLM
Глазкова А. В., Смаль И. В., Lyashevskaya O. et al., Доклады Российской академии наук. Математика, информатика, процессы управления (ранее - Доклады Академии Наук. Математика) 2025 Т. 527 С. 146–155
This paper presents a study on the effectiveness of discriminative methods for abbreviation lemmatization in Russian texts. Unlike generative approaches, discriminative models select the optimal lemma from a fixed set of candidates, eliminating the risk of generating grammatically incorrect word forms. For the first time in Russian language processing, we conduct a comprehensive analysis of ...
Added: March 10, 2026
Transformer-based approaches for lemmatizing abbreviations in Russian texts
Glazkova A., Lyashevskaya O., Morozov D. et al., Journal of Mathematical Sciences 2025 Vol. 546 P. 32–47
This paper addresses the task of lemmatizing abbreviations in the Russian language. Abbreviation lemmatization is particularly challenging, as it involves not only transforming a word into its normal form but also correctly expanding the abbreviation. We explore two approaches to this task, both leveraging large pretrained language models. The first approach is generative, where the ...
Added: March 10, 2026
Ансамбль современных моделей компьютерного зрения для задачи обнаружения дипфейков
Pikul A. S., Безопасность информационных технологий 2024 Т. 31 № 4 С. 116–127
This article explores the potential use of modern computer vision architectures for the task of deepfake detection. The following architectures are considered: EfficientNet, Vision Transformer (ViT), VisionLSTM (ViL), Vision KAN, and Mamba Vision. The novelty of the approach lies in the application and comparison of these architectures, as well as their combination into paired ensembles ...
Added: December 12, 2025
Recognition of Mentally Pronounced Russian Phonemes Using Convolutional Neural Networks and Electroencephalography Data
Seleznev L. E., Chupakhin A. A., Kostenko V. A. et al., Optical Memory and Neural Networks (Information Optics) 2023 Vol. 32 No. 2 P. 73–85
We analyze a classification problem of mentally pronounced Russian phonemes based on data obtained by means of an electroencephalography device. We describe the data collection method as well as the methods of the obtained data processing. To solve the small sample size problem we present the augmentation techniques that use the time stretching and the ...
Added: October 2, 2025
Convolutional Neural Networks Decode Finger Movements in Motor Sequence Learning from MEG Data
Zabolotniy A., Chan R. W., Moiseeva V. et al., Frontiers in Neuroscience 2025 Vol. 19 Article 1623380
We demonstrated the feasibility of finger movement decoding with a tailored Convolutional Neural Network. The performance of our approach was comparable to complex deep learning architectures, while providing faster and interpretable outcome. This algorithmic strategy holds high potential for the investigation of the mechanisms underlying non-invasive neurophysiological recordings in cognitive neuroscience. ...
Added: October 2, 2025
Development of Image Preprocessing Methods for Software Compensation of Refraction Anomalies of an Observer’s Eyes
Алкзир Н., Yarykina n., Nikolaev D. et al., Neuroscience and Behavioral Physiology 2024
Added: April 28, 2025
Approximating and Predicting Energy Consumption of Portable Devices
Ullah T., Siraj A. H., Umer Mukhtar Andrabi et al., , in: 2022 VIII International Conference on Information Technology and Nanotechnology (ITNT).: IEEE, 2022. P. 1–7.
Added: March 20, 2025
Features of Data Collection and Software Tool Architecture for Performing Predictive Analysis of Phenomena Leading to Forest Fires
Hlib Nekrasov, Aleksandr Belov, , in: International IoT, Electronics and Mechatronics Conference, Volume 2. Proceedings of IEMTRONICS 2024. LNEE, volume 1228Vol. 1228.: Springer Publishing Company, 2025. P. 379–395.
Added: January 26, 2025
Automatic Morpheme Segmentation for Russian: Can an Algorithm Replace Experts?
Morozov D., Garipov T., Lyashevskaya O. et al., Journal of Language and Education 2024 Vol. 10 No. 4 P. 71–84
Introduction: Numerous algorithms have been proposed for the task of automatic morpheme segmentation of Russian words. Due to the differences in task formulation and datasets utilized, comparing the quality of these algorithms is challenging. It is unclear whether the errors in the models are due to the ineffectiveness of algorithms themselves or to errors and inconsistencies ...
Added: January 7, 2025
Нейросетевой алгоритм выявления и удаления выбросов в зашумленных наборах данных
Yasnitsky L., Plotnikova E. G., Прикладная информатика 2024 Т. 19 № 5 С. 88–100
Outliers in statistical data, which are the result of erroneously collected information, are often an obstacle to the successful application of machine learning methods in many subject areas. The presence of outliers in training data sets reduces the accuracy of machine learning models, and in some cases, makes the application of these methods impossible. Currently ...
Added: November 29, 2024
Proceedings Volume 11605, Thirteenth International Conference on Machine Vision
Teplyakov L., Kaymakov K., Shvets E. et al., SPIE, 2021.
Line detection is an important computer vision task traditionally solved by Hough Transform. With the advance of deep learning, however, trainable approaches to line detection became popular. In this paper we propose a lightweight CNN for line detection with an embedded parameter-free Hough layer, which allows the network neurons to have global strip-like receptive fields. ...
Added: November 5, 2024
Эмоциональный анализ постов в ВКонтакте: классификатор или регрессор
Kolmogorova A., Калинин А. А., В кн.: Компьютерная лингвистика и интеллектуальные технологии: по материалам международной конференции «Диалог 2022», выпуск 21Вып. 21.: Изд-во РГГУ, 2022. С. 311–322.
The article summarizes the results of two tasks in machine learning paradigm: the task of classification according to the criterion of dominating emotion on the data of social networks posts in Russian and the regression task using the same data. The experiments are conducted on the data set collected from VKontakte social network and consisted of 3879 posts ...
Added: March 18, 2024
Machine learning approach for scientific and technical expertise
A. V. Belov, E. A. Egorova, Bulletin D. Serikbayev East Kazakhstan Technical University 2023 No. 4 P. 92–102
When conducting scientific and technical expertise, it is necessary to analyze the texts of reports on scientific research work. The analysis is carried out in order to determine whether the research being conducted belongs to the class of scientific research and development work in the field of IT. This article discusses the tasks of binary ...
Added: March 9, 2024
Content policy and access limitations on commercial neural networks as an incentive to artivism
Milovidov S., Artnodes 2024 No. 33 P. 1–9
This article employs a case‐study method to investigate the artivism neural network community concentrated on Twitter (since renamed X), which has been ideologically influenced by the content policy and limitations of OpenAI. Today, many young artists using machine learning technologies in their artworks (Midjourney, Stable Diffusion, Kandinsky) note that despite significant progress in the field ...
Added: February 1, 2024
Обзор методов стегоанализа с использованием нейронных сетей
Kosmachev A., Задорожникова А. А., Perov A., В кн.: БОЛЬШИЕ ДАННЫЕ Материалы I Международного форума (Новосибирск, 16–18 ноября 2022 года).: Новосибирск: Новосибирский государственный университет экономики и управления «НИНХ», 2023.
The article discusses the basic concepts and terms used in steganography, substantiates the relevance of the problem of steganalysis, discusses the use of deep neural networks in the tasks of steganalysis on digital images. A comparative analysis and description of the most effective convolutional network architectures for solving the task is performed. ...
Added: January 26, 2024
Classification of Short Scientific Texts
I. K. Kusakin, Fedorets O. V., A. Y. Romanov, Scientific and Technical Information Processing 2023 Vol. 50 No. 3 P. 176–183
This paper discusses modern approaches to natural language processing and the application of machine learning models to the task of classifying short scientific texts in Russian. This study is devoted to the analysis of methods for vectorization of textual information, selection of a model for scientific paper clas- sification, and training of linguistic model BERT ...
Added: November 4, 2023
Применение методов машинного обучения при автоматизации детектирования препятствия движению поезда через железнодорожный переезд
Искандеров Ю. М., Катарушкин Б. Е., Ершов А. А., Информатизация и связь 2020 № 2 С. 46–51
Aim. Currently, when creating intelligent information systems in various fields of practical activity, machine learning methods are used. The article shows the possibilities of using these methods in automating the detection of obstacles in the interest of improving safety and reducing the number of emergencies at level crossings. Materials and methods. The article discusses advanced ...
Added: September 15, 2023
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit