Automatic part-of-speech tagging for Russian using transformation-based learning

В. В. Китов

Автоматическое определение частей речи для русского языка с помощью обучения трансформаций.

Научные труды Вольного экономического общества России. 2014. Т. 186. С. 228–235.

Kitov V. V.

In press

This paper describes the application of the well-known transformation-based learning algorithm for automatic rule generation to the task of part-of-speech tagging. The algorithm is applied to corpora of annotated Russian texts, and the resulting accuracy and the most significant rules are reported.
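To make the idea of transformation-based learning concrete, here is a minimal Python sketch. It is not the paper's implementation: the single rule template ("change tag A to B when the previous tag is C"), the function names, the tag labels, and the toy Russian sentences are all assumptions chosen for illustration. The learner starts from a most-frequent-tag baseline and greedily adds the rule with the largest net error reduction on the training data.

```python
from collections import Counter, defaultdict

def baseline_tagger(train):
    """Return a function mapping a word to its most frequent training tag."""
    counts = defaultdict(Counter)
    for sent in train:
        for word, tag in sent:
            counts[word][tag] += 1
    # Unknown words get the overall most frequent tag.
    default = Counter(t for s in train for _, t in s).most_common(1)[0][0]
    lexicon = {w: c.most_common(1)[0][0] for w, c in counts.items()}
    return lambda word: lexicon.get(word, default)

def apply_rule(tags, rule):
    """Rule (a, b, c): change tag a to b when the previous tag is c."""
    a, b, c = rule
    out = list(tags)
    for i in range(1, len(tags)):
        if tags[i] == a and tags[i - 1] == c:
            out[i] = b
    return out

def learn_rules(train, max_rules=10):
    """Greedily pick rules by net gain (errors fixed minus errors introduced)."""
    tag_word = baseline_tagger(train)
    gold = [[t for _, t in s] for s in train]
    current = [[tag_word(w) for w, _ in s] for s in train]
    rules = []
    for _ in range(max_rules):
        scores = Counter()
        # Each current error proposes the rule that would fix it.
        for cur, ref in zip(current, gold):
            for i in range(1, len(cur)):
                if cur[i] != ref[i]:
                    scores[(cur[i], ref[i], cur[i - 1])] += 1
        # Penalize each candidate for correct tags it would break.
        for rule in list(scores):
            a, _, c = rule
            for cur, ref in zip(current, gold):
                for i in range(1, len(cur)):
                    if cur[i] == a and cur[i - 1] == c and ref[i] == a:
                        scores[rule] -= 1
        if not scores:
            break
        best, gain = max(scores.items(), key=lambda kv: kv[1])
        if gain <= 0:
            break
        rules.append(best)
        current = [apply_rule(c, best) for c in current]
    return tag_word, rules

def tag(words, tag_word, rules):
    """Tag a sentence: baseline first, then learned rules in order."""
    tags = [tag_word(w) for w in words]
    for rule in rules:
        tags = apply_rule(tags, rule)
    return tags

# Invented toy corpus: "стали" is a verb ("became") or a noun ("of steel").
TOY_TRAIN = [
    [("из", "ADP"), ("стали", "NOUN")],
    [("мы", "PRON"), ("стали", "VERB"), ("ждать", "VERB")],
    [("они", "PRON"), ("стали", "VERB"), ("петь", "VERB")],
    [("вы", "PRON"), ("стали", "VERB"), ("читать", "VERB")],
    [("без", "ADP"), ("стали", "NOUN")],
]

tag_word, rules = learn_rules(TOY_TRAIN)
print(rules)
print(tag(["из", "стали"], tag_word, rules))
```

On this toy corpus the baseline tags every "стали" as a verb, and the learner recovers one contextual rule that retags it as a noun after a preposition. A realistic tagger would use many more rule templates (neighbouring words, tags two positions away, suffixes) and a large annotated corpus.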

Research target: Philology and Linguistics; Mathematics

Priority areas: IT and mathematics

Language: Russian

Keywords: Russian language; corpus linguistics; morphological homonymy; morphological tagging; morphological disambiguation

Using predefined vector systems to speed up neural network multimillion class classification

Gabdullin N., Androsov I. / Series Computer Science "arxiv.org". 2026.

Label prediction in neural networks (NNs) has O(n) complexity proportional to the number of classes. This holds true for classification using fully connected layers and cosine similarity with some set of class prototypes. In this paper we show that if NN latent space (LS) geometry is known and possesses specific properties, label prediction complexity can ...

Added: April 2, 2026

Relations between Average Clustering Coefficient and Another Centralities in Graphs

Tuzhilin M., Moscow University Mathematics Bulletin 2025 Vol. 80 P. 335–341

Relations between average clustering coefficient and global clustering coefficient, local efficiency, radiality, closeness, betweenness, and stress centralities were obtained for simple graphs. ...

Added: April 1, 2026

Классификация неособых потоков на ориентируемых 4-многообразиях

Pochinka O., Galkin V., Успехи математических наук 2026 Т. 81 № 2 (422) С. 71–144

This paper studies the dynamics of regular four-dimensional flows, their topological classification, and their relationship with the topology of the ambient manifold. Regular flows are topological analogues of Morse–Smale flows. Their introduction is motivated by two facts: 1) the existence of topological manifolds of dimension 4 and higher that admit no smooth structure; 2) the development of methods for the topological classification of smooth systems that use purely topological properties of these systems and ...

Added: April 1, 2026

Полимультипликативные отображения, связанные с алгеброй итерированных рядов Лорана, и многомерный символ Конту-Каррера

Levashev V., Математический сборник 2026 Т. 217 № 4 С. 106–136

This paper studies functorial polymultiplicative maps in n+1 variables from the multiplicative group of the algebra of n-times iterated Laurent series over a commutative ring to the multiplicative group of this ring. It is proved that if such a map is invariant under continuous automorphisms of this algebra, then it coincides, up to sign, with an integer power of the n-dimensional Contou-Carrère symbol. ...

Added: March 31, 2026

Математическое и компьютерное моделирование в экономике, страховании и управлении рисками: сборник статей. Выпуск 10. Материалы XIV Научно-практической конференции

Саратов: Саратовский университет, 2025.

This collection presents the proceedings of the XIV Scientific and Practical Conference "Mathematical and Computer Modeling in Economics, Insurance, and Risk Management". The articles address questions of economic-mathematical and computer modeling and of risk management in finance, insurance, banking, investment, public administration of the economy, business informatics, and other areas of economic-mathematical knowledge. Intended for employees of banks, financial and insurance companies, economic departments of organizations, services for the management of ...

Added: March 31, 2026

Актуарные расчеты : учебник и практикум для вузов

Mironkina Y., Zvezdina N., Скорик М. А. et al., М.: Юрайт, 2025.

This book is a complete introductory course in actuarial mathematics. It covers the basic concepts of insurance and actuarial calculations, financial mathematics, and demographic statistics. It discusses the principles and definitions of insurance, the tasks of actuarial calculations, the structure of the insurance premium and the main approaches to calculating it, and methods of forming insurance reserves in non-life insurance, life insurance, and pensions. The theoretical material ...

Added: March 30, 2026

Функции интермедиальности в сериале "Очень странные дела"

Fomina E., Афанасьев В. А., Философия. Журнал Высшей школы экономики 2026 Т. 10 № 1 С. 189–216

The article is devoted to the realization of intermediality in the series “Stranger Things” (2016–2025). The specific approach of the Duffer brothers, the creators of the series, to recreate the atmosphere of the American 80s allows us to feel the sense of nostalgia, mainly achieved with the help of intermedial inclusions peculiar for the depicted era. In particular, the ...

Added: March 29, 2026

The effect of spelling errors on reading tasks: a study on Russian.

Slioussar N., Chernova D., Magomedova V. et al., The Mental Lexicon 2026 P. 1–31

Many studies on different languages analyzed how spelling errors are produced and detected. Recently, a new generalization was made for several languages: frequently misspelled words are read more slowly, even when they are written correctly and one knows how to spell them. This is explained by the lower quality of their lexical representations diluted by ...

Added: March 26, 2026

Орбиты сферических представлений и двойственность Пясецкого

Shunin D., Математический сборник 2026 Т. 217 № 3 С. 135–160

Dual representations V and V* of a complex connected algebraic group G simultaneously have either infinitely or finitely many orbits. Whenever the latter holds, the orbits in V and V* are in a bijective correspondence called Pyasetskii duality. We obtain a complete description of this duality in the case of spherical representations. ...

Added: March 26, 2026

Паратекст о паратексте

Kasatkina A., Сергеев М. Л., Acta Linguistica Petropolitana. Труды института лингвистических исследований 2025 Т. 21 № 3 С. 13–25

This article introduces a collection of publications selected from the Proceedings of the conference “Circum Text: Para, Meta-, and Other Marginalia” (Institute for Linguistic Studies RAS, St. Petersburg, October 19–21, 2023). It describes the general agenda of paratextual studies and aligns the selected articles with its various aspects. Paratext is a variety of verbal and ...

Added: March 25, 2026

On flexibility of affine factorial varieties

Arzhantsev I., Shakhmatov K., Revista de la Real Academia de Ciencias Exactas, Fisicas y Naturales - Serie A: Matematicas 2026 Vol. 120 Article 55

We give a criterion of factoriality of a suspension. This allows to construct many examples of flexible affine factorial varieties. In particular, we find a homogeneous affine factorial 3-fold that is not a homogeneous space of an algebraic group. ...

Added: March 24, 2026

Emergence of champion solitons from two-solitary-wave interactions in the fourth-order generalized Korteweg–de Vries equation

Flamarion M. V., Pelinovsky E., Chaos, Solitons and Fractals 2025 Vol. 208 No. 3 Article 118271

Two-solitary-wave interactions are investigated within the fourth-order generalized Korteweg–de Vries equation. This equation is closely related to the classical Korteweg–de Vries equation but includes a quartic nonlinear term. We show that, although collisions between two solitary waves are not perfectly elastic, only a small amount of radiation is generated during the interaction. This allows a clear characterization of ...

Added: March 22, 2026

On string functions of the generalized parafermionic theories, mock theta functions, and false theta functions

Borozenets N., Mortenson E., Advances in Mathematics 2026 Vol. 484 Article 110684

Kac and Wakimoto introduced the admissible highest weight representations as a conjectural classification of all modular-invariant representations of the affine Kac–Moody algebras. For the affine Kac–Moody algebra A_1^{(1)} their conjectural construction has been proved. Using Kac and Wakimoto's result, Ahn, Chung, and Tye introduced the generalized Fateev–Zamolodchikov parafermionic theories, whose chiral current algebras were recently ...

Added: March 22, 2026

Static manifolds with boundary: Their geometry and some uniqueness theorems

Medvedev V., Annales Henri Poincare. A Journal of Theoretical and Mathematical Physics 2026 P. 1–33

Static manifolds with boundary appear naturally in the context of the prescribed scalar curvature problem on manifolds with boundary, when the mean curvature of the boundary is also prescribed. They also arise in the setting of general relativity: for example, the time-slice of the photon sphere on the Riemannian Schwarzschild manifold splits it into static manifolds with boundary. ...

Added: March 21, 2026

О решении детерминированной и стохастической задачи домашнего хозяйства с конечным горизонтом планирования

Pilnik N., Экономический журнал Высшей школы экономики 2025 Т. 29 № 1 С. 42–71

The article uses the example of an optimization problem of a household that makes a decision on the volumes of consumption and investment to show what difficulties arise in deterministic and stochastic formulations on a finite time interval. In order to make the problem solvable on a finite time interval, a special terminal condition on the ...

Added: March 19, 2026

Особенности стратегии убеждения в российском и китайском политическом дискурсе (на материале политических ток-шоу «60 минут» и «这就是中国» («Это Китай»))

Бинштейн М. М., Вестник Томского государственного университета. Филология 2026 № 99 С. 5–27

The article explores the argumentative nature of political discourse, which, according to the authors, becomes the key to the analysis of the communicative strategy of persuasion. The aim of the research is a comparative analysis of speeches by Russian and Chinese politicians, identifying similarities and differences in the use of rhetorical devices when implementing the persuasion ...

Added: March 19, 2026

Английский язык для профессиональных целей: Когнитивная нейробиология

Zakharova A. V., Мищук А. М., M.: Флинта, 2025.

The aim of the textbook is to develop English skills and competences of biology students to a level necessary for successful oral and written communication in academic and professional spheres. The textbook materials allow for the improvement of essential language skills that are required for academic and professional communication. The textbook consists of four sections that cover the ...

Added: March 19, 2026

Hausdorff dimension estimates for Sudler products with positive lower bound

Гайфулин Д. Р., Hauke M., Nonlinearity 2025 Vol. 38 No. 6 Article 065008

Given an irrational number $\alpha$, we study the asymptotic behaviour of the Sudler product denoted by $P_N(\alpha) =\prod_{r=1}^N 2\lvert \sin \pi r \alpha \rvert$. We show that $\liminf_{N \to \infty} P_N(\alpha) >0$ and $\limsup_{N \to \infty} P_N(\alpha)/N < \infty$ whenever the sequence of partial quotients in the continued fraction expansion of $\alpha$ exceeds 3 only finitely ...

Added: March 19, 2026

Российская социология в условиях цифровизации общества: результаты анализа корпуса научных текстов

Smirnov A., Социологические исследования 2023 № 4 С. 39–50

Using the analysis of a corpus of texts from eight leading Russian sociological journals, the article examines the impact of the digitalization of society on sociology in 2000–2021. Frequency analysis of 13.8 thousand scientific texts tracked the introduction of concepts related to digitalization into academic circulation. The article reveals the differences between the journals, due ...

Added: March 18, 2026

Дискриминативная лемматизация сокращений в эпоху LLM

Глазкова А. В., Смаль И. В., Lyashevskaya O. et al., Доклады Российской академии наук. Математика, информатика, процессы управления (ранее - Доклады Академии Наук. Математика) 2025 Т. 527 С. 146–155

This paper presents a study on the effectiveness of discriminative methods for abbreviation lemmatization in Russian texts. Unlike generative approaches, discriminative models select the optimal lemma from a fixed set of candidates, eliminating the risk of generating grammatically incorrect word forms. For the first time in Russian language processing, we conduct a comprehensive analysis of ...

Added: March 10, 2026

Rubic2: Ensemble Model for Russian Lemmatization

Afanasev I., Glazkova A., Lyashevskaya O. et al., in: Proceedings of the 10th Workshop on Slavic Natural Language Processing (Slavic NLP 2025). Association for Computational Linguistics, 2025. P. 157–170.

Pre-trained language models have significantly advanced natural language processing (NLP), particularly in analyzing languages with complex morphological structures. This study addresses lemmatization for the Russian language, the errors in which can critically affect the performance of information retrieval, question answering, and other tasks. We present the results of experiments on generative lemmatization using pre-trained language ...

Added: March 10, 2026

Transformer-based approaches for lemmatizing abbreviations in Russian texts

Glazkova A., Lyashevskaya O., Morozov D. et al., Journal of Mathematical Sciences 2025 Vol. 546 P. 32–47

This paper addresses the task of lemmatizing abbreviations in the Russian language. Abbreviation lemmatization is particularly challenging, as it involves not only transforming a word into its normal form but also correctly expanding the abbreviation. We explore two approaches to this task, both leveraging large pretrained language models. The first approach is generative, where the ...

Added: March 10, 2026

Говорящий и пишущий: К 100-летию со дня рождения Татьяны Григорьевны Винокур

М.: Институт русского языка им. В.В. Виноградова РАН, 2024.

The book is dedicated to the memory of a remarkable Russian language scholar, Tatyana Grigoryevna Vinokur (1924–1992). The range of issues addressed in the collected scholarly articles reflects the breadth of Tatyana Grigoryevna's research interests: the history of language, poetics, the language of fiction, stylistics, speech culture, problems of communication studies, and many other topics. ...

Added: March 8, 2026

Promotional adjectives in grant proposal abstracts: a corpus study

Tulyakov D., Permyakova T. M., Balezina E., Вестник Волгоградского государственного университета. Серия 2: Языкознание 2025 Vol. 24 No. 6 P. 58–67

By effectively integrating promotional discourse into grant proposal abstracts, researchers can more compellingly present their ideas and increase their chances of securing funding. Implications of promotional adjectives in grant writing might differ across various research fields. This study aims to explore the use of promotional adjectives in abstracts of research grant proposals in six research ...

Added: March 2, 2026