Word Sense Frequency of Similar Polysemous Words in Different Languages

B. Iomdin; A. Lopukhina; Лопухин К. А.; Носырев Г. В.

Компьютерная лингвистика и интеллектуальные технологии. 2016. No. 15. P. 214–225.

When words have several senses, it is important to describe them properly in a dictionary (a lexicographic task) and to be able to distinguish them in a given context (a computational linguistics task, word sense disambiguation, WSD). Different senses normally have different frequencies in corpora. We introduced several techniques for determining sense frequency based on dictionary entries matched with data from large corpora. Information about word sense frequency is useful not only for explanatory lexicography and WSD; it may also enrich language learning resources. Learners of a foreign language who encounter a word similar to one in their native language are often tempted to assume that the foreign word and its equivalent have the same meaning structure. Sometimes, however, this is not the case, and the most frequent sense of a word in one language may be much less frequent for its cognate. We proposed a method for detecting such cases. Having selected a set of Russian words included in the Active Dictionary of Russian which have more than two dictionary senses and have cognates in English, we estimated the frequencies of the English and Russian senses using SemCor and the Russian National Corpus respectively, matched the senses in each pair of words, and compared their frequencies. We thus revealed cases in which the most frequent senses and whole meaning structures are, cross-linguistically, substantially different, and studied them in more detail. This technique can be applied not only to cognates but also to pairs of words which dictionaries usually offer as translation equivalents of each other.
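The comparison step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the "top senses differ" criterion, the use of total variation distance as a summary score, and the toy counts are all assumptions made for illustration. Real input would be per-sense counts from sense-annotated samples (for example, from the Russian National Corpus and SemCor) and a manually built mapping between matched senses.

```python
from typing import Dict, Tuple


def relative_frequencies(counts: Dict[str, int]) -> Dict[str, float]:
    """Turn raw per-sense occurrence counts into shares that sum to 1."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no sense-annotated occurrences")
    return {sense: n / total for sense, n in counts.items()}


def compare_cognates(
    ru_counts: Dict[str, int],
    en_counts: Dict[str, int],
    sense_map: Dict[str, str],
) -> Tuple[bool, float]:
    """Compare the sense distributions of a Russian word and its cognate.

    sense_map links a Russian sense label to the matched English sense
    label; Russian senses without a match are kept under a separate key.
    Returns (same_top_sense, distance), where distance is the total
    variation distance between the two distributions (0 = identical,
    1 = no overlap). Both measures are illustrative assumptions.
    """
    ru_freq = relative_frequencies(ru_counts)
    en_freq = relative_frequencies(en_counts)

    # Re-express the Russian distribution over English sense labels.
    projected: Dict[str, float] = {}
    for ru_sense, share in ru_freq.items():
        key = sense_map.get(ru_sense, "ru-only:" + ru_sense)
        projected[key] = projected.get(key, 0.0) + share

    top_ru = max(ru_freq, key=ru_freq.get)
    top_en = max(en_freq, key=en_freq.get)
    same_top = sense_map.get(top_ru) == top_en

    labels = set(projected) | set(en_freq)
    distance = 0.5 * sum(
        abs(projected.get(s, 0.0) - en_freq.get(s, 0.0)) for s in labels
    )
    return same_top, distance


# Hypothetical toy counts, invented purely to exercise the functions.
toy_ru = {"ru_1": 80, "ru_2": 20}
toy_en = {"en_a": 10, "en_b": 90}
toy_map = {"ru_1": "en_a", "ru_2": "en_b"}

same_top, distance = compare_cognates(toy_ru, toy_en, toy_map)
print(same_top, round(distance, 3))
```

A pair would be flagged for closer study when `same_top` is false or when `distance` exceeds a threshold chosen by the analyst. The threshold is left open here because the abstract does not state one.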

Research target: Computer Science Philology and Linguistics

Priority areas: humanitarian

Language: English

Keywords: semantics; frequency; lexicography; polysemy; experiments; text corpora; meaning frequency

(Mis)Understandings in Multicultural Communication: Implications for Second Language Classrooms and Professional Settings

Switzerland: Springer, 2026.

This book explores how multicultural speakers interact in monolingual, bilingual, and telecollaborative contexts in order to establish evidence-based recommendations for best practices in second/foreign language classrooms and professional settings. The book features leading experts sharing valuable insights and cutting-edge research analyses of talk in interaction. It consists of six parts. Part 1 describes its main ...

Added: April 6, 2026

Дмитрий Пригов и Евгений Винокуров: "Домашнее хозяйство" в свете официальной советской поэзии

Павел Успенский, Костомарова К. П., Новый мир 2026 № 3 С. 197–202

The article examines one of the sources of Prigov's cycle "Household Management", namely, Evgeny Vinokurov's poem "Iz istoricheskikh sobitiy..." It proves that the impetus for writing the cycle was official Soviet poetry. ...

Added: April 5, 2026

A New Edition of Adad-nērārī I’s inscription A.0.76.5

Alexandrov B., Vestnik Drevnei Istorii 2025 Vol. 85 No. 3 P. 509–521

The article offers a new treatment of the inscription commissioned by the Assyrian king Adad-nērārī I (1295–1264 BC) and now widely known, thanks to A.K. Grayson’s 1987 edition, as A.0.76.5. The text comes from Qalʽat Širqāṭ / Aššur; in the early 1920s it was brought to Lvov, at that time a part of Poland, and ...

Added: April 4, 2026

Not all coexpressions are syncretisms: Limiting Nanosyntax

Bubnov G., Glossa: a journal of general linguistics 2026 Vol. 11

This paper revises the findings of Dekier (2021) concerning the syncretism and containment of indefinites in light of their semantic implausibility and empirical inadequacy, compared to the alternative semantic approach of Degano & Aloni (2025), and argues against the omnipotence of a nanosyntactic approach to coexpression phenomena. The paper also addresses the diachronic predictions of Nanosyntax and ...

Added: April 3, 2026

Проблемы семантики и прагматики языковых единиц разных уровней в эпоху больших языковых данных. Сборник трудов Международной научной конференции, посвященной памяти доктора филологических наук, профессора Ольги Павловны Ермаковой

Калуга: ФБГОУ ВПО "Калужский государственный университет им. К.Э.Циолковского", 2025.

This volume presents papers by linguists from several countries (Russia, Belarus, Moldova, China) delivered at the International Scientific Conference in memory of Olga Pavlovna Ermakova, Doctor of Philology and Professor, "Problems of Semantics and Pragmatics of Linguistic Units of Different Levels in the Era of Big Language Data", which was held at Kaluga State University on June 28–30, 2025 and was devoted to ...

Added: April 3, 2026

Цифровая фотография в эпоху социальных медиа: между правдой и вымыслом

Balakina Y. V., Вестник Российского университета дружбы народов. Серия: Литературоведение, журналистика 2026 Т. 31 № 1 С. 163–174

The phenomenon of digital social photography is explored as a product of contemporary social media practices. It analyses the transformation of the role of photography in the digital age, where it functions not merely as a tool for recording reality, but also as an active agent in the process of meaning-making. The study relies on ...

Added: April 3, 2026

Using predefined vector systems to speed up neural network multimillion class classification

Gabdullin N., Androsov I. / Series Computer Science "arxiv.org". 2026.

Label prediction in neural networks (NNs) has O(n) complexity proportional to the number of classes. This holds true for classification using fully connected layers and cosine similarity with some set of class prototypes. In this paper we show that if NN latent space (LS) geometry is known and possesses specific properties, label prediction complexity can ...

Added: April 2, 2026

Математическое и компьютерное моделирование в экономике, страховании и управлении рисками: сборник статей. Выпуск 10. Материалы XIV Научно-практической конференции

Саратов: Саратовский университет, 2025.

The collection presents the proceedings of the XIV Scientific and Practical Conference "Mathematical and Computer Modeling in Economics, Insurance, and Risk Management". The articles address issues related to economic-mathematical and computer modeling and to risk management in finance, insurance, banking, investment, state economic governance, business informatics, and other areas of economic-mathematical knowledge. Intended for employees of banks, financial and insurance companies, economic departments of organizations, management services ...

Added: March 31, 2026

МОДИФИЦИРОВАННАЯ ГРАВИТАЦИОННАЯ МОДЕЛЬ ОЦЕНКИ ДОСТУПНОСТИ МЕДИЦИНСКИХ УСЛУГ: ЗАДАЧА, АЛГОРИТМ И РЕАЛИЗАЦИЯ

Begicheva A., Бегичева С. В., Прикладная информатика 2025 Т. 20 № 5 (119) С. 4–21

Territorial inequality in access to healthcare remains a pressing issue for the healthcare system of the Russian Federation. Significant disparities in transport accessibility, staffing levels, and the spatial distribution of medical facilities complicate evidence-based decision-making, especially in regions with uneven population density and fragmented infrastructure. This creates the need for formalized and reproducible approaches to ...

Added: March 30, 2026

Функции интермедиальности в сериале "Очень странные дела"

Fomina E., Афанасьев В. А., Философия. Журнал Высшей школы экономики 2026 Т. 10 № 1 С. 189–216

The article is devoted to the realization of intermediality in the series “Stranger Things” (2016–2025). The specific approach of the Duffer brothers, the creators of the series, to recreating the atmosphere of the American 80s evokes a sense of nostalgia, achieved mainly through intermedial inclusions characteristic of the depicted era. In particular, the ...

Added: March 29, 2026

A framework for text mining on Twitter: a case study on joint comprehensive plan of action (JCPOA)- between 2015 and 2019

Behzadidoost R., Quality and Quantity 2021 Vol. 56 No. 5 P. 3053–3084

In the big data era, there is a necessity for effective frameworks to collect, retrieve, and manage data. As not all tweets are hashtagged by users, retrieving them is a complicated task. To address this issue, we present a rule-based expert system classifier that uses the well-known concept of fingerprint in the judicial sciences. This ...

Added: March 27, 2026

The effect of spelling errors on reading tasks: a study on Russian.

Slioussar N., Chernova D., Magomedova V. et al., The Mental Lexicon 2026 P. 1–31

Many studies on different languages analyzed how spelling errors are produced and detected. Recently, a new generalization was made for several languages: frequently misspelled words are read more slowly, even when they are written correctly and one knows how to spell them. This is explained by the lower quality of their lexical representations diluted by ...

Added: March 26, 2026

Паратекст о паратексте

Kasatkina A., Сергеев М. Л., Acta Linguistica Petropolitana. Труды института лингвистических исследований 2025 Т. 21 № 3 С. 13–25

This article introduces a collection of publications selected from the Proceedings of the conference “Circum Text: Para, Meta-, and Other Marginalia” (Institute for Linguistic Studies RAS, St. Petersburg, October 19–21, 2023). It describes the general agenda of paratextual studies and aligns the selected articles with its various aspects. Paratext is a variety of verbal and ...

Added: March 25, 2026

О задаче построения децентрализованной интеллектуальной транспортной системы на основе протокола RAFT и кластеризации по сетевому расстоянию.

Kaperko A., Городничев М. Г., Саксонов Е. А. et al., Вестник Рязанского государственного радиотехнического университета, Российская Федерация 2025 № 94 С. 59–67

The article is devoted to the development and experimental evaluation of a decentralized architecture for an intelligent transport system (ITS) based on the Raft consensus protocol and the network distance metric (RTT) server clustering method. It is shown that existing solutions either require manual configuration and centralized coordination, or are not optimized for latency with ...

Added: March 25, 2026

Особенности стратегии убеждения в российском и китайском политическом дискурсе (на материале политических ток-шоу «60 минут» и «这就是中国» («Это Китай»))

Бинштейн М. М., Вестник Томского государственного университета. Филология 2026 № 99 С. 5–27

The article explores the argumentative nature of political discourse, which, according to the authors, becomes the key to the analysis of the communicative strategy of persuasion. The aim of the research is a comparative analysis of speeches by Russian and Chinese politicians, identifying similarities and differences in the use of rhetorical devices when implementing the persuasion ...

Added: March 19, 2026

Толковый словарь русской разговорной речи. Вып. 6, дополнительный, часть 1: А-И

Жидкова Е. Г., Занадворова А. В., Какорина Е. В. et al., Институт русского языка им. В.В. Виноградова РАН, 2026.

The dictionary provides a description of the vocabulary of modern Russian colloquial speech. The dictionary is experimental and, unlike most academic dictionaries, is not prescriptive. The authors' goal was to reflect as fully as possible, in dictionary form, the semantic, grammatical, collocational, and stylistic properties of the lexical and phraseological means used in the everyday ...

Added: March 9, 2026

Лексикографическая копилка

-, 2025.

This collection of scholarly articles includes materials of scientific value for further linguistic, and especially lexicographic, work. True to its title, the collection gathers information not only about lexicographic resources that already exist but also about those still being created. The section "Lexicographic Projects" is set aside specifically for this purpose. All articles are research-based. The range of languages represented in this ...

Added: February 23, 2026

A naive picture of the world and a biosemantic approach to describing the lexical structure of a word

Trofimova N., Pesina S., Vinogradova S. et al., Revista EntreLinguas 2023 Vol. 9 No. 00

The problems of studying the lexical structure of a word extend into various areas of cognitive science, including biosemiotics. In the article, the biosemiotic approach is reframed into a biosemantic approach based on decoding specific lexical structures. The lexical invariants of polysemous words are shown to be meaningful cores of their figurative ...

Added: February 23, 2026

Textual metaphtonymy as opposed to lexical metaphtonymy

Trofimova N., Pesina S., Vinogradova S. et al., Brazilian Journal of Education, Technology and Society - BRAJETS 2025 Vol. 17 No. 3 P. 126–136

In this article, we have proposed to differentiate textual and lexical metaphtonymy as different phenomena. We have considered the phenomenon of metaphtonymy at the level of text, a chain of words the length of a sentence, a phrase, and a separate meaning. We propose to reserve the term “metaphtonymy” for cases when we are talking ...

Added: February 22, 2026

Когнитивные исследования языка

Тамбов: Тамбовский государственный университет им. Г.Р. Державина, 2025.

"Cognitive Studies of Language" is a leading scientific field and periodical studying language as a cognitive mechanism, a tool for conceptualizing and categorizing the world, storing and processing information. It analyzes the relationship between language, consciousness, and mental processes, including conceptual analysis, frame semantics, and modeling, drawing on the work of Russian and international experts. The ...

Added: February 22, 2026

Rendering ‘The Story of God and People’ into Minimal Russian: A Translator’s Commentary

Gladkova A., in: Explorations in Applied Ethnolinguistics: Words, Cultures, and Global Perspectives. Palgrave Macmillan, 2025. Ch. 8. P. 149–162.

This chapter is a reflective analysis of the author’s experience of translating The Story of God and People (henceforth: The Story) from Anna Wierzbicka’s book What Christians Believe: The Story of God and People in Minimal English from English into Russian (Wierzbicka, 2019; Vežbickaja, 2021). ...

Added: January 28, 2026

Attribution of de re Propositional Attitudes as a Means of Persuasion

D.B. Tiskin, Frolov K., Herald of the Russian Academy of Sciences 2025 Vol. 95 No. 1 P. 26–33

By de re propositional attitude ascription for rhetorical purposes we will understand uttering a modal statement wherein the speaker deliberately uses a description of the object of an attitude that is knowingly unavailable for the attitude holder. As the existence of the de re rhetorical statement class is revealed, it gives rise to two questions, which will be the primary concern ...

Added: January 26, 2026

Русские глаголы исчезновения в типологическом контексте

Albitskiy P., Rakhilina E. V., Acta Linguistica Petropolitana. Труды института лингвистических исследований 2025 № 21.1 С. 34–61

The article examines Russian verbs of disappearance analyzing their semantic and syntactic properties. These verbs are classified as “underdetermined predicates” similar to verbs of hiding and searching as they do not specify the exact process of disappearance only indicating its result. Unlike verbs of searching and hiding, verbs of disappearance do not describe associated processes ...

Added: December 6, 2025

Сложное слово и словосочетание: корпусный подход (случай «bad blood»)

Филатов А. С., Когнитивные исследования языка 2025 Т. 1-2 № 25 С. 302–305

The article demonstrates the productivity of corpus-based linguistic analysis regarding the problem of distinguishing phrases from compounds. The object of the research is “bad blood” in the American English language, the morphological status of which is approached in close connection with its real-life usage and the polysemies of its constituents. ...

Added: November 24, 2025