• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • Russe’2018: A shared task on word sense induction for the Russian language
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
April 30, 2026
HSE Researchers Compile Scientific Database for Studying Childrens Eating Habits
The database created at HSE University can serve as a foundation for studying children’s eating habits. This is outlined in the study ‘The Influence of Age, Gender, and Social-Role Factors on Children’s Compliance with Age-Based Nutritional Norms: An Experimental Study Using the Dish-I-Wish Web Application.’ The work has been carried out as part of the HSE Basic Research Programme and was presented at the XXVI April International Academic Conference named after Evgeny Yasin.
April 30, 2026
New Foresight Centre Study Identifies the Most Destructive Global Trends for Humankind
A team of researchers from the HSE International Research and Educational Foresight Centre has examined how global trends affect the quality of human life—from life expectancy to professional fulfilment. The findings of the study titled ‘Human Capital Transformation under the Influence of Global Trends’ were published in Foresight.
April 28, 2026
Scientists Develop Algorithm for Accurate Financial Time Series Forecasting
Researchers at the HSE Faculty of Computer Science benchmarked more than 200,000 model configurations for predicting financial asset prices and realised volatility, showing that performance can be improved by filtering out noise at specific frequencies in advance. This technique increased accuracy in 65% of cases. The authors also developed their own algorithm, which achieves accuracy comparable to that of the best models while requiring less computational power. The study has been published in Applied Soft Computing.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Russe’2018: A shared task on word sense induction for the Russian language

P. 547–564.
Panchenko A., Lopukhina A., Ustalov D., Lopukhin K., Arefyev, N., Leontyev A., Loukachevitch N.

The paper describes the results of the first shared task on word sense induction (WSI) for the Russian language. While similar shared tasks were conducted in the past for some Romance and Germanic languages, we explore the performance of sense induction and disambiguation methods for a Slavic language that shares many features with other Slavic languages, such as rich morphology and virtually free word order. The participants were asked to group contexts of a given word in accordance with its senses which were not provided beforehand. For instance, given a word “bank” and a set of contexts for this word, e.g. “bank is a financial institution that accepts deposits” and “river bank is a slope beside a body of water”, a participant was asked to cluster such contexts into unknown in advance number of clusters corresponding to, in this case, the “company” and the “area” senses of the word “bank”. For the purpose of this evaluation campaign, we developed three new evaluation datasets based on sense inventories that have different sense granularity. The contexts in these datasets were sampled from texts of Wikipedia, the academic corpus of Russian, and an explanatory dictionary of Russian. Overall, 18 teams participated in the competition submitting 383 models. Multiple teams managed to substantially outperform competitive state-of-the-art baselines from the previous years based on sense embeddings.

Language: English
Text on another site
Keywords: lexical semanticshomonymy word sense induction

In book

Computational Linguistics and Intellectual Technologies. International Conference "Dialogue 2018" Proceedings
M.: Conference Proceedings Editorial board, 2018.
Similar publications
Semantic integrity of a word structure and semantic primitives
Trofimova N., Pesina S. A., Vinogradova S. A. et al., Res Militaris 2022 Vol. 12 No. 2 P. 2111–2119
The article attempts to describe the way of storing and functioning of meanings of polysemous words in the linguistic lexicon. To achieve this goal we turned to research of semantic primitives discovered in the course of lexical analysis. Within the framework of the interdisciplinary approach to the problems of words meanings ambiguity, the article justifies ...
Added: February 23, 2026
Лексема "православный" как элемент оппозиции «свое – чужое» в дискурсе IT
Комышкова А. Д., В кн.: Теоретическая семантика и идеографическая лексикография: Словарь. Дискурс. Корпус: тезисы докладов Всероссийской науч. конф. с международным участием. 17-18 октября 2024, Екатеринбург.: Екатеринбург: Кабинетный ученый, 2024.
The article presents an analysis of the semantics of the lexeme православный (orthodox) based on non-standardized written speech on the Internet (using the subcorpus of social networks in the National Corpus of Russian Language). The vast majority of cases where православный is used in a derogatory or ironic sense are related to IT discourse. The meaning of ...
Added: February 19, 2026
Сложное слово и словосочетание: корпусный подход (случай «bad blood»)
Филатов А. С., Когнитивные исследования языка 2025 Т. 1-2 № 25 С. 302–305
The article demonstrates the productivity of corpus-based linguistic analysis regarding the problem of distinguishing phrases from compounds. The object of the research is “bad blood” in the American English language, the morphological status of which is approached in close connection with its real-life usage and the polysemies of its constituents. ...
Added: November 24, 2025
Quantifying lexical distances among Nudiz, Mahmudi, and Verin Dvin Urmi (North-Еastern Neo-Aramaic)
Shvedova E., Koryakov Y., Забелина Е. А., Journal of Language Relationship 2025 Vol. 23 No. 3–4 P. 207–275
This study documents and analyzes lexical data from four Christian North-Eastern Neo-Aramaic varieties: Mahmudi, Nudiz, Verin Dvin Urmi, and Urmia Urmi, focusing on the previously undescribed Mahmudi and Nudiz. We provide correspondences from these lects for an extended 226-item basic vocabulary list collected for this study with etymologies, cognates from earlier Aramaic, and loanword sources. ...
Added: September 4, 2025
Introduction: the role of the lexicon in actionality
Crane T. M., Nichols J., Persohn B., STUF - Language Typology and Universals 2021 Vol. 74 No. 3-4 P. 427–434
Actionality (also referred to by labels such as “lexical aspect” or “aktionsart”) is the semantic dimension that encodes the constituent phases and boundaries of situations. Despite its central role in aspectual interpretation, careful language-specific descriptions and typological surveys of actional systems have been rare thus far. In this introduction, we describe the steps that lead ...
Added: August 2, 2022
Automation In Cognitive Linguistics: The Case Of Old Slavonic And Old Church Slavonic Literary Texts
Afanasev I., Крючкова О. Ю., Lingua Viva 2019 № 29 С. 48–56
The article considers the case of one particular research in the border field between cognitive linguistics and lexical semantics, namely the study of conceptual opposition “us – them”, as exemplified in the Old Slavonic and the Old Church Slavonic literary texts. The researcher faces the problem of processing of hundreds of lexemes, and it is ...
Added: December 28, 2021
Фрагмент лексической системы казымского диалекта хантыйского языка: глаголы pitti ‘упасть, попасть’ и χɔjti ‘задеть, попасть’ и их аргументная структура
Ryzhova D., Урало-алтайские исследования 2022 № 2 (45) С. 123–140
The paper describes semantics of the Kazym Khanty verbs pitti ‘to fall; to get into somewhere’ and χɔjti ‘to touch; to hit the target’ under the framework of the frame-based approach to lexical typology, according to which a word acquires different meanings in different context types. The sets of physical meanings of the verbs in ...
Added: October 30, 2021
An Interpretable Approach to Lexical Semantic Change Detection with Lexical Substitution
Arefyev N.V., Bykov D. A., , in: Computational Linguistics and Intellectual Technologies: Papers from the Annual International Conference “Dialogue” (2021)Issue 20: Основной том.: -, 2021. P. 31–46.
Added: September 23, 2021
Always Keep your Target in Mind: Studying Semantics and Improving Performance of Neural Lexical Substitution
Nikolay Arefyev, Sheludko B., Podolskiy A. et al., , in: Proceedings of the 28th International Conference on Computational Linguistics.: International Committee on Computational Linguistics, 2020. P. 1242–1255.
Lexical substitution, i.e. generation of plausible words that can replace a particular target word in a given context, is an extremely powerful technology that can be used as a backbone of various NLP applications, including word sense induction and disambiguation, lexical relation extraction, data augmentation, etc. In this paper, we present a large-scale comparative study ...
Added: December 7, 2020
Neural GRANNy at SemEval-2019 Task 2: A combined approach for better modeling of semantic relationships in semantic frame induction
Arefyev Nikolay, Sheludko B., Adis D. et al., , in: Proceedings of the 13th International Workshop on Semantic Evaluation (SemEval-2019).: Minneapolis: Association for Computational Linguistics, 2019. P. 31–38.
We describe our solutions for semantic frame and role induction subtasks of SemEval 2019 Task 2. Our approaches got the highest scores, and the solution for the frame induction problem officially took the first place. The main contributions of this paper are related to the semantic frame induction problem. We propose a combined approach that ...
Added: October 10, 2020
Hm2 at semeval 2019 task2: Unsupervised frame induction using contextualized and uncontextualized word embeddings
Anwar S., Ustalov D., Arefyev N. et al., , in: Proceedings of the 13th International Workshop on Semantic Evaluation (SemEval-2019).: Minneapolis: Association for Computational Linguistics, 2019. P. 125–129.
We present our system for semantic frame induction that showed the best performance in Subtask B.1 and finished as the runner-up in Subtask A of the SemEval 2019 Task 2 on unsupervised semantic frame induction (Qasem-iZadeh et al., 2019). Our approach separates this task into two independent steps: verb clustering using word and their context ...
Added: October 10, 2020
How much does a word weight? Weighting word embeddings for word sense induction
Arefyev, N., Ermolaev P., Panchenko A., , in: Computational Linguistics and Intellectual Technologies. International Conference "Dialogue 2018" Proceedings.: M.: Conference Proceedings Editorial board, 2018. P. 68–84.
The paper describes our participation in the first shared task on word sense induction and disambiguation for the Russian language RUSSE'2018 [Panchenko et al., 2018]. For each of several dozens of ambiguous words, the participants were asked to group text fragments containing it according to the senses of this word, which were not provided beforehand, ...
Added: October 9, 2020
Neural networks with attention for word sense induction
Struyanskiy O., Arefyev, N., , in: Supplementary Proceedings of the 7th International Conference on Analysis of Images, Social Networks and Texts (AIST-SUP 2018), Moscow, Russia, July 5-7, 2018.: Aachen: CEUR Workshop Proceedings, 2018. P. 208–213.
Attentional neural networks have achieved remarkable results for a number of tasks in the past few years. The fascinating success of neural networks with attention mechanism in natural language processing, especially in machine translation, suggests that these models can capture the meaning of ambiguous words considering their context. In this paper we introduce a new ...
Added: October 9, 2020
Combining neural language models for word sense induction
Arefyev, N, Boris S., Aleksashina T., , in: Analysis of Images, Social Networks and Texts. 8th International Conference, AIST 2019, Lecture Notes in Computer Science, Revised Selected PapersVol. 11832.: Cham: Springer, 2019. P. 105–121.
Word sense induction (WSI) is the problem of grouping occurrences of an ambiguous word according to the expressed sense of this word. Recently a new approach to this task was proposed, which generates possible substitutes for the ambiguous word in a particular context using neural language models, and then clusters sparse bag-of-words vectors built from ...
Added: October 9, 2020
Combining Lexical Substitutes in Neural Word Sense Induction
Nikolay Arefyev, Boris S., Panchenko A., , in: Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2019.: INCOMA Ltd, 2019. P. 62–70.
Word Sense Induction (WSI) is the task of grouping of occurrences of an ambiguous word according to their meaning. In this work, we improve the approach to WSI proposed by Amrami and Goldberg (2018) based on clustering of lexical substitutes for an ambiguous word in a particular context obtained from neural language models. Namely, we ...
Added: October 9, 2020
Metaphor Is Between Metonymy and Homonymy: Evidence From Event-Related Potentials
Yurchenko A., Lopukhina A., Dragoy O., Frontiers in Psychology 2020 Vol. 11 P. 2113
The goal of the present study was to investigate the interaction between different senses of polysemous nouns (metonymies and metaphors) and different meanings of homonyms using the method of event-related potentials (ERPs) and a priming paradigm. Participants read two-word phrases containing ambiguous words and made a sensicality judgment. Phrases with polysemes highlighted their literal sense ...
Added: September 2, 2020
Глаголы с семантикой падения в современном персидском языке
Armand E., Nikitenko E., Acta Linguistica Petropolitana. Труды института лингвистических исследований 2020 Т. 16.1 С. 609–637
The article provides a classification of approximately 30 verbs used to describe situations associated with the falling of animate and inanimate objects. The authors conclude that in the modern Persian language, the choice of the exact verb to denote falling depends on the degree of integrity of the falling object in question. Only situations in ...
Added: October 28, 2019
Meaning relatedness in polysemous and homonymous words: an ERP study in Russian
Yurchenko A., Lopukhina A., Dragoy O., / NRU HSE. Series WP BRP "Linguistics". 2018. No. 67.
Previous research showed that polysemous and homonymous words are processed differently. However, mechanisms underlying processing of ambiguous words are still unclear. The goal of the present study was to investigate comprehension of metonymies, metaphors, and homonyms using priming paradigm and the method of event-related potentials (ERPs). We asked participants to read two-word phrases with ambiguous ...
Added: December 14, 2018
Стало быть, по русски так говорят: вводная конструкция стало быть в русском языке
Litvintseva K., Leontieva A., Труды института русского языка им. В.В. Виноградова 2019 № 20 С. 128–138
The article describes the linguistic behavior of the introductory phrase stalo byt’ (literally became to be, meaning roughly ‘so’) in the Russian language of the XVIII - XXI centuries. The data from the Russian National Corpus show that this construction acquired additional senses over the centuries. Firstly it was used as a reason and cause ...
Added: September 19, 2018
RUSSE2018: a Shared Task on Word Sense Induction for the Russian Language
Panchenko A., Lopukhina A., Ustalov D. et al., Компьютерная лингвистика и интеллектуальные технологии 2018 No. 17 P. 547–564
The paper describes the results of the first shared task on word sense induction (WSI) for the Russian language. While similar shared tasks were conducted in the past for some Romance and Germanic languages, we explore the performance of sense induction and disambiguation methods for a Slavic language that shares many features with other Slavic ...
Added: June 7, 2018
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit