• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • RuREBus-2020 Shared Task: Russian Relaton Extraction for Business
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
June 25, 2026
HSE Researchers Make Aldehydes Perform Dual Function
Chemists from HSE University have discovered a way to carry out a reductive addition reaction without using an external reducing agent. Instead, the required 'resource' is supplied by the aldehyde itself, one of the reaction participants. This approach helps prevent unwanted side reactions, reduces toxicity, and simplifies the production and synthesis of organic molecules, including those used in the manufacture of medicines. The study has been published in Journal of Catalysis.
June 25, 2026
HSE Scientists Explain Why Findings in Autism Research Differ
Researchers from the Cognitive Health and Intelligence Centre at HSE University conducted the first-ever systematic review of studies on the specifics of emotion-from-motion perception in autism. The review showed that differences found between autistic and non-autistic individuals are largely associated with the experimental design and the types of tasks given to study participants. The review findings have been published in Research in Autism.
June 22, 2026
‘In Science, You Are Your Own Boss
Polina Nasledskova is interested in identifying gaps in linguistics and topics that have been overlooked by other researchers. In an interview for the  Young Scientists of HSE University project, she spoke about rare ordinal numerals in Nakh-Daghestanian languages, the benefits of knitting for concentration, and the beauty of the Patriarshy Bridge.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

RuREBus-2020 Shared Task: Russian Relaton Extraction for Business

P. 416–432.
Artemova E., Batura T., Sarkisyan V., Tutubalina E., Smurov I.

В статье представлены результаты соревнования по распознаванию именованных сущностей и извлечению отношений. Целью соревнования является сравнение методов извлечения сущностей и отношений на русском языке в постановке, приближенной к индустриальным задачам. В качестве исходной коллекции текстов использовался корпус Минэкономразвития РФ, содержащий программы стратегического развития. Корпус был размечен в соответствии с инструкцией, разработанной авторами статьи. В процессе разметки использовались различные методы активного обучения, что позволило за короткое время создать качественный набор данных. Всего
1
было размечено более двухсот документов. Соревнование проводилось по трем задачам (дорожкам): 1) распознавание именованных сущностей, 2) извлечение отношений и 3) совместное распознавание именованных сущностей и извлечение отношений. Вместе с коллекцией размеченных текстов участникам также были предоставлены неразмеченные тексты, которые могли быть использованы для улучшения решений. В статье дается обзор и сравниваются результаты участников соревнования. Детальное описание соревнования, текстовые коллекции, инструкция по разметке и скрипты для оценки качества доступны по ссылке: https://github.com/dialogue-evaluation/RuREBus

Language: English
Keywords: русскийrelation extractionnamed entity recognitionshared taskсоревнованиеBERT Language Modelраспознавание именованных сущностей извлечение отношенийдообучениеBERT

In book

Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог» (Москва, 17 июня — 20 июня 2020 г.)
Вып. 19(26). , М.: Изд-во РГГУ, 2020.
Similar publications
АНАЛИЗ ТОНАЛЬНОСТИ РУССКОЙ ДРАМЫ XVIII–XX ВВ. КАК ИНСТРУМЕНТ МОДЕЛИРОВАНИЯ ХУДОЖЕСТВЕННОЙ СТРУКТУРЫ
Anisimova K., Цифровые гуманитарные исследования 2025 № 2 С. 24–47
Исследование посвящено описанию эмоциональной динамики как проявления художественной структуры русской драмы XVIII–XX вв. на основе автоматической разметки тональности реплик с использованием нейросетевых моделей BERT-архитектуры. Такие модели, дообученные даже на нехудожественных текстах, показывают удовлетворительные результаты при анализе тональности драматических реплик, что было проверено на вручную размеченной тестовой выборке. На основе такой автоматической эмоциональной разметки было показано, ...
Added: February 24, 2026
Исследования благополучия с помощью передовых методов обработки естественного языка (NLP): перспективы и ограничения
Voevodina E., Современная зарубежная психология 2025 Т. 14 № 3 С. 172–181
Context and relevance. Well-being research faces methodological limitations of conventional psychometric measures, criticized for poor ecological validity, limited information yield, and inadequate capture of multidimensional construct of well-being. Advanced natural language processing (NLP) technologies offer solutions to these constraints. Objective. To evaluate opportunities and challenges of transformer-based NLP for well-being research. Methods and materials. We conducted an analytical review of ...
Added: October 9, 2025
Языковые модели для предобработки текстов в машинном переводе
Mylnikova A., Mylnikov L., Научно-техническая информация. Серия 2: Информационные процессы и системы 2025 № 7 С. 32–44
Рассмотрена модель использования скелетных структур на базе синтаксической разметки для предобработки корпусов текстов перед передачей в нейросетевые модели машинного перевода с целью повышения качества их работы, реализованная с помощью частеречной и синтаксической разметок корпусов текстов, использующих языковую модель, с использованием сети BERT и набора правил. Описана подготовка данных для обучения и предложены способы повышения эффективности ...
Added: September 22, 2025
Spatial representation - Crosslinguistic and experimental studies: Coding manual, Volume
Hickmann M., Hendriks H., Demagny A. et al., Laboratoire Structures Formelles du langage, CNRS & University of Paris, 2022.
The information comprised in these appendices (Volume 2) are complementary to the Coding Manual (Volume 1) with the same title. The two Volumes are based on data collected in a series of experimental studies (controlled production) and longitudinal data collection (spontaneous productions by young children) and concern the expression of space and / or time in spoken language. ...
Added: March 11, 2025
Индекс этичности российских банков на основе искусственного интеллекта
Storchevoy M., Parshakov P., Paklina S. et al., Доклады Российской академии наук. Математика, информатика, процессы управления (ранее - Доклады Академии Наук. Математика) 2024 Т. 520 № 6 С. 70–81
Measuring a company's ethics is an important element in the mechanism of regulating the behavior of market participants, as it allows consumers and regulators to make better decisions, which has a disciplining effect on companies. We tested various methods of machine analysis of consumer feedback from Russian banks and developed an Ethics Index that allows ...
Added: October 31, 2024
Исследовательский потенциал корпуса советских песен: эмоциональная тональность и география песенных текстов через призму компьютерных технологий
Kolmogorova A., Зарембо В. С., Ткачева Е. С. et al., В кн.: Лингвистическая семантика в пространственном измерении: Словарь. Дискурс. Корпус.: Екатеринбург: Кабинетный ученый, 2024. Гл. 10 С. 423–445.
The purpose of this study is to describe the characteristics of the text of a popular Soviet song as a linguo-ideological phenomenon. The corpus of Soviet songs collected by the research group is used as material. The focus of this publication is on two characteristics: changes in the emotional tonality of popular songs released on ...
Added: December 10, 2023
Grammar in Language Models: BERT Study
Chistyakova K., Kazakova Tatiana, / NRU HSE. Series WP BRP "Linguistics". 2023. No. 115.
The problem of language models’ interpretation is extensively inspected, but no universal answers have been found. Our study offers to combine widely accepted probing methods with a novel approach to a neural network under investigation. We propose to break grammatical forms on the pre-training step in order to get two "sibling" models, as it casts ...
Added: November 29, 2023
Identifying and Visualizing Trends in Science, Technology, and Innovation Using SciBERT
Lobanova P., Bakhtin P., Sergienko Y., IEEE Transactions on Engineering Management 2024 No. 71 P. 11898–11906
Identification of science, technology, and innovation trends is a critical topic both for the scientific community and for companies that develop technologies, work on science and technology policy or invest in high tech. In this research authors demonstrate a novel approach implemented in iFORA system (developed by National Research University Higher School of Economics) using ...
Added: September 8, 2023
Many heads but one brain: FusionBrain – a single multimodal multitask architecture and a competition
D. D. Bakshandaeva, Dimitrov D. V., Arkhipkin V. S. et al., Computer Optics 2023 Vol. 47 No. 1 P. 185–195
Supporting the current trend in the AI community, we present the AI Journey 2021 Challenge called FusionBrain, the first competition which is targeted to make a universal architecture which could process different modalities (in this case, images, texts, and code) and solve multiple tasks for vision and language. The FusionBrain Challenge combines the following specific tasks: Code2code ...
Added: June 13, 2023
Cross-Domain Limitations of Neural Models on Biomedical Relation Classification
Alimova I., Tutubalina E., Nikolenko S. I., IEEE Access 2022 Vol. 10 P. 1432–1439
Relation extraction (RE) aims to extract relational facts from plain text, which is essential to the biomedical research field with the rapid growth of biomedical literature and generally large volumes of biomedicine-related text coming from various sources. Numerous annotated corpora and state-of-the-art models have been introduced in the past five years. However, there are no ...
Added: April 10, 2023
NEREL-BIO: A Dataset of Biomedical Abstracts Annotated with Nested Named Entities
Loukachevitch N., Manandhar S., Baral E. et al., Bioinformatics 2023 Vol. 39 No. 4 Article btad161
Motivation This paper describes NEREL-BIO – an annotation scheme and corpus of PubMed abstracts in Russian and smaller number of abstracts in English. NEREL-BIO extends the general domain dataset NEREL (Loukachevitch et al., 2021) by introducing domain-specific entity types. NEREL-BIO annotation scheme covers both general and biomedical domains making it suitable for domain transfer experiments. NEREL-BIO ...
Added: April 5, 2023
Исследование методов машинного обучения для классификации научных текстов на русском языке
Кусакин И. К., Федорец О. В., Romanov A., Научно-техническая информация. Серия 2: Информационные процессы и системы 2022 Т. 12 С. 6–9
This paper discusses modern approaches to natural language processing and appliance of artificial intelligence technologies in the task of classifying scientific texts in Russian. The report contains an analysis of implementations of text vectorization methods, a description of experiments with training various classifier models: from classical machine learning algorithms to neural network transformer architectures. ...
Added: January 31, 2023
CROSS-NATIONAL RIVALRY: NATIONAL IDENTITY IN SPORTS (A CASE STUDY OF ENGLISH AND RUSSIAN)
Morozova I., Permyakova T. M., Ross B. et al., Вестник Волгоградского государственного университета. Серия 2: Языкознание 2022 Vol. 21 No. 6 P. 121–131
Nationalism and sport are often interwoven and, subsequently, the competitive nature of sport competition can also mirror the contentious nature between international athletes. Evidence of such inter-group conflict may manifest itself through ethnolinguistics and is reinforced through social identity theory. Data analysis for the English and Russian languages was evaluated in four categories. Data includes ...
Added: January 5, 2023
Illumination estimation challenge: The experience of the first 2 years
Egor Ershov, Savchik A., Ilya Semenkov et al., Color Research and Application 2021 Vol. 46 No. 4 P. 705–718
Illumination estimation is the essential step of computational color constancy, one of the core parts of various image processing pipelines of modern digital cameras. Having an accurate and reliable illumination estimation is important for reducing the illumination influence on the image colors. To motivate the generation of new ideas and the development of new algorithms ...
Added: December 19, 2022
Using Text Analytics for Health to Get Meaningful Insights from a Corpus of COVID Scientific Papers
Soshnikov D. V., Soshnikova V., / Series Computer Science "arxiv.org". 2021.
Since the beginning of COVID pandemic, there have been around 700000 scientific papers published on the subject. A human researcher cannot possibly get acquainted with such a huge text corpus -- and therefore developing AI-based tools to help navigating this corpus and deriving some useful insights from it is highly needed. In this paper, we ...
Added: February 22, 2022
SocialBERT – Transformers for Online Social Network Language Modelling
Ilia Karpov, Nick Kartashev, , in: Analysis of Images, Social Networks and Texts. 10th International Conference, AIST 2021, Tbilisi, Georgia, December 16–18, 2021, Revised Selected Papers.: Cham: Springer, 2022. P. 1–10.
The ubiquity of the contemporary language understanding tasks gives relevance to the development of generalized, yet highly efficient models that utilize all knowledge, provided by the data source. In this work, we present SocialBERT - the first model that uses knowledge about the author’s position in the network during text analysis. We investigate possible models ...
Added: October 31, 2021
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit