• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • FactRuEval 2016: Evaluation of Named Entity Recognition and Fact Extraction Systems for Russian
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
May 25, 2026
HSE Scientists Train Neural Network to 'Hear' Faults in Electric Motors
Researchers at the AI and Digital Science Institute of the HSE Faculty of Computer Science have developed a new method—the Signature-Guided Data Augmentation (SGDA) framework—that achieves 99% accuracy in motor fault detection and 86% accuracy in fault classification. The application of this approach can reduce industrial equipment repair costs, minimise downtime, and improve production safety. The study results have been published in Engineering Applications of Artificial Intelligence.
May 25, 2026
'The Humanities Serve as a Conscience'
Maria Mizernaia studies Soviet literature and the history of book publishing. In this interview for the HSE Young Scientists project, she discusses plans to publish a novel about besieged Leningrad, AI-provoked reflections on what it means to be human, and how novels can help satisfy our dopamine hunger.
May 25, 2026
Is It Possible to Predict a Citys Life Based on the Shape of Its Neighbourhoods?
Is it possible to predict, based on the configuration of streets and buildings, where a café will open or where traffic congestion will occur? Participants in the Spatial Analysis and Modelling of Urban Processes research and study group use open data and machine learning to identify universal patterns. Alexander Sheludkov and Eduard Somov discuss the purpose of comparing cities, the need for new forms of urban statistics, and how open data is transforming approaches to urban studies.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

FactRuEval 2016: Evaluation of Named Entity Recognition and Fact Extraction Systems for Russian

P. 688–705.
Starostin A. S., Bocharov V. V., Alexeeva S. V., Bodrova A. A., Chuchunkov A. S., Dzhumaev S. S., Efimenko I. V., Granovsky D. V., Khoroshevsky V. F., Krylova I. V., Nikolaeva M. A., Smurov I. M., Toldova S. Y.

In this paper, we describe the rules and results of the FactRuEval informa- tion extraction competition held in 2016 as part of the Dialogue Evaluation initiative in the run-up to Dialogue 2016. The systems were to extract in- formation from Russian texts and competed in two named entity extraction tracks and one fact extraction track. The paper describes the tasks set be- fore the participants and presents the scores achieved by the contending systems. Additionally, we dwell upon the scoring methods employed for evaluating the results of all the three tracks and provide some preliminary analysis of the state of the art in Information Extraction for Russian texts. We also provide a detailed description of the composition and general orga- nization of the annotated corpus created for the competition by volunteers using the OpenCorpora.org platform. The corpus is publicly available and is expected to evolve in the future. 

Language: English
Full text
Text on another site
Keywords: evaluationinformation extractionrelation extractionnamed entity recognitionfact extraction

In book

Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог» (Москва,1–4 июля 2016 г.)
Вып. 15. , М.: Изд-во РГГУ, 2016.
Similar publications
The interplay of conceptual metaphors and evaluation in press reports on the AUKUS agreement
Trnavac R., Katie Patterson J., Russian Journal of Linguistics 2025 Vol. 29 No. 3 P. 560–585
The linguistic literature has shown that metaphor provokes evaluative meanings, but it is unclear how different types of metaphor, as well as different genres of discourse influence the realization of such meanings. To partially answer this question, our study aims to investigate the relationship between conceptual metaphors and evaluation in Australian broadsheet and tabloid articles ...
Added: September 23, 2025
Developing an Approach for Automated Data Collection and Mining Using Web Scraping Techniques and Large Language Models: A Case Study on Extracting Technology Readiness Level Assessments
F. M. Grozovskiy, I. V. Loginova, Automatic Documentation and Mathematical Linguistics 2025 Vol. 59 No. 4 P. 269–278
The paper proposes an approach to the automated extraction and structuring of information from text, combining web scraping for data collection from online sources with a large language model for subsequent data mining. As a case study, texts from news publications on technology readiness levels from the CNews website were chosen to test the developed methodology in a ...
Added: August 25, 2025
Родительская оценка эффективности социальной интервенции в проблемную семью с детьми
Abramov R., Rozhdestvenskaya E., Sablya A., Журнал социологии и социальной антропологии 2025 Т. 28 № 2 С. 182–210
Evidence-based social policy has become a prevalent approach to managing social programs, relying heavily on evaluation procedures often intertwined with sociological methods. This article analyzes the implementation of one such evaluation procedure—feedback from recipients of social intervention—within a comprehensive assessment of an interdisciplinary team’s intervention technology addressing child rights violations. Interviews with family members participating in the program were conducted. Findings ...
Added: July 6, 2025
Argument Identification for Neuro-Symbolic Dispute Resolution in Scientific Peer Review
Baimuratov I., Karpovich A., Lisanyuk E. et al., , in: JCDL '24: Proceedings of the 24th ACM/IEEE Joint Conference on Digital Libraries.: NY: Association for Computing Machinery (ACM), 2024. Ch. 6.
Peer review is a cornerstone of the academic editorial decisionmaking process, yet it faces significant challenges. Artificial intelligence can help address these challenges, but its use raises concerns about reliability and the potential for reproducing existing biases. In this research, we employ a formal argumentation-theoretic framework that allows for explicit analysis of arguments and their ...
Added: May 29, 2025
Sim4Rec: Flexible and Extensible Simulator for Recommender Systems for Large-Scale Data
Anna Volodkevich, Ivanova V., Vasilev A. et al., , in: Advances in Information Retrieval: 47th European Conference on Information Retrieval, ECIR 2025, Lucca, Italy, April 6–10, 2025, Proceedings, Part IV.: Springer, 2025. P. 425–430.
Simulators for recommender systems are widely used for recommender systems performance evaluation and feedback loop effects analysis. Existing simulators often propose inflexible pipelines, are focused on narrow research tasks, or are not adapted to work with industrial large data volumes. To address these challenges, we developed the Sim4Rec simulation framework. The Sim4Rec models key aspects ...
Added: April 10, 2025
From Variability to Stability: Advancing RecSys Benchmarking Practices
Shevchenko V., Belousov N., Vasilev A. et al., , in: KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining.: Association for Computing Machinery (ACM), 2024. P. 5701–5712.
In the rapidly evolving domain of Recommender Systems (RecSys), new algorithms frequently claim state-of-the-art performance based on evaluations over a limited set of arbitrarily selected datasets. However, this approach may fail to holistically reflect their effectiveness due to the significant impact of dataset characteristics on algorithm performance. Addressing this deficiency, this paper introduces a novel ...
Added: November 24, 2024
Исследовательский потенциал корпуса советских песен: эмоциональная тональность и география песенных текстов через призму компьютерных технологий
Kolmogorova A., Зарембо В. С., Ткачева Е. С. et al., В кн.: Лингвистическая семантика в пространственном измерении: Словарь. Дискурс. Корпус.: Екатеринбург: Кабинетный ученый, 2024. Гл. 10 С. 423–445.
The purpose of this study is to describe the characteristics of the text of a popular Soviet song as a linguo-ideological phenomenon. The corpus of Soviet songs collected by the research group is used as material. The focus of this publication is on two characteristics: changes in the emotional tonality of popular songs released on ...
Added: December 10, 2023
Комбинирование методов для извлечения терминов из научно-технического текста
Bolshakova E. I., Семак В. В., Интеллектуальные системы. Теория и приложения 2021 Т. 25 № 4 С. 239–242
An approach to automatic extraction of terms from an individual scientific text is reported, which combines known methods: linguistic patterns, statistical terminological measures, methods of graph ranking. The combined methods and stages for extracting, selection and ranking of terms are described, which are implemented for processing documents in Russian. The results of experiments on extracting ...
Added: November 23, 2023
NEREL: a Russian information extraction dataset with rich annotation for nested entities, relations, and wikidata entity links
Loukachevitch N., Artemova E., Batura T. et al., Language Resources and Evaluation 2024 Vol. 58 P. 547–583
This paper describes NEREL—a Russian news dataset suited for three tasks: nested named entity recognition, relation extraction, and entity linking. Compared to flat entities, nested named entities provide a richer and more complete annotation while also increasing the coverage of relations annotation and entity linking. Relations between nested named entities may cross entity boundaries to ...
Added: September 24, 2023
Государственная биополитическая программа в дискурсе немецких СМИ во время пандемии
Balakina Y. V., Вестник Московского университета. Серия 10: Журналистика 2023 № 5 С. 61–83
The study is aimed at revealing the stance of German non-state media on biopolitical instruments employed by the government during the COVID-19 pandemic. Implementation of restrictive measures as well as vaccination campaign were scrutinized. It was hypothesized that the media taking into account their own goals and owners’ interests might not support state biopolitics in ...
Added: September 18, 2023
Identifying the style by a qualified reader on a short fragment of generated poetry
Orekhov B., / Series Computer Science "arxiv.org". 2023.
Style is an important concept in today's challenges in natural language generating. After the success in the field of image style transfer, the task of text style transfer became actual and attractive. Researchers are also interested in the tasks of style reproducing in generation of the poetic text. Evaluation of style reproducing in natural poetry ...
Added: June 7, 2023
Entity Linking over Nested Named Entities for Russian
Loukachevitch N., Braslavski P., Ivanov V. et al., , in: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022).: Marseille: European Language Resources Association (ELRA), 2022. P. 4458–4466.
In this paper, we describe entity linking annotation over nested named entities in the recently released Russian NEREL dataset for information extraction. The NEREL collection is currently the largest Russian dataset annotated with entities and relations. It includes 933 news texts with annotation of 29 entity types and 49 relation types. The paper describes the ...
Added: April 16, 2023
Cross-Domain Limitations of Neural Models on Biomedical Relation Classification
Alimova I., Tutubalina E., Nikolenko S. I., IEEE Access 2022 Vol. 10 P. 1432–1439
Relation extraction (RE) aims to extract relational facts from plain text, which is essential to the biomedical research field with the rapid growth of biomedical literature and generally large volumes of biomedicine-related text coming from various sources. Numerous annotated corpora and state-of-the-art models have been introduced in the past five years. However, there are no ...
Added: April 10, 2023
NEREL-BIO: A Dataset of Biomedical Abstracts Annotated with Nested Named Entities
Loukachevitch N., Manandhar S., Baral E. et al., Bioinformatics 2023 Vol. 39 No. 4 Article btad161
Motivation This paper describes NEREL-BIO – an annotation scheme and corpus of PubMed abstracts in Russian and smaller number of abstracts in English. NEREL-BIO extends the general domain dataset NEREL (Loukachevitch et al., 2021) by introducing domain-specific entity types. NEREL-BIO annotation scheme covers both general and biomedical domains making it suitable for domain transfer experiments. NEREL-BIO ...
Added: April 5, 2023
Русский литературный пейзаж: между природой и историей
Botchkarev A., Вестник РГГУ. Серия «Литературоведение. Языкознание. Культурология» 2022 № 6-2 С. 192–203
The article deals with the specifics of a Russian literary landscape’s construction as a system of values organized in a particular way. It allows answering the question what is good and what is beauty in the life contextrelevant to a human being. In this regard, special attention is paid to various descriptions. They allow not ...
Added: January 27, 2023
О скуке в русском языковом сознании
Botchkarev A., Slavia 2022 Т. 91 № 4 С. 456–466
The article explores the ways of displaying skuka ‘boredom’ in the Russian language consciousness. The National Russian Corpus is more appropriate for this purpose, because a conceptual configuration of an analyzed concept is not present in a “finished” form in any single utterance, but may be reconstructed only on the totality of all possible utterances. ...
Added: January 23, 2023
Nonveridicality and evaluation: Theoretical, computational and corpus approaches.
Trnavac R., Taboda M., Brill, 2013.
Edited volume ...
Added: January 13, 2023
Discourse relations and evaluation.
Trnavac Radoslava, Das D., Taboada M., Corpora 2016 P. 169–190
n this paper, we examine the role of discourse relations (relations between propositions) in the interpretation of evaluative or opinion words. Through a combination of Rhetorical Structure Theory (or RST; Mann and Thompson, 1988) and Appraisal Theory (Martin and White, 2005), we analyse how different discourse relations modify the evaluative content of opinion words, and what ...
Added: January 8, 2023
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit