Coreference resolution for Russian: the impact of semantic features

S. Toldova; Maxim Ionov

?

Coreference resolution for Russian: the impact of semantic features

P. 339–348.

Toldova S., Maxim Ionov

This paper presents the results of our experiments on building a general coreference resolution system for Russian. The main aim of those experiments was to set a baseline for this task for Russian using the standard set of features developed and tested for coreference resolution systems created for other languages. We propose several baseline systems, both rule-based and ML-based. We show that adding some semantic information is crucial for the task and even the small amount of data can improve the overall result. We show that different types of semantic resources affect the performance differently and sometimes more does not imply better.

Language: English

Full text

Text on another site

Keywords: ontologies semantic features coreference resolution

In book

Computational Linguistics and Intellectual Technologies. International Conference "Dialogue 2017" Proceedings

Vol. 1. Issue 16 (23). , M.: -, 2017.

Среда Онтологически Контролируемых Вычислительных Экспериментов в Химии и Материаловедении

Glushko A., Neznanov A., В кн.: Перспективные материалы и технологии (ПМТ-2024) : Сборник докладов Международной научно-технической конференции ИПТИП РТУ МИРЭА, Москва, 12–16 апреля 2024 годаТ. 1.: М.: РТУ МИРЭА, 2024. С. 380–385.

In this paper we would like to discuss the basic principles, design decisions and tools that formed the basis of a software system for analyzing the results of real experiments and performing computational experiments in chemistry and materials science. With this work we aim to formalize knowledge at multiple levels and improving the efficiency of ...

Added: April 29, 2026

Сообщество, связанное в общее тело, и тела, созданные одним аффектом

Петров К. А., Логос 2025 Т. 35 № 5 С. 93–114

The concept of “enactment” refers to the idea of the contingency of the body/technology boundary and as such it’s the basis for the emerging multiple ontologies of bodies in the Annemarie Mol’s texts. However, “enactment” does not mean the absolute malleability of the body. For authors working within the framework of actor-network theory the body ...

Added: November 18, 2025

Architecture of a software system for designing robust business processes

Samoylova K., Zamyatina E., Proceedings of the Institute for System Programming of the RAS 2022 Vol. 34 No. 2 P. 67–76

Nowadays, in order for a company to remain competitive, efficient and attractive to investors it needs to have reliable and threat-resistant business processes. The question of methods for building such business processes remains relevant. This paper proposes a software system, which involves the use of methods and tools of DSM (Domain Specific Modeling), ontological approach, ...

Added: February 13, 2023

2022 IEEE 24th Conference on Business Informatics (CBI)

IEEE, 2022.

CBI is a well-established conference series on business informatics that has a tradition of hosting workshops on topics related to its main themes. CBI workshops provide ample room for discussion of recent business informatics developments, as well as new and emerging ideas. ...

Added: December 6, 2022

Онтологический подход к интеграции информации в областях с интенсивным использованием данных

Заякин В. С., Lyadova L. N., Рабчевский Е. А., Информационные технологии 2022 Т. 28 № 10 С. 529–538

The development and support of knowledge-based systems for experts in the field of social network analysis (SNA) is complicated because of the problems of viability maintenance that inevitably emerge in data intensive domains. Largely this is the case due to the properties of semi-structured objects and processes that are analyzed by data specialists using data ...

Added: October 22, 2022

Modelling of Developing Socio-Economic Systems Using Multiparadigm Simulation Modelling: Advancing Towards Complexity Theory and Synergetics

Lychkina N. N., , in: World Organization of Systems and Cybernetics 18. Congress-WOSC2021: Systems Approach and Cybernetics: Engaging for the Future of MankindVol. 495.: Springer, 2022. Ch. III P. 191–204.

Purpose The goal of this research is to demonstrate model designs and approaches based on using modern paradigms and technological solutions in the field of simulation modeling of socio-economic processes and social forecasting that allow us to study complex dynamic occurrences in the development of socio-economic systems. Strategic management of socio-economic system involves the analysis of structural ...

Added: October 31, 2021

Modeling of the strategic development of socio-economic systems based on hybrid simulation and ontologies

Lychkina N. N., , in: Systems approach and cybernetics, directed towards the future of mankind. Collection of materials of the 18th Congress WOSC2021 “Systems approach and cybernetics, directed to the future of mankind”.: M.: Cogito-Centre–IPRAS Publishing House, 2021. P. 153–154.

Added: October 12, 2021

Communications in Computer and Information Science. 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering, and Knowledge Management, IC3K 2019, Vienna, Austria, September 17-19, 2019, Revised Selected Papers

Switzerland: Springer, 2020.

This book constitutes the revised selected papers of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2019, held in Vienna, Austria, in September 2019. The 25 full papers presented were carefully reviewed and selected from 220 submissions. The papers are organized in topical sections on knowledge discovery and information retrieval; ...

Added: February 4, 2021

Proceedings of the First Workshop on Computational Approaches to Discourse

Association for Computational Linguistics, 2020.

Added: November 18, 2020

A new Russian paraphrase corpus. Paraphrase identification and classification based on different prediction models

Pronoza E., Yagunova E., , in: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 17th International Conference on Intelligent Text Processing and Computational Linguistics, CICLing 2016Vol. 1. Issue 9623.: Springer Publishing Company, 2018. P. 573–587.

Our main objectives are constructing a paraphrase corpus for Russian and developing of the paraphrase identification and classification models based on this corpus. The corpus consists of pairs of news headlines from different media agencies which are extracted and analyzed in real time. Paraphrase candidates are extracted using an unsupervised matrix similarity metric: if the ...

Added: October 30, 2020

Сумерки урбанизма: пространственные онтологии и воображение в романе "Чевенгур"

Zamyatin D., В кн.: На самой черте горизонта: платоновские пространства. Поэтика Андрея Платонова. Сборник 4 / Ред. Е. А. Яблоков. — М.: ПОЛИМЕДИА, 2019. — 176 с.: М.: Полимедиа, 2019. С. 17–41.

Роман Андрея Платонова «Чевенгур» – пример мощного пространственного воображения, трансформирующего онтологии самого языка, в данном случае – русского. В то же время этот роман является феноменологическим свидетельством решающего изменения самих пространственных онтологий, связанного с «взрывом» представлений о пространстве-времени начала XX века . Вместе с тем, реальность событий и ключевого нарратива «Чевенгура» оказывается действительно «судьбоносной», если ...

Added: September 27, 2019

Checking the Data Complexity of Ontology-Mediated Queries: A Case Study with Non-uniform CSPs and Polyanna

Gerasimova O., Kikot S., Zakharyaschev M., , in: Description Logic, Theory Combination, and All That.: Berlin: Springer, 2019. P. 329–351.

It has recently been shown that first-order- and datalog-rewritability of ontology-mediated queries (OMQs) with expressive ontologies can be checked in NExpTime using a reduction to CSPs. In this paper, we present a case study for OMQs with Boolean conjunctive queries and a fixed ontology consisting of a single covering axiom 𝐴 -> 𝐹 v 𝑇, A -> F v T, possibly supplemented with ...

Added: July 29, 2019

Description Logic, Theory Combination, and All That

Berlin: Springer, 2019.

This Festschrift has been put together on the occasion of Franz Baader's 60th birthday to celebrate his fundamental and highly influential scientific contributions. The 30 papers in this volume cover several scientific areas that Franz Baader has been working on during the last three decades, including description logics, term rewriting, and the combination of decision procedures. We hope that ...

Added: July 29, 2019

Knowledge Management in Organizations. 14th International Conference, KMO 2019, Zamora, Spain, July 15–18, 2019, Proceedings

Switzerland: Springer, 2019.

This book contains the refereed proceedings of the 14th International Conference on Knowledge Management in Organizations, KMO 2019, held in Zamora, Spain, in July 2019. The 46 papers accepted for KMO 2018 were selected from 109 submissions and are organized in topical sections on: knowledge management models and analysis; knowledge transfer and learning; knowledge and service ...

Added: June 14, 2019

Proceedings 16th Russian Conference on Artificial Intelligence (RCAI 2018)

Cham: Springer, 2018.

This book constitutes the proceedings of the 16th Russian Conference on Artificial Intelligence, RCAI 2018, Moscow, Russia, in September 2018. The 22 full papers presented along with 4 short papers in this volume were carefully reviewed and selected from 75 submissions. The conference deals with a wide range of topics, including data mining and knowledge discovery, text mining, ...

Added: November 10, 2018

Employing Wikipedia data for coreference resolution in Russian

Azerkovich I., , in: Artificial Intelligence and Natural Language, 7th International Conference, AINL 2018, St. Petersburg, Russia, October 17–19, 2018, ProceedingsIssue 930.: Switzerland: Springer, 2018. P. 107–112.

Semantic information has been deemed a valuable resource for solving the task of coreference resolution by many researchers. Unfortunately, not much has been done in the direction of using this data when working with Russian data. This work describes the first step of a research, attempting to create a coreference resolution system for Russian based on semantic data, concerned with ...

Added: September 5, 2018

Features for Discourse-New Referent Detection in Russian

Toldova S., Ionov M., , in: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 17th International Conference on Intelligent Text Processing and Computational Linguistics, CICLing 2016Vol. 1. Issue 9623.: Springer Publishing Company, 2018. P. 648–662.

This paper concerns discourse-new mention detection in Russian. This might be helpful for different NLP applications such as coreference resolution, protagonist identification, summarization and different tasks of information extraction to detect the mention of an entity newly introduced into discourse. In our work, we are dealing with the Russian where there is no grammatical devices, ...

Added: September 1, 2018

Новые онтологии архитектуры и архитектуры новых онтологий

Maiorova K., Социология власти 2017 Т. 29 № 1 С. 19–40

The aim of this article is to highlight the relationships between contemporary tendencies in the humanities (the new ontologies) and contemporary architectural practices. The author articulates the distinction between the optics of the «old ontologies» and the new ones. The ontologies considered to be new ones are flat, free from classical opposition between the whole ...

Added: January 31, 2018

Identification of Singleton Mentions in Russian

Toldova S., Max Ionov, , in: CLLS 2016. Computational Linguistics and Language Science. Proceedings of the Workshop on Computational Linguistics and Language Science. Moscow, Russia, April 26, 2016Vol. 1886.: Aachen: CEUR Workshop Proceedings, 2017. Ch. 5 P. 33–41.

This paper describes a pilot study of the problem of detecting singleton mentions in Russian texts. A noun phrase is considered a singleton mention if it is the only referent of some entity. We discuss various morphosyntactic and lexical features, some of which were used for analogous tasks for English and propose new features derived ...

Added: November 9, 2017

Strategic Development and Dynamic Models of Supply Chains: Search for Effective Model Constructions

Lychkina N. N., , in: Lecture Notes in Networks and Systems. Proceedings of SAI Intelligent Systems Conference (IntelliSys) 2016Vol. 2.: L.: Springer, 2018. Ch. 2 P. 175–185.

In this research the author studies the approaches to simulation of strategic development of supply chains and formation of cooperation strategies on the basis of the methods of ontology engineering, complex system dynamics simulation and agent-based simulation. ...

Added: September 4, 2017

Strategic development and dynamic models of supply chains: search for effective model constructions.

Lychkina N. N., , in: Proceedings of the 2016 SAI Intelligent Systems Conference (INTELLISYS).: L.: IEEE, 2016. P. 737–743.

Added: February 25, 2017

Mention Detection for Improving Coreference Resolution in Russian Texts: A Machine Learning Approach

Toldova S., Ionov M., Computacion y Sistemas 2016 Vol. 20 No. 4 P. 681–696

The paper concerns discourse-new referent detection. The task of coreference resolution is essential in many text-mining applications. The focus in this task is to detect noun phrases (NPs) that refer to the same entity. In languages without articles, there are no overt grammatical clues in an NP for whether it introduces a new referent into ...

Added: December 27, 2016

Error analysis for anaphora resolution in Russian: new challenging issues for anaphora resolution task in a morphologically rich language

Anna Roytberg, Toldova S., Alina Ladygina et al., , in: Proceedings of the Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2016), co-located with NAACL 2016, San Diego, California, June 16, 2016.: Stroudsburg, PA: Association for Computational Linguistics, 2016. P. 74–83.

This paper presents a quantitative and qualitative error analysis of Russian anaphora resolvers which participated in the RU-EVAL event. Its aim is to identify and characterize a set of challenging errors common to stateof-the-art systems dealing with Russian. We examined three types of pronouns: 3rd person pronouns, reflexive and relative pronouns. The investigation has shown ...

Added: December 7, 2016