Automatic Mining of Cause-Effect Discourse Connectives for Russian

Pisarevskaya D., Kobozeva M., Petukhova Y., Sedov S., Toldova S.

doi:10.1007/978-3-030-37858-5_60

P. 708–718.

The identification of discourse connectives plays an important role in many discourse processing approaches. Connectives include function words usually enumerated in grammars (iz-za ‘due to’, blagodarya ‘thanks to’) and non-grammaticalized expressions (X vedet k Y ‘X leads to Y’, prichina etogo ‘the cause is’). Both types of connectives signal certain relations between discourse units. However, there are no ready-made lists of the second type. We propose a method for expanding a seed list of connectives with candidate non-grammaticalized connectives for Russian, based on their vector representations. First, we compile a list of patterns for this type of connective. These patterns rest on two heuristics: the connectives are often used with anaphoric expressions that substitute for discourse units (thus, some patterns include special anaphoric elements), and the connectives occur more frequently at the beginning of a sentence or after a comma. Second, we build multi-word tokens based on these patterns. Third, we build vector representations for the multi-word tokens that match these patterns. Our experiments based on distributional semantics yield a quite reasonable list of candidate connectives.
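The three steps in the abstract can be sketched roughly as follows. This is a minimal toy illustration, not the authors' implementation: the pattern list, the function names, and the tiny transliterated corpus are all hypothetical, and simple co-occurrence counts stand in for whatever distributional vectors the paper actually used.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical toy patterns (transliterated Russian); not the paper's actual list.
# Following the abstract's heuristic, each pattern is matched only at the
# beginning of a sentence or right after a comma.
PATTERNS = ["prichina etogo", "eto vedet k", "v rezultate etogo"]

# Hypothetical toy corpus, invented for illustration only.
TOY_CORPUS = [
    "dorogu zakryli, iz-za etogo my opozdali",
    "dorogu zakryli, v rezultate etogo my opozdali",
    "Prichina etogo v plokhoi pogode",
]


def merge_multiword(sentence, patterns=PATTERNS):
    """Tokenize a sentence, joining pattern matches into single tokens."""
    text = sentence.lower()
    for pat in patterns:
        regex = re.compile(r"(^|,\s*)" + re.escape(pat) + r"\b")
        joined = pat.replace(" ", "_")
        # Keep the matched prefix (empty or a comma) and insert the joined token.
        text = regex.sub(lambda m: m.group(1) + joined, text)
    return re.findall(r"[\w-]+", text)


def context_vectors(corpus, window=2):
    """Build a co-occurrence count vector for every token in the corpus."""
    vectors = defaultdict(Counter)
    for sentence in corpus:
        tokens = merge_multiword(sentence)
        for i, tok in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[tok][tokens[j]] += 1
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def expand_seed(corpus, seeds, top_n=5):
    """Rank multi-word tokens by similarity to the summed seed vectors."""
    vectors = context_vectors(corpus)
    seed_vec = Counter()
    for seed in seeds:
        seed_vec.update(vectors.get(seed, Counter()))
    candidates = [t for t in vectors if "_" in t and t not in seeds]
    scored = [(t, cosine(vectors[t], seed_vec)) for t in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:top_n]
```

Calling `expand_seed(TOY_CORPUS, {"iz-za"})` ranks the multi-word candidates by how closely their contexts resemble those of the seed connective. On a real corpus the count vectors would presumably be replaced by trained embeddings.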

Language: English

Keywords: discourse connectives, cause-effect relations, automatic mining

In book

Digital Transformation and Global Society: 4th International Conference, DTGS 2019, St. Petersburg, Russia, June 19–21, 2019, Revised Selected Papers

Springer, 2019.

Punctuation in sentences with the conjunction "to est'" (‘that is’) as a reflection of the development of discourse functions

Kuvshinskaya Y. M., Aksenova A. A., in: Язык и метод: Русский язык в лингвистических исследованиях XXI века [Language and Method: The Russian Language in Linguistic Research of the 21st Century]. Vol. 7. Kraków: Wydawnictwo Uniwersytetu Jagiellońskiego, 2021. P. 213–222.

The paper examines punctuation in sentences with the conjunction "to est'" (‘that is’). Data from the Russian National Corpus show a growing frequency of this connective at the beginning of an independent sentence, after a period, even though this usage has not yet been established as a codified norm. The paper discusses the differences between the use of "to est'" in the middle of a sentence and at its beginning: the ability to express ...

Added: October 25, 2023

Proceedings of DISRPT 2019 - The Workshop on Discourse Relation Parsing and Treebanking. NAACL HLT 2019

Association for Computational Linguistics, 2019.

This book summarizes the main topics at the 2019 workshop on Discourse Relation Parsing and Treebanking (DISRPT 2019). Co-located with NAACL 2019 in Minneapolis, the workshop’s aim was to bring together researchers working on corpus-based and computational approaches to discourse relations. In addition to an invited talk, eighteen papers outlined below were presented, four of which ...

Added: April 22, 2020

Contrast and comparison relations in RST framework

Toldova S., Davydova T., Kobozeva M. et al., in: Computational Linguistics and Intellectual Technologies: Papers from the Annual International Conference “Dialogue” (2019). Issue 18. M.: Russian State University for the Humanities, 2019. P. 714–727.

The paper presents a corpus study of the Contrast relation between discourse units in Russian. It is based on data from the Ru-RSTreebank, annotated within the framework of Rhetorical Structure Theory [Mann, Thompson 1988]. The research question is what cue phrases and lexical and grammatical patterns are used to express the ...

Added: April 22, 2020

Automatic Mining of Discourse Connectives for Russian

Toldova S., Pisarevskaya D., Kobozeva M., in: Artificial Intelligence and Natural Language, 7th International Conference, AINL 2018, St. Petersburg, Russia, October 17–19, 2018, Proceedings. Issue 930. Switzerland: Springer, 2018. P. 79–87.

The identification of discourse connectives plays an important role in many discourse processing approaches. Connectives include function words usually enumerated in grammars (iz-za ‘due to’, blagodarya ‘thanks to’) and non-grammaticalized expressions (X vedet k Y ‘X leads to Y’, prichina etogo ‘the cause is’). Both types of connectives signal certain relations between ...

Added: October 26, 2018

The cues for rhetorical relations in Russian: "Cause-Effect" relation in Russian Rhetorical Structure Treebank

Toldova S., Pisarevskaya D., Vasilyeva M. et al., in: Computational Linguistics and Intellectual Technologies: Papers from the Annual International Conference “Dialogue” (Moscow, May 30 – June 2, 2018). Issue 17(24). M.: Russian State University for the Humanities Publishing Center, 2018. P. 747–761.

The purpose of the paper is to investigate cues signalling the relations between discourse units in Russian. Building a lexicon of discourse connectives is an indispensable subtask in many discourse parsing applications as well as an essential issue in theoretical research on text coherence. In order to develop such a resource for Russian, we have ...

Added: September 1, 2018