Coreference in Russian Oral Movie Retellings (the Experience of Coreference Relations Annotation in “Russian CliPS ” corpus)

Toldova S. Yu.; Bergelson M. B.; Khudyakova M. V.

?

Coreference in Russian Oral Movie Retellings (the Experience of Coreference Relations Annotation in “Russian CliPS ” corpus)

P. 769–781.

Toldova S. Yu., Bergelson M. B., Khudyakova M. V.

The work deals with adapting the Russian coreference corpus RuCor annotation system (used for written Russian) to the corpus of Russian oral narratives from the Russian Clinical Pear Stories Corpus (Russian CliPS) (Khudyakova et al., 2016). Russian CLiPS is a corpus of Russian “Pear stories” movie (Chafe, 1980) retellings in clinical populations as compared to neurologically healthy people. The analysis deals with 11 texts by healthy people and 9 texts by people with various types of aphasia. The focus is on the specificity of reference choice in oral retellings and the parameters to be used for the annotation procedure to register deviations in referential choice in spoken discourse as compared to the written one. The specific features for annotation of referential choice in clinical populations are also under discussion. The main claims are as follows. Certain types of speech disfluencies should be integrated into the coreference annotation scheme. These are noun phrases, which are repetitions of a previous referent mention, referent renaming, or name correction. Such occurrences can influence the referent activation; on the other hand, they could shed some light on the process of the referential expression choice. The NP morphosyntactic structure and zero-anaphora should have more granulated set of features for coreference devices, as they are more diverse in spoken discourse. Moreover, certain structures, such as adjectives postposition etc. and some types of zeros are characteristic of referential expressions in spoken discourse.

Language: English

Full text

Text on another site

Keywords: афазия референциальный выбор referential choice анафора corpus annotation аннотация корпуса anaphora aphasia spoken narratives устные нарративы

Publication based on the results of:

Лингвистический, когнитивный и прагматический анализ нарративов из корпуса Russian CliPS (2016)

In book

Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог» (Москва,1–4 июля 2016 г.)

Вып. 15. , М.: Изд-во РГГУ, 2016.

Russian CliPS: a Corpus of Narratives by Brain-Damaged Individuals

Khudyakova M., Bergelson M., Akinina Y. et al., , in: Proceedings of the Tenth conference on International Language Resources and Evaluation (LREC'16), Portoroz, Slovenia : ELRA, 2016. [б.и.], 2016. P. 22–26.

In this paper we present a multimedia corpus of Pear film retellings by people with aphasia (PWA), right hemisphere damage (RHD), and healthy speakers of Russian. Discourse abilities of brain-damaged individuals are still under discussion, and Russian CliPS (Clinical Pear Stories) corpus was created for the thorough analysis of micro- and macro-linguistic levels of narratives by PWA ...

Added: October 13, 2016

Interpretation of “embarrassment” laughter in narratives by people with aphasia and non-language-impaired speakers

Khudyakova M., Bergelson M., , in: Proceedings of the 4th Interdisciplinary Workshop on Laughter and Other Non-verbal Vocalisations in Speech, 14-15 April 2015. [б.и.], 2015. P. 45–46.

We present an attempt to describe the semantics of “embarrassment” laughter in aphasic and nonlanguage-impaired discourse based on the samples from the Russian CliPS corpus based on its place in discourse. ...

Added: June 6, 2016

Lexical Diversity in Different Types of Aphasia

Khudyakova M., Stem-, Spraak- en Taalpathologie 2017 Vol. 22 No. 2 P. 110–112

Added: September 21, 2017

Pre-experiments on Annotation of Russian Coreference Corpus

Toldova S., Azerkovich I., Гришина Ю. et al., / NRU HSE. Series WP BRP "Linguistics". 2015.

Building benchmark corpora in the domain of coreference and anaphora resolution is an important task for developing and evaluating NLP systems and models. Our study is aimed at assessing the feasibility of enhancing corpora with information about coreference relations. The annotation procedure includes identification of text segments that are subjects to annotation (markables), marking their ...

Added: December 15, 2015

Interaction and Empathy as Elements of Narrative Strategies in the Russian CliPS Corpus

Bergelson M., Khudyakova M., , in: Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог» (Москва, 31 мая — 3 июня 2017 г.)Т. 2: Компьютерная лингвистика: лингвистические исследования. Вып. 16. М.: Изд-во РГГУ, 2017. P. 55–67.

This paper focuses on evaluation of discourse abilities of speakers with brain damage: people with dynamic aphasia (PWA(d)) and right hemisphere damage (RHD) as compared to healthy speakers of Russian language. The study is based on the material from the Russian CliPS corpus that contains retellings of the Pear Film produced by PWA and RHD, ...

Added: September 21, 2017

Storytelling in Speakers with and without Brain Damage: What Pathology Can Tell us about the Norm

Bergelson M., Akinina Y., Khudyakova M. et al., Discourse Studies 2018

Narrative discourse is widely studied in clinical and healthy populations. This study investigates discourse strategies that people with left- and right-hemisphere brain damage, as well as healthy speakers, use to tell a story. We analyzed microlinguistic properties of picture-elicited discourses, as well as macrolinguistic features, such as balance between narration and description, and between informational ...

Added: October 20, 2017

Интерпретация русских местоимений в контекстах контрфактического тождества: опыт корпусного исследования

Тискин Д. Б., В кн.: Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог» (Москва, 30 мая — 2 июня 2018 г.)Вып. 17(24). М.: Издательский центр «Российский государственный гуманитарный университет», 2018. С. 735–746.

This paper is a first step towards a corpus-based description of the semantics of Russian pronouns in intensional contexts. Having justified the use of corpus in (formal) semantic research, I delineate a particular issue within the topic: whether a given pronoun is interpreted de se or de re in counteridentity contexts. A counteridentity context is a ...

Added: February 19, 2019

Особые свойства риторических отношений "контраст" и "сравнение" на материале разметки в корпусе Ru-Rstreebank

Соколова Е. Г., Toldova S., В кн.: Труды международной конференции "Корпусная лингвистика - 2019". СПб.: Издательство Санкт-Петербургского университета, 2019. С. 127–133.

The work is devoted to the detection of the Contrast vs. Comparison relations within the framework of the Rhetoric structure theory Mann-Thomson. The analysis of annotated data in terms of logical or pragmatic constraints is suggested. This analysis makes it possible to suggest some operational criteria for the relations under discussion. These criteria together with ...

Added: November 25, 2019

A new Russian Aphasia Test: development and standardization of single-word comprehension subtests

Kuptsova S., Soloukhina O., Dragoy O. et al., Stem-, Spraak- en Taalpathologie 2015 Vol. 20 No. 1 P. 82–84

Added: September 28, 2015

Некатегорический референциальный выбор

Khudyakova M., Kibrik A. A., Dobrov G. B., В кн.: Шестая международная конференция по когнитивной науке: Тезисы докладов. Калининград, 23–27 июня 2014 г.Вып. 6. Калининград: [б.и.], 2014. С. 606–607.

В работе представлены данные экспериментальной оценки результатов многофакьторного моделирования референциального выбора на корпусе RefRhet. ...

Added: October 10, 2014

Production of Referring Expressions: Bridging the gap between cognitive and computational approaches to reference

[б.и.], 2013.

PRE-CogSci 2013 is a follow-up to two successful earlier workshops on the production of referring expressions. The first, PRE-CogSci 2009, focussed on the interplay between computational and empirical methods, organised as part of the 31st CogSci conference in Amsterdam. The second, PRE-CogSci 2011 in Boston, broadened this theme to include work on dialogue and linguistic ...

Added: October 25, 2013

Russian CliPS corpus as a clinical tool for aphasiologists

Khudyakova M., Akinina Y., Bergelson M. et al., Aphasiology 2018

In this paper we present a multimedia corpus of Pear film retellings by people with aphasia, right hemisphere damage, and healthy speakers of Russian. Discourse abilities of brain-damaged individuals are still being a matter of discussion, and Russian CliPS was created for the thorough analysis of micro- and macro-linguistic levels of narratives by PWA and ...

Added: September 29, 2016

Асимметрия употребления местоимений что и кто и морфологическая одушевлённость

Letuchiy A., Труды института русского языка им. В.В. Виноградова 2017 № XIII С. 272–281

The paper focuses on one syntactic restriction on the use of the interrogative pronoun čto ‘what’. Contrary to kto ‘who’, čto disfavours constructions where it is syntactically parallel and co-referent to the anaphoric pronouns on ‘he’, ona ‘she’, and ono ‘it’. For instance, in the construction kogo “ego” (lit. ‘who “he”’), which the Russian speakers use to find out ...

Added: March 15, 2018

Cigogo nominal demonstratives: morphology and semantics

Beletskiy S., , in: Проблемы общей и востоковедной лингвистики. Сочетаемость языковых единиц и языковые модели. Памяти З.М. Шаляпиной (1946-2020). М.: ИВ РАН, 2021. P. 100–111.

This paper presents the system of nominal demonstratives in Cigogo language spoken in Tanzania. The system includes 4 sets of forms build by reduplication of the pronominal prefix that is further suffixed with formants no, o, lya. Besides the special deictic (proximal, medial, distant to/from speaker/hearer) and anaphoric functions, they fulfill a wide range of discourse functions: anaphoric, cataphoric, ...

Added: February 11, 2023

Diffusion-tensor imaging of major white matter tracts and their role in language processing in aphasia.

Ivanova M., Isaev D. Y., Dragoy O. et al., Cortex 2016 Vol. 85 P. 165–181

A growing literature is pointing towards the importance of white matter tracts in understanding the neural mechanisms of language processing, and determining the nature of language deficits and recovery patterns in aphasia. Measurements extracted from diffusion-weighted (DW) images provide comprehensive in-vivo measures of local microstructural properties of fiber pathways. In the current study, we compared ...

Added: June 5, 2016

Referential Choice: Predictability and Its Limits

Kibrik A. A., Khudyakova M., Dobrov G. B. et al., Frontiers in Psychology 2016 Vol. 7 No. 1429 P. 1–21

We report a study of referential choice in discourse production, understood as the choice between various types of referential devices, such as pronouns and full noun phrases. Our goal is to predict referential choice, and to explore to what extent such prediction is possible. Our approach to referential choice includes a cognitively informed theoretical component, ...

Added: September 28, 2016

Discourse features of blogs in subcorpus of Russian Ru-RSTreebank

Toldova S., Davydova T., Kobozeva M. et al., , in: Компьютерная лингвистика и интеллектуальные технологии: по материалам ежегодной международной конференции «Диалог» (Москва, 17–20 июня 2020 г.)Issue 19(26): дополнительный том. -, 2020. P. 747–761.

The paper presents a corpus study of the discourse features in the corpus of blogs. It is based on the data of Ru-RSTreebank annotated within the framework of the Rhetorical Structure theory [Mann, Thompson 1988]. The Ru-RSTreebank represents genres of news and popular science, scientific papers, and blogs texts. Blog subcorpus contains such topics as ...

Added: November 17, 2021

More than the verbal stimulus matters: Visual attention in Language assessment for people with aphasia using multiple-choice image displays.

Heuer S., Ivanova M., Hallowell B., Journal of Speech, Language, and Hearing Research 2017 Vol. 60 P. 1348–1361

Purpose: Language comprehension in people with aphasia (PWA) is frequently evaluated using multiple-choice displays: PWA are asked to choose the image that best corresponds to the verbal stimulus in a display. When a nontarget image is selected, comprehension failure is assumed. However stimulus-driven factors unrelated to linguistic comprehension may influence performance. In this study we ...

Added: June 16, 2017

Оформление прямого дополнения в финно-угорских языках: между предикацией и дискурсом

Сердобольская Н. В., Toldova S., Урало-алтайские исследования 2017 № 26 (4) С. 92–112

The work is devoted to the differential object marking in some of the Finno-Ugric languages, namely in Komi-Zyryan, Besrmyan dialect of the Udmurt language, Moksha-Mordvin language. In the previous works the analysis of the differential object marking phenomenon was limited to the analysis of a sentence, in spite of the fact, that main factors that ...

Added: April 15, 2017

Указательная анафора в мультимодальной коммуникации

Николаева Ю. В., Евдокимова А. А., Budennaya E., В кн.: Компьютерная лингвистика и интеллектуальные технологииВып. 20 (27): Дополнительный том. Изд-во РГГУ, 2021. С. 1130–1143.

Статья посвящена взаимодействию анафорических указательных выражений с параллельными жестами рук и головы на материале мультимодального корпуса RUPEX. Анализ выявил ряд корреляций между ролью говорящего (Рассказчик / Пересказчик / Комментатор) и его невербальным поведением. Было обнаружено, что Пересказчик, не видевший фильма, чаще прибегал к указательной анафоре, по сравнению с Рассказчиком. По-видимому, данными действиями Пересказчик стремился воссоздать ...

Added: August 11, 2021

Плеонастические причастия в современной русской речи: функции и тенденции развития

Ю. М. Кувшинская, Н. А. Зевахина, Acta Linguistica Petropolitana. Труды института лингвистических исследований 2023 Т. 19 № 1 С. 138–192

The paper studies tendencies in the use of full single (i.e. without their arguments) redundant participles in the attributive position in the Russian written discourse. Relying upon the data of the Russian National Corpus and the Corpus of Russian Student Texts, as well as a number of the examples collected from various written sources, the ...

Added: December 8, 2022

Святой источник в Себеже: нарративы о происхождении и формы почитания

Ignatiev D., Живая старина 2020 № 3 С. 5–8

The article adresses the folklore narratives and practices related to the 'holy' spring in the city of Sebezh (Pskov oblast'). Typologically stable features get distinguished from the local specifics. ...

Added: January 5, 2021

Особенности реорганизации речевых зон мозга у больных с разными формами афазии

Kuptsova S., Vlasova R., Dragoy O. et al., Вестник Воронежского государственного университета. Серия: Лингвистика и межкультурная коммуникация 2015 № 4 С. 74–81

The present study is aimed at investigating brain activation patterns associated with languageprocessing in patients with fluent and non-fluent aphasia withdifferent localizations of cerebral lesions. Sixteen healthy subjects and eighteen patients with different forms of aphasia participated in this study. The study was conducted using functional MRimaging method. The data obtained in the study revealed ...

Added: June 5, 2016