Evaluation for morphologically rich language: Russian NLP
Abstract - RU-EVAL is a biennial event organized to assess the state of the art in Russian NLP resources, methods and toolkits, and to compare the various methods and principles implemented for Russian. Russian can be regarded as an under-resourced language because of the lack of freely distributable gold-standard corpora for different NLP tasks (each team had worked out its own standards). Our goal was therefore to establish a uniform basis for comparing systems built on different theoretical and engineering approaches, to create evaluation resources, and to provide a flexible evaluation scheme that differentiates between unacceptable and linguistically "admissible" errors. The paper reports on three events devoted to morphological tagging, dependency parsing and anaphora resolution, respectively.