?
String Similarity Measures for Evaluating the Lemmatisation in Old Church Slavonic
P. 13–35.
Afanasev I., Lyashevskaya O.
In book
Boston, Leiden: Brill, 2024.
Afanasev I., Journal of Language Relationship 2023 Vol. 21 No. 3-4 P. 159–177
Recent advances in computational historical linguistics have inspired a discussion on newly implemented quantitative methods — mainly, it is about their lack of transparency, and the ways to overcome it. This paper aims to demonstrate the advantages of transparency for such tools. The study compares two types of language distance measurement systems used in classification. ...
Added: May 15, 2024
Afanasev I., Lyashevskaya O., Rebrikov Stefan et al., Jazykovedny Casopis 2023 Vol. 74 No. 1 P. 225–233
The need to develop tools for historical and regional variations is becoming more urgent in natural language processing. In this paper, we present two candidate systems for lemmatising historical East Slavic lects (Late Old East Slavic and Middle Russian), as well as modern regional East Slavic lects (Belogornoje and Megra): BERT-based end-to-end pipeline with language-specific ...
Added: December 11, 2023
Plungian V., Урманчиева А. Ю., Slovĕne 2017 Т. 6 № 2 С. 13–56
Перфект, как известно, является одной из самых загадочных форм старославянского языка, семантика которой упорно не поддается описанию. Старославянские тексты представляют собой переводы (прежде всего — с греческого), и в них в значительной степени наблюдается калькирование как в сфере лексики, так и в сфере грамматических форм и конструкций. Но именно перфект нарушает эту картину: соответствия перфектных ...
Added: November 11, 2023
Plungian V., Урманчиева А. Ю., Slavisticna Revija 2018 Т. 66 № 4 С. 421–440
В работе предпринята попытка приблизиться к пониманию семантики старославянского перфекта — аналитической глагольной формы, состоящей из l-причастия смыслового глагола и вспомогательного глагола byti в презенсе. Выделены типичные контексты употребления данной формы; основной задачей исследования является оценка получившегося «семантического портрета» старославянского перфекта с точки зрения типологических ожиданий, сформировавшихся в отношении перфектных форм в языках мира. Показано, ...
Added: November 11, 2023
Afanasev I., Babanov A., , in: Literature, Language and Computing: Russian Contribution.: Springer, 2023.
Added: September 15, 2023
Kislova E., Quaestio Rossica 2019 Т. 7 № 2 С. 475–491
In the early eighteenth century, the main model of literacy training in Russia was the universal “traditional” model based on the Alphabet Book, the Psalter, and the Book of Hours. The Petrine reforms established a new educational model for the clergy, i. e. the so-called “Latin model”. It was described in the Ecclesiastic Regulation and enjoyed state support; by ...
Added: March 12, 2023
Afanasev I., / NRU HSE. Series WP BRP "Linguistics". 2021.
The article considers a lemmatiser that is developed specifically for Old Church Slavonic (OCS). The introduction underlines the problem of the lack of lemmatisers that might deal with different datasets of the OCS. The review gives a short description of previous attempts and current trends in lemmatisation. The lemmatiser is hybrid-based and uses the advantages ...
Added: December 28, 2021
Afanasev I., Journal of Applied Linguistics and Lexicography 2020 Vol. 2 No. 2 P. 147–159
The article focuses on archaeolinguistics as a separate field of knowledge and outlines the features that distinguish it from other disciplines in comparative studies. It analyses the existing text collections and shows how they may find application in a corpus-based research in ancient languages. It also discusses approaches to creating new corpora of texts. The ...
Added: December 28, 2021
Automation In Cognitive Linguistics: The Case Of Old Slavonic And Old Church Slavonic Literary Texts
Afanasev I., Крючкова О. Ю., Lingua Viva 2019 № 29 С. 48–56
The article considers the case of one particular research in the border field between cognitive linguistics and lexical semantics, namely the study of conceptual opposition “us – them”, as exemplified in the Old Slavonic and the Old Church Slavonic literary texts. The researcher faces the problem of processing of hundreds of lexemes, and it is ...
Added: December 28, 2021
Gippius A., В кн.: Sub specie aeternitatis: Сборник научных статей к 60-летию Вадима Борисовича Крысько.: М.: Азбуковник, 2021. С. 560–567.
The article proposes a new interpretation of the ancient Bulgarian inscription of 954/955 from the village Chernoglavtsy (Shumen region). The personal name Neostѫnѧ identified in the text is treated as a neuter *-ȩt- stem. ...
Added: October 28, 2021
Lyashevskaya O., Afanasev I., Jazykovedny Casopis 2021 Vol. 72 No. 2 P. 556–567
We present a hybrid HMM-based PoS tagger for Old Church Slavonic. The training corpus is a portion of one text, Codex Marianus (40k) annotated with the Universal Dependencies UPOS tags in the UD-PROIEL treebank. We perform a number of experiments in within-domain and out-of-domain settings, in which the remaining part of Codex Marianus serves as ...
Added: October 21, 2021
Tyers F. M., Bibaeva M., / Series 2020.iwclul-1.2 "Proceedings of the Sixth International Workshop on Computational Linguistics of Uralic Languages". 2020.
Lemmatisers in Uralic languages are required for dictionary lookup, an
important task for language learners. We explore how to decide which
of the rule-based and unsupervised categories is more efficient to
invest in. We present a comparison of rule-based and unsupervised
lemmatisers, derived from the Giellatekno finite-state morphology
project and the Morfessor surface segmenter trained on Wikipedia,
respectively. The comparison spanned ...
Added: April 20, 2021
Grishchenko A., Die Welt der Slaven. Internationale Halbjahresschrift für Slavistik 2018 Т. LXIII № 2 С. 189–214
This article collects and analyzes all forms of the names of the Hebrew months in the medieval Slavonic-Russian literature. The first list of these names appeared in the multilingual set of names by Pseudo-John of Damascus, translated from Greek into Old Bulgarian and preserved in the Izbornik of 1073. Then other lists of Hebrew months, translated ...
Added: October 21, 2020
Gippius A., , in: Der Hochaltar des Hildesheimer Domes und sein ReliquienschatzVol. 1: Saskia Roth. Der Ort und seine Geschichte.: Schnell und Steiner, 2018. P. 173–182.
A parchment fragment from the medieval reliquary of Hildesheim Cathedral is shown to be an 11th century Bulgarian merchant’s letter. ...
Added: February 21, 2019
Dereza O., , in: Proceedings of Third Workshop "Computational linguistics and language science"Issue 4.: Manchester: EasyChair, 2019. P. 113–124.
Lemmatisation, which is one of the most important stages of text preprocessing, consists in grouping the inflected forms of a word together so they can be analysed as a single item. This task is often considered solved for most modern languages irregardless of their morphological type, but the situation is dramatically different for ancient languages. Rich inflectional system and ...
Added: December 12, 2018
Bucharest: Editura Academiei Romane/Publishing House of the Romanian Academy, 2016.
Added: October 30, 2018
Dragoş Gh. Năstăsoiu, Adashinskaya A., MuseIKON. A Journal of Religious Art and Culture/Revue d’Art et de Culture Religieuse 2017 No. 1 P. 25–44
Bien que la restauration de l’église de Ribiţa (région de Hunedoara) eût démarré aux années 1994-1995 et qu’elle soit encore inachevée, les travaux de décapage des peintures murales ont mis en lumière un certain nombre de nouvelles données permettant à formuler certaines hypothèses sur la datation des peintures murales. Poussée par le désir de corriger ...
Added: October 30, 2018
Polivanova A., М.: Институт славяноведения РАН, 2018.
База данных позволяет осуществлять поиск и делать разнообразные выборки по материалу грамматического и корневого словарей книги [Поливанова А. К. Старославянский язык: Грамматика. Словари.]. Предусмотрена возможность использования виртуальной клавиатуры, запросов с переменными, прямой и инвертированной сортировки по любому из полей, частичных фильтров и других инструментов электронных баз данных. ...
Added: October 24, 2018
Zanchi C., Naccarato C., , in: Типология морфосинтаксических параметров. Материалы международной конференции «Типология морфосинтаксических параметров 2016»Вып. 3.: М.: МГПУ, 2016. P. 359–390.
This paper is devoted to Old Church Slavonic (OCS) and Old Russian (OR) compound verbs with stacked prefixes. Although prefixes are a well investigated topic as regards modern Slavic languages, multiple prefixation in ancient Slavic languages still needs to be extensively explored. This work is a further step in this direction: via a careful manual ...
Added: October 4, 2018