Quantitative Analysis of Passives with Agent Phrase Based on Multilingual Parallel Data
Нестеренко Л. В.
Issue 2865. , [б.и.], 2021
, , et al., , in: Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной Международной конференции «Диалог» (Бекасово, 30 мая–3 июня 2012 г.). В 2 томах. Т. 1: Основная программа конференции. Вып. 11.: М.: Российский государственный гуманитарный университет, 2012.. P. 362-369.
The paper presents a project aimed at the development of a Russian Learner Parallel Corpus, discusses the existing analogues, describes the current status and the tasks in which it could be used. The existing parallel corpora contain (comparatively) “correct” translations; whereas the aim of the present project is to create a sufficiently large corpus of ...
Added: February 13, 2013
, , , Lingua Posnaniensis 2016 Vol. 58 No. 2 P. 129-149
The main goal of our paper is to give a first, general description of middle voice in Bantu. As will be shown, this language group has a set of verbal derivational morphemes that challenges some of the concepts related to the middle domain. First of all, as of yet no description has been found of a language ...
Added: October 17, 2020
, Вестник Томского государственного университета 2018 Т. 22 № 435 С. 187-194
As was initially suggested by data-driven teaching pioneers not only the researcher, but also the learner should be given the chance of studying language through corpus or get access to authentic linguistic data. Working on that assumption,the article elaborates on the potential of corpus analysis for the purpose of L2 teaching. Firstly, a succession of ...
Added: January 21, 2018
, Теория и практика перевода 2016 № 1/2016(18) С. 19-25
A grammatical voice (diathesis) is one of the most complex grammar categories in any language. The article offers an insight of syntactic analysis used to study passive Arabic-Russian based on the relevant sources of information. ...
Added: April 12, 2017
Параллельные белорусско-русский и русско-белорусский корпусы: совместный проект Национального корпуса русского языка
, , В кн.: Корпусы национальных языков: модели и технологии. Труды Казанской школы по компьютерной и когнитивной лингвитике TEL-2012. .: Каз.: Издательство «Фэн» Академии наук Республики Татарстан, 2012.. С. 54-60.
Added: April 23, 2013
, , in: Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing. .: Association for Computational Linguistics, 2013.. P. 63-68.
The present paper introduces approach to improve English-Russian sentence alignment, based on POS-tagging of automatically aligned (by HunAlign) source and target texts. The initial hypothesis is tested on a corpus of bitexts. Sequences of POS tags for each sentence (exactly, nouns, adjectives, verbs and pronouns) are processed as “words” and Damerau-Levenshtein distance between them is ...
Added: September 5, 2013
, , , in: Proceedings of the Second Workshop on Corpus-Based Research in the Humanities CRH-2, 25-26 January 2018 Vienna, Austria. .: Wien: Gerastree Proceedings, 2018.. P. 201-205.
The paper discusses the marking of the composition location in the Poetic Corpus of Russian that enables customizing subcorpora by these locations and subsequent search by this parameter. The place names indicated by the authors are extracted, tagged and “normalized”, that is, all the different versions of names and minor locations are boiled down to ...
Added: August 30, 2018
, , Frontiers in Artificial Intelligence and Applications 2016 Vol. 289 P. 130-135
This paper presents the current status of the Latvian-Russian parallel corpus, which is an ongoing project within the Russian National Corpus. It discusses the existing parallel corpora including Latvian texts, availability of sources and the main principles and tools of alignment and morphological annotation, as well as further plans for developing the corpus. ...
Added: August 30, 2018
, , et al., , in: Computational Linguistics and Intellectual Technologies: Proceedings of the International Conference “Dialogue 2018”. .: [б.и.], 2018.. P. 619-636.
The paper addresses an issue of an automatic data collection for lexical typological studies in the Frame approach paradigm. A research in this framework is based on the analysis of distributional properties of the lexemes in question. Hence, questionnaires for such studies consist of typical contexts where lexical items from a given semantic domain can ...
Added: October 17, 2018