• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site
Of all publications in the section: 150
Sort:
by name
by year
Working paper
Zarifyan M., Melnik A. A., Vyrenkova A. S. Linguistics. WP BRP. НИУ ВШЭ, 2016
This paper presents a set of drills that train students' competence in using the adjectives that belong to the semantic domain of FAR. The drills are based on and exemplify theoretical results of a study focusing around the comparison of synonymous adjectives across six languages and aim at granting students with a systematic and holistic picture of the domain in question. The training materials highlight the basic semantic parametres governing the correct lexical choice as well as the relevant distributional and grammatical patterns. Exemplified here are the drills for Russian and English languages. p.p1 {margin: 0.0px 0.0px 0.0px 0.0px; font: 12.0px Helvetica}
Added: Dec 13, 2016
Working paper
Nina Ladinskaya, Chrabaszcz A., Lopukhina A. Linguistics. WP BRP. НИУ ВШЭ, 2019. No. 18.
While different researchers agree that the acquisition of Russian nominal cases proceeds somewhat sequentially, there is no consensus about the exact order of case acquisition in the literature on L1 acquisition (see Ceitlin, 2000; Gvozdev, 1981, 2007; Gagarina & Voeikova, 2009). Besides, accumulated longitudinal data are sparse and disparate, coming from children of different ages, socio-economic statuses, and language acquisition backgrounds. We adopt a psycholinguistic approach to examine acquisition of Russian nominal case inflections by Russian monolingual children (2-5 years old). The goal of the study is twofold: it sets out 1) to examine at what age children learn to generalize rules of noun case usage, and 2) to identify the order of acquisition of Russian oblique case inflections. Children perform a picture-based sentence completion task in which they have to finish the sentence by naming real or nonexisting objects in the pictures. Five sentence frames are constructed to bias the children’s responses towards the use of a noun form in one of the five oblique Russian cases, across three declensions plus plural forms. Data collection is in progress, but interim results show that monolingual Russian-speaking children learn to generalize morphological rules to novel nouns by the age of 2 and that nouns in the plural form are acquired later in language development compared to singular forms. Within the singular forms, 3rd declension cases, especially the instrumental case, present most difficulty. Additionally, 2-3-year-old children tend to substitute oblique cases with the nominative case forms. The results corroborate some of the previous findings and add additional insights into the acquisition of Russian case.
Added: Oct 30, 2019
Working paper
Lyashevskaya O., Vlasova E., Litvintseva K. et al. Linguistics. WP BRP. НИУ ВШЭ, 2018. No. 77.
A data analysis tool of the Corpus of Russian Poetry (a part of the Russian National Corpus) is designed for quantitative research in various areas of versology and linguistics aspects of poetic texts. The core part, a statistic database of the corpus, includes annotation at the level of texts, verses, words as well as patterns of words, letters and stress. The tool allows a user to study certain properties (e. g. rhyming patterns, lexical co-occurrence) taken alone and in their interaction, both in the whole corpus and in subcorpora. Besides that, it facilitates contrastive studies of two chosen subcorpora and a few case studies demonstrating applicable methods and potential for further research in the field of the Digital literary studies.
Added: Dec 13, 2018
Working paper
Kazun A. D., Kazun A. Political Science. PS. Высшая школа экономики, 2017. No. WP BRP 51/PS/2017.
Donald Trump and his team have often been accused of having close ties to Russia and Russians expressed much greater support for Trump than residents of other countries. This article provides the first systematic analysis of Russian media coverage of Trump's activities during and after the election campaign. It concludes, on the basis of a sentiment analysis of relevant articles and network agendas of 500 Russian magazines and 250 leading federal newspapers, that Trump's media portrayal was not necessarily positive. During the election, Trump was portrayed by the Russian media not as Russia's favorite candidate, but as Hillary Clinton's opponent and a critic of U.S. recent policies. Only for a short period after Trump's victory in the elections, did the Russian media represent him as a friend of Russia, since there was hope that the new president would lift political and economic sanctions. Trump's policies failed to meet the expectations of the Russian people, and from the beginning of 2017, media coverage of Trump has become critical, hopes for lifting of political sanctions have weakened, and public opinion about the U.S. president has turned negative.
Added: Nov 13, 2017
Working paper
Koltsova O., Pashakhin S. Basic research program. WP BRP. National Research University Higher School of Economics, 2017. No. 79/SOC/2017.
Although conflict representation in media has been widely studied, few attempts have been made to perform large-scale comparisons of agendas in the media of conflicting parties, especially for armed country-level confrontations. In this paper, we introduce quantitative evidence of agenda divergence between the media of conflicting parties in the course of the Ukrainian crisis 2013-2014. Using 45,000 messages from the online newsfeeds of a Russian and a Ukrainian TV channels, we perform topic modelling coupled with qualitative analysis to reveal crisis-related topics, assess their salience and map evolution of attention of both channels to each of those topics. We find that the two channels produce fundamentally different agenda sequences: in particular, while the Russian channel pays little attention to confrontation between the Ukrainian government and the opposition before the regime change, the Ukrainian channel is less inclined to cover armed violence in East Ukraine and refugees after the regime change.
Added: Dec 29, 2017
Working paper
Kazun A. D. Political Science. PS. Высшая школа экономики, 2017. No. WP BRP 49/PS/2017.
The purpose of this study is to test the agenda-setting theory, according to which the media has a significant impact on what people consider to be important events. We compare the results of the Levada Center surveys on the most memorable events of the month with the number of publications on these issues in the press. We focused on the period from January 2014 to December 2016. A total of 884 events were analyzed in the article. The results of the study confirm the impact of discussions in the media on people's attention to the problem. The results also show that the discussions in the media one week before the date of polling are more important than the issues covered over the entire month. People better remember those events which took place shortly before the polling, as well as the events the discussion of which intensified during this period. It is also important to note the role of regional publications in the sensitization of the public to various issues. The issues covered by the national newspapers and news agencies, but ignored by the regional press, are much worse remembered by the population. The results of the study are controlled for background of the discussion: the presence of important events in each relevant month, which monopolized the public attention, is taken into account.
Added: Sep 6, 2017
Working paper
Vinogradov A. Working Papers of Humanities. WP. Издательский дом НИУ ВШЭ, 2013. No. WP BRP 31/HUM/2013.
The Acts of Andrew and Matthias contains in a not evident form a feast cycle, which could represent a liturgical practice of the 4th c.-community where the Acts were written. Three days of Andrew’s torments could be linked with Holy Wednesday, Holy Thursday and Good Friday. They are preceded by 27 or 28 days, i.e., 4 weeks of grief and some kind of fast. Andrew’s torments are followed by his miraculous healing that could be compared to Resurrection and Easter. After it Andrew is most likely to have baptized the cannibals ; it reminds one of an ancient tradition of single baptism on Holy Saturday. Finally Andrew has to remain in the city for seven more days ; together with the day of baptism they form those eight days for neophytes’ confirmation before ablutions and at the same time complete the Octave of Easter, that ends with taking baptismal cloths off the neophytes. Andrew’s departure could also be compared with Ascension that was linked with While Sunday. A liturgical circle could be seen on a lower level as well in ch. 2–3 : Matthias’ prayer – Christ’s admonition with citation of the Gospel – «peace be with you» – Matthias’ exclamation – singing of Psalms.
Added: May 15, 2013
Working paper
Bergelson M., Kibrik A. A. Linguistics. WP BRP. НИУ ВШЭ, 2017. No. 55.
The paper describes the ongoing research project on Ninilchik Russian as a unique variety of the Russian language. We believe it is a remnant of Alaskan Russian – a language that emerged at the end of the 18th century as a result of Russian colonial presence in Alaska and served as a means of communication in Russian America until the end of the Russian period in 1867. By that time Alaskan Russian became the native language for the people of mixed Russian/Native origin residing in various parts of Alaska. Ninilchik was one such place and, due to many factors combined, became a major location where this linguistic variety kept developing and serving as a means of communication, creating and maintaining cultural identity, and holding together the community of brave, persistent, and self-sustained people. Thanks to the people of Ninilchik, Alaskan Russian is still alive in the 21st century. The paper deals with two aspects of this multifaceted linguistic phenomenon. One is a theoretical problem of the “archaeological approach” to the language data which reflects a rather short but diverse history of Alaskan Russian and involves contact studies. Another is the Ninilchik Russian Dictionary project that allows to record both items and concepts, s well as the sociocultural narratives together making up the special story of the linguistic and cultural community
Added: Dec 3, 2017
Working paper
Plisetskaya A. D. Linguistics. WP BRP. НИУ ВШЭ, 2013. No. WP BRP 03/LNG/2013.
This article focuses on the presidential rhetoric of Russian and American leaders at the end of their latest campaigns, taking their victory speeches given immediately after winning their respective elections as examples. The comparative cross-cultural research presented in this study includes cognitive, corpus, and rhetorical approaches and is carried out within the framework of critical discourse analysis. The interconnection between language, culture and politics is evident through metaphors used by national leaders. The metaphor THE RUSSIAN NATION IS AT WAR reconstructed in President Putin’s victory speech is quite different from the metaphor THE AMERICAN NATION IS ONE FAMILY found in President Obama’s victory speech. Archetypal metaphors found in both speeches reflect public values that turn out to be highly contrastive and explain some cultural and political differences between the great powers.
Added: Jan 17, 2014
Working paper
Starchenko A., Kazakevich L., Lyashevskaya O. Linguistics. WP BRP. НИУ ВШЭ, 2018. No. 76.
The poetic texts pose a challenge to full morphological tagging and lemmatization since the authors seek to extend the vocabulary, employ morphologically and semantically deficient forms, go beyond standard syntactic templates, use non-projective constructions and non-standard word order, among other techniques of the creative language game. In this paper we evaluate a number of probabilistic taggers based on decision trees, CRF and neural network algorithms as well as one state-of-the-art dictionary-based tagger. The taggers were trained on prosaic texts and tested on three poetic samples of different complexity. Firstly, we discuss the method to compile the gold standard datasets for the Russian poetry. Secondly, we evaluate the taggers’ performance in the identification of the part of speech tags and lemmas. Finally, we analyze different types of errors in the taggers’ output. We analyse the confusion matrix of the parts of speech and mismatches in lemma annotation.
Added: Dec 12, 2018
Working paper
Fenogenova A., Kuzmenko E. Linguistics. WP BRP. НИУ ВШЭ, 2016
Learner corpora constitute an effective resource for specialists in fields of second language acquisition, foreign language teaching and corpus linguistics. They tend to get significant scholarly help from statistical tools of various kinds. However, for valuable usage of a corpus it should provide convenient and powerful tools for searching and manipulating data. In this paper we focus on searching tools, presented in \textit{Russian Error-Annotated Learner English Corpus (REALEC)}, report our attempts to improve the format of the searching tools in our corpora. We also provide evidences that database search is much more efficient than common text search and demonstrate that search functionality in corpora is of great importance for research efficiency and extensive facilities.
Added: Dec 14, 2016
Working paper
Lyashevskaya O., Пантелеева И. М. Linguistics. WP BRP. НИУ ВШЭ, 2017
The paper presents a Universal Dependencies (UD) annotation scheme for a learner English corpus. The REALEC dataset consists of essays written in English by Russian-speaking university students in the course of general English. The essays are a part of students' preparation for the independent final examination similar to the international English exam. While adjusting existing dependency parsing tools to a learner data, one has to take into account to what extent students' mistakes provoke errors in the parser output. The ungrammatical and stylistically inappropriate utterances may challenge parsers' algorithms trained on grammatically appropriate written texts. In our experiments, we compared the output of the dependency parser UDpipe (trained on UD-English 2.0) with the results of manual parsing, placing a particular focus on parses of ungrammatical English clauses. We show how mistakes made by students influence the work of the parser. Overall, UDpipe performed reasonably well (UAS 92.9, LAS 91.7). The following cases cause the errors in automatic annotation a) incorrect detection of a head, b) incorrect detection of the relation type, as well as c) both. We propose some solutions which could improve the automatic output and thus make the assessment of syntactic complexity more reliable.
Added: Dec 15, 2017
Working paper
Khazova A. Linguistics. WP BRP. НИУ ВШЭ, 2018. No. 71.
The article deals with the method of automatic detection of authors ' gender identity on the material of fiction prose of 1980-2000. During this period, there is a special construct, called "women's prose", which is characterized by a special genre and stylistic originality. We set ourselves the task to find out whether the concept of “women's prose” refers only to the non-text reality or is clearly reflected at the level of language. We have collected corpus of texts 1980-2000 and conducted that identified the most effective machine learning algorithms for the classification of male and female prose.
Added: Dec 11, 2018
Working paper
Dolgorukova N. Working Papers of Humanities. WP. Издательский дом НИУ ВШЭ, 2016
The paper analyses the first French critiques of the two Mikhail Bakhtin’s monographs and the careful exploration of these reviews enables to explain why they presented him as a formalist. It also traces the reasons of irrelevance of the thinker’s ideas in the early French reception.   
Added: Oct 18, 2016
Working paper
Orekhov B. Linguistics. WP BRP. НИУ ВШЭ, 2015
There is a special type of corpora, a poetic corpus, which have a special metric markup. They allow us to do the study of poetry by the methods that are familiar to corpus linguistics in general. We have created the Bashkir poetic corpus. It allows us to answer some questions about the structure of the Bashkir poetry. Texts in the corpus are annotated with morphological tags, each single token having a set of tags, and with special metric tags, enabling search in lines of specific meter, in rhyming position, etc.
Added: Apr 15, 2015
Working paper
Kozlov A. Linguistics. WP BRP. НИУ ВШЭ, 2018
The paper focuses on the syntax of noun phrases in Chukchi (Chukokto-Kamchatkan). In Chukchi, structure of noun phrases is drastically different dependent on the case of  its head in several respects: we discuss expression of number, incorporation of adnominal dependents and order of constituent. We suggest several order hierarchies, which can partially explain the observed facts.
Added: Dec 14, 2018
Working paper
Виняр А. И., Налетова П. Р., Казакова П. Н. Linguistics. WP BRP. НИУ ВШЭ, 2017. № 58.
This paper is devoted to denominal verb constructions (DNV construction, in a sense of [Gerdts & Marlett 2008]) in Chukchi language. The goals of this paper are twofold. The first aim is purely descriptive: we will clarify the description of morphosyntax and semantics of six affixal verbs which participate in the formation of DNVs in Chukchi, and also suggest a new way of analyzing the polysemy of these and other Chukchi affixal predicates. The second aim of the paper is to compare DNVs in Chukchi to the closely related phenomenon of Noun Incorporation in the same language. We will highlight the similarities between the two constructions and examine different approaches ([Mithun 1997], [Johns 2007], [Gerdts & Marlett 2008], [Muro 2008] and [Barrie 2006]) to the relationship between these phenomena. We will show that it is natural to analyze Chukchi DNVs as a result of the grammaticalization of noun incorporation construction (see [Mithun 1997] for the similar analysis for Salish and Wakashan languages).
Added: Dec 12, 2017
Working paper
Arkhangelskiy T. Linguistics. WP BRP. НИУ ВШЭ, 2014. No. 10.
The Beserman dialect of the Udmurt language has a large number of clitics, mainly discourse markers, which have never been described in detail. These clitics occur very frequently in texts and often appear in clusters. The rules which govern the ordering of clitics within a cluster are especially interesting because the binary relation “clitic A normally precedes clitic B in a cluster” proves to be intransitive, thus making it impossible to use a clitic template to describe the ordering. In this paper, the rules which govern placement of individual clitics in a sentence and their mutual order in clusters are described. The research is based on a corpus of approximately 64,000 tokens and the authorʼs own field data collected in the village of Shamardan, Yukamensk region, Udmurtia.
Added: Oct 4, 2014
Working paper
Khudyakova M., Shapiro Maria M., Grabovskaya M. et al. Linguistics. WP BRP. НИУ ВШЭ, 2016. No. 48.
Coherence is a semantic property of the text to make sense to readers or listeners and is crucial for any text. Various coherence measures have been developed for assessment of discourse abilities in different clinical populations. However, the results of decades of research on coherence of speech of individuals with brain damage have yielded contradictive results. We suggest that this might be due to the different sensitivity of the methods. In this study we two measures of global coherence and five measures of local coherence on the same set of texts by healthy speakers of Russian and people with dynamic aphasia in order to find which methods allow to distinguish between the two groups and how these results correlate. The material for the study is texts from the Russian CliPS corpus which is a collection of oral retellings of the pear film by individuals with brain damage and healthy speakers of Russian language.
Added: Oct 13, 2016
Working paper
Tagabileva M., Kirianov D. Linguistics. WP BRP. НИУ ВШЭ, 2014. No. 17/LNG/2014.
This paper describes the competition of complementation strategies of Russian verbs of request. We conducted a corpus-based study of six strategies of compelement encoding: infinitive, nominalisation, prepositional phrase with nominalisation, finite clause introduced with complementizer čtoby, non-finite clause introduced with complementizer čtoby (i.e. čtoby + infinitive), prepositional phrase with čtoby and expletive pronoun to (o tom čtoby). The results suggest that there are several factors influencing the competition under consideration (such as coreference of different participants, request dictance, etc.) and that these factors belong to different language domains, including semantics. This challenges the classic point of view according to which complementation studies should be focused primarily on syntax. 
Added: Mar 25, 2015
Working paper
Sibirtseva V. Linguistics. WP BRP. НИУ ВШЭ, 2014. No. 7.
Currently many software applications, enabling text analysis, are being created for different purposes (semantic reference tools, concordancers, sentiment analysis etc.), but not used by literary researchers. Computer software allows to facilitate the search of the required information and to save time considerably. With such approach to the field of linguistic and literary analysis, a comparative analysis in particular, new opportunities and unexpected horizons are being offered. The present paper suggest a critical review of existing computer resources, related to text processing, and a consistent description of application of programs, successfully tested on literary materials and used for text analysis at the Faculty of Humanities (HSE Branch in Nizhny Novgorod): linguistic annotated text corpora; collections of literary texts of one author; different computer tools such as AntConc concordancer, multifunctional text analyzer LEKTA, LF aligner for text alignment, i.e. those tools which allow varying of loaded and analyzed text collections. Computer-based text analysis shall be practiced only with further literary description and interpretation. With this comparison of data, retrieved in the process of computer-based analysis, with existing traditional researches may become a new stage of literary text analysis.
Added: Apr 25, 2014