The Second Genitive in Russian
This paper is an overview of the so-called second genitive in Russian, a nominal form available for a minority of Russian nouns but widely used with these nouns in certain contexts. In many ways, the second genitive is a secondary case: it may always be replaced by a regular genitive form, while the opposite is not true. A major subset of the contexts where the second genitive may be used fits into what is known as the functional category of partitive, so this form is sometimes called the Russian partitive. To a certain extent, indeed, the second genitive is the form that may replace the regular genitive in partitive contexts. The analysis of the distribution of the second genitive shows, however, that the partitive meaning is not the only function of this form. No less widespread, if not more so, are uses in combination with prepositions. These and other types of contexts should be taken into account to build a comprehensive picture of the category's distribution and functional load.
The article discusses the most recent trends in the development of the English progressive. A corpus-based approach to linguistic research is seen as an effective means of determining the reliability of the data retrieved, and it helps track the major diachronic dynamic in the increasing frequency of the progressive aspect that has taken place since the beginning of the 20th century. The article specifically deals with the extension of the progressive to new constructions, such as the modal, present perfect and past perfect passive progressive, and also accounts for the use of progressive forms in contextual environments not generally characteristic of them.
The paper describes the current state of the corpus of Modern Albanian, which is being created as a joint effort of the Institute of Linguistics (St. Petersburg) and the department of linguistics of the Higher School of Economics, Moscow.
This paper is devoted to the use of two tools for creating morphologically annotated linguistic corpora: UniParser and the EANC platform. The EANC platform is the database and search framework originally developed for the Eastern Armenian National Corpus (www.eanc.net) and later adopted for other languages. UniParser is an automated morphological analysis tool developed specifically for creating corpora of languages with relatively small numbers of native speakers for which the development of parsers from scratch is not feasible. It has been designed for use with the EANC platform and generates XML output in the EANC format.
UniParser and the EANC platform have already been used to create corpora of several languages: Albanian, Kalmyk, Lezgian, and Ossetic, of which the Ossetic corpus is the largest (5 million tokens, 10 million planned for 2013). They are currently being employed in the construction of corpora of Buryat and Modern Greek. This paper will describe the general architecture of the EANC platform and UniParser, using the Ossetic corpus as an example of the advantages and disadvantages of the described approach.
The project we present, the Russian Learner Translator Corpus (RusLTC), is a multiple learner translator corpus which stores Russian students’ translations out of English and into it. The project is being developed by a cross-functional team of translator trainers and computational linguists in Russia. Translations are collected from several Russian universities; all translations are made as part of routine and exam assignments or as submissions for translation contests by students majoring in translation. As of March 2014 RusLTC contains a total of nearly 1.2 million word tokens, 258 source texts, and 1,795 translations. The paper gives a brief overview of the related research and describes the corpus structure and the corpus-building technologies used; it also covers the query tool features and our error annotation solutions. In the final part we summarize the RusLTC-based research and its current practical applications, and suggest research prospects and possibilities.
Four electronic corpora created in 2011 within the framework of the “Corpus Linguistics: the Albanian, Kalmyk, Lezgian, and Ossetic Languages” Program of Fundamental Research of the RAS are presented. The interface and functionalities of these corpora are described, the engineering problems that had to be solved in their creation are explained, and the prospects for their development are discussed. Particular emphasis is placed on the compilation of dictionaries and the automatic grammatical markup of the corpora.
This book is a collection of articles dealing with various aspects of grammatical relations and argument structure in the languages of Europe and North and Central Asia (LENCA). Topics covered with respect to individual languages are: split-intransitivity (Basque), causativization (Agul), transitives and causatives (Korean and Japanese), aspectual domain and quantification (Finnish and Udmurt), head-marking principles (Athabaskan languages), and pragmatics (Eastern Khanty and Xibe). Typology of argument-structure properties of ‘give’ (LENCA), typology of agreement systems, asymmetry in argument structure, typology of the Amdo Sprachbund, spatial relators (Northeastern Turkic), core argument patterns (languages of Northern California), and typology of grammatical relations (LENCA) are the topics of articles based on cross-linguistic data. The broad empirical sweep and the fine-tuned theoretical analysis highlight the central role of argument structure and grammatical relations with respect to a plethora of linguistic phenomena.
The form whose main function is to express indirect commands, called the third person Imperative, Jussive or Exhortative, when compared to the prototypical (second person) Imperative, shows semantic and formal similarities and distinctions at the same time. The study describes formal and functional patterns of Jussive and places this category within the typology of the related categories, such as Imperative and Optative, based on data from six East Caucasian languages (Archi, Agul, Akhvakh, Chechen, Icari and Kumyk). Five formal patterns of Jussive are attested in these languages, including a specialized form, constructions derived from ‘want’, from ‘tell him to do’ and from ‘make him do’, and the Optative. Jussive forms may express such meanings as third person command, indirect causation, permission, indifference towards the accomplishment of an action and an assumption. While the Jussive is crucially different from the second person Imperative in that it introduces a third participant, this article shows that it is the addressee, not a third person, who is the central participant of a Jussive situation from both formal and functional points of view.
The volume presents several papers on Mehweb, a one-village language spoken in the central part of Daghestan, a republic of the Russian Federation.