Некоторые особенности адыгейского электронного корпуса
The paper deals with concepts and formal conventions which are accepted for the West Circassian electronic corpus by its authors. The ideas discussed here are induced by the specifics of automatic morphological parsing of West Circassian texts required for grammatical annotation of morphemes and word forms.
The article discusses the most recent trends in the development of the English progressive. A corpus-based approach to linguistic research is seen as an effective means of determining reliability of the data retrieved and helps track the major diachronic dynamic in the increasing frequency of the progressive aspect that has taken place since the beginning of the 20th century. The article specifically deals with the extension of the progressive to new constructions, such as modal, present perfect and past perfect passive progressive, and also accounts for the use of progressive forms in the contextual environment not generally characteristic of them.
The project we present – Russian Learner Translator Corpus (RusLTC) is a multiple learner translator corpus which stores Russian students’ translations out of English and into it. The project is being developed by a cross-functional team of translator trainers and computational linguists in Russia. Translations are collected from several Russian universities; all translations are made as part of routine and exam assignments or as submissions for translation contests by students majoring in translation. As of March 2014 RusLTC contains the total of nearly 1.2 million word tokens, 258 source texts, and 1,795 translations. The paper gives a brief overview of the related research, describes the corpus structure and corpus-building technologies used; it also covers the query tool features and our error annotation solutions. In the final part we make a summary of the RusLTC-based research, its current practical applications and suggest research prospects and possibilities.
Four electronic corpora created in 2011 within the framework of the “Corpus Linguistics: the Albanian, Kalmyk, Lezgian, and Ossetic Languages” Program of Fundamental Research of the RAS are presented. The interface and functionalities of these corpora are described, engineering problems to be solved in their creation are elucidated, and the promises of their development are discussed. A particular emphasis is made on the compilation of dictionaries and automatic grammatical markup of the corpora.
In this paper I will analyse the syntactic properties of valency-changing derivations and other syntactic processes in Adyghe (a language of the West Caucasian family spoken in the Republic of Adygheya and the Krasnodar region of Russia, and also in some countries of western Asia such as Turkey). My aim is to determine whether these processes testify to syntactic ergativity or accusativity in Adyghe, or whether they in fact shed no light at all on the question of Adyghe alignment behaviour.
In the present paper, I base my analysis of syntactic ergativity on the evidence of valency-changing derivation only. I choose not to consider other pivot properties related to ergativity / accusativity (coordination reduction, relativization, subordinate clauses etc.; see Dixon 1994; Van Valin and LaPolla 1997). It seems to me more justifiable to restrict myself to the data presented by derivational behaviour alone, since in a single article it is impossible to analyse the whole range of data related to ergativity in a polysynthetic language like Adyghe; moreover, the valency-changing derivational system may be organized ergatively, for example, while other syntactic processes are organized accusatively, or vice versa.
The processes analysed in this paper can be divided into two groups, based on the kind of information they provide about ergativity in Adyghe.
First of all, there are derivations which can be regarded as semantically motivated (though syntactic motivation can also be proposed for these processes).
Secondly, there are derivations which are only compatible with transitive verbs, namely the inadvertitive and potential. These transformations are more significant for our analysis, since they show that Adyghe is syntactically ergative.