Automated assessment of learner text complexity
EFL methodology has always recognized the importance of giving student learners of foreign languages regular and quick feedback on student speech production, both written and oral, and over the past two decades there appeared various tools for the provision of automated instant feedback. The presented paper offers an application that focuses on measuring text complexity, and the results are translated into feedback related to the author’s language proficiency. Along with some standard text complexity features, this tool takes into account those that are significant for Russian learners of English. The application provides students with advice on how to improve the weaker aspects of the evaluated essay by giving the statistics of the relevant linguistic features of the text in two different colours for the better and worse levels. We point out what text features are more relevant for the assessment of the essays written in English by Russian students. We analyzed 3440 texts from Russian Error-Annotated English Learner Corpus, and for each of them we calculated the text criteria values. Then we used the methods of machine learning and statistical analysis to predict the grade that could be received for the essay.
The conference was organised under the aegis of the Learner Corpus Association and was hosted by Eurac Research Institute for Applied Linguistics. It was themed "Widening the scope of learner corpus research" and brought together researchers and language teachers, software developers and linguists from 23 countries around the world.
In modern EFL teaching in Russia, much attention is paid to making students aware of variations in the cultural schemata represented by their L1 and the target language, as well as behavioral patterns of their speakers. At the same time, researchers and teaching practitioners scarcely address certain linguistic issues of Russian L1 prosodic interference that lead to attitudinal confusion on the part of native English speakers even when utterances produced by Russian EFL learners are void of any grammatical and lexical errors.
The study examines Russian L1 intonation in English, analyzes the differences in the pragmatic meaning created by the wrong application of L1 intonation contours, and looks at the reasons leading to the failure of the educational system to address the issue. Specifically, the paper investigates features of Russian L1 prosodic interference that affect communication and lead to misunderstanding of Russian speakers’ attitude or intent from native speakers’ perspective, as well as questions the importance of teacher beliefs in dealing with the problem. The results emphasize the importance of intonation teaching in an English classroom and suggest possible ways of dealing with institutional constraints that impede full-fledged intonation study.
The paper describes the learner corpus composed of English essays written by native Russian speakers. REALEC (Russian Error-Annotated Learner English Corpus) is an error-annotated, available online corpus, now containing more than 200 thousand word tokens in almost 800 essays. It is one of the first Russian ESL corpora, dynamically developing and striving to improve both in size and in features offered to users. We describe our perspective on the corpus, data sources and tools used in compiling it. Elaborate self-made classification of learners’ errors types is thoroughly described. The paper also presents a pilot experiment on creating test sets for particular learners’ problems using corpus data.
The project we present – Russian Learner Translator Corpus (RusLTC) is a multiple learner translator corpus which stores Russian students’ translations out of English and into it. The project is being developed by a cross-functional team of translator trainers and computational linguists in Russia. Translations are collected from several Russian universities; all translations are made as part of routine and exam assignments or as submissions for translation contests by students majoring in translation. As of March 2014 RusLTC contains the total of nearly 1.2 million word tokens, 258 source texts, and 1,795 translations. The paper gives a brief overview of the related research, describes the corpus structure and corpus-building technologies used; it also covers the query tool features and our error annotation solutions. In the final part we make a summary of the RusLTC-based research, its current practical applications and suggest research prospects and possibilities.
The article focuses on the ordering of arguments in Russian. The central class of phenomena is represented by verbs having a clausal complement that has a fixed position with respect to other arguments and the matrix verb. I mostly analyze verbs like "ščitat’" ‘consider’, "rassmatrivat’ kak" ‘regard as’, "trebovat’" ‘request, require’, which require the clausal complement to be situated after the nominal object, but not before it. I seek to find the reasons for this restriction, which is in general not characteristic of Russian syntax. The conclusion is that it results from several factors: the syntactic weight of complement clauses, looseness of syntactic link with the head, compared to nominal arguments, and so on. The data of other Russian constructions (e. g., coordinate structures) also show that clausal constituents tend to be situated after nominal ones.
The paper discusses case (non-)coincidence in elliptical coordinated constructions, which is one of the most wide-spread type of errors that Russian native speaker make.
The workshop series on Natural Language Processing (NLP) for Computer-Assisted Language Learning (CALL) – NLP4CALL – is a meeting place for researchers working on the integration of Natural Language Processing and Speech Technologies in CALL systems and exploring the theoretical and methodological issues arising in this connection.
On the body material in the article, common errors in the use and construction of the verb form are considered - from the theoretical and typological points of view. The data of the RLC educational building containing texts of students of the Russian language as a foreign language are used. Identified "weaknesses" in the assimilation of this topic by foreign students. An attempt was made to create a typology of errors. It is shown that the observed errors in the formation of verbs are generally expected; they are also confirmed by the result of the study. The article analyzes the possibilities of using the Russian educational building (RUK, Russian Learner Corpus - RLC) in the practice of teaching RCTs while studying the theme "verb type". A total of 900 examples of verbs are considered; errors in the verbs of the present, past and future tense are noted in 330 of them. The aim of the study is to clarify the derivation rules for the verb type and the rules for using species, as well as to analyze the types of errors issued by the academic building and to establish a correlation between the rule and usage.
The paper is focused on the study of reaction of italian literature critics on the publication of the Boris Pasternak's novel "Doctor Jivago". The analysys of the book ""Doctor Jivago", Pasternak, 1958, Italy" (published in Russian language in "Reka vremen", 2012, in Moscow) is given. The papers of italian writers, critics and historians of literature, who reacted immediately upon the publication of the novel (A. Moravia, I. Calvino, F.Fortini, C. Cassola, C. Salinari ecc.) are studied and analised.
In the article the patterns of the realization of emotional utterances in dialogic and monologic speech are described. The author pays special attention to the characteristic features of the speech of a speaker feeling psychic tension and to the compositional-pragmatic peculiarities of dialogic and monologic text.