Automated assessment of learner text complexity

O. Lyashevskaya; Irina Panteleeva; Olga Vinogradova

doi:10.1016/j.asw.2021.100529

Publications

?

Automated assessment of learner text complexity

Assessing Writing. 2021. No. 49. Article 100529.

Lyashevskaya O., Irina Panteleeva, Olga Vinogradova

EFL methodology has always recognized the importance of giving student learners of foreign languages regular and quick feedback on student speech production, both written and oral, and over the past two decades there appeared various tools for the provision of automated instant feedback. The presented paper offers an application that focuses on measuring text complexity, and the results are translated into feedback related to the author’s language proficiency. Along with some standard text complexity features, this tool takes into account those that are significant for Russian learners of English. The application provides students with advice on how to improve the weaker aspects of the evaluated essay by giving the statistics of the relevant linguistic features of the text in two different colours for the better and worse levels. We point out what text features are more relevant for the assessment of the essays written in English by Russian students. We analyzed 3440 texts from Russian Error-Annotated English Learner Corpus, and for each of them we calculated the text criteria values. Then we used the methods of machine learning and statistical analysis to predict the grade that could be received for the essay.

Research target: Philology and Linguistics

Priority areas: humanitarian

Keywords: языковая интерференция learner corpora L1 interference учебные корпуса syntactic complexity morphological complexity lexical complexity параметры сложности текста discourse-organising units

Publication based on the results of:

Automated Detection of Writing Inaccuracies for Students of English in Russia (2020)

Inspector: The Tool For Automated Assessment Of Learner Text Complexity

Olga I. Vinogradova, Olga N. Lyashevskaya, Irina M. P., / NRU Higher School of Economics. Series WP BRP 55/LNG/2017. 2019. No. 79.

Added: October 10, 2019

Исследование языковых ошибок на материале учебных текстов на английском языке

Якушева Т. В., Kuzmenko E., Grabovskaya M. et al., Научно-техническая информация. Серия 2: Информационные процессы и системы 2016 С. 27–35

Статья посвящена разработке классификации ошибок для аннотации учебного корпуса REALEC (Russian Error-Annotated Learner English Corpus) на основе предварительной ручной разметки 170 письменных текстов академического стиля. Рассматриваются вопросы представления разметки, структуры классов и их взаимодействия. Фасетный принцип классификации позволяет приписать ошибке одновременно нескольких видов и уровней, включая критичность ошибки, причину совершения ошибки и лингвистический тип ошибки, ...

Added: December 17, 2016

Word-formation complexity: a learner corpus-based study

Lyashevskaya O., Pyzhak J.V., Vinogradova O. I., Russian Journal of Linguistics 2022 Vol. 26 No. 2 P. 471–492

This article explores the word-formation dimension of learner text complexity which indicates how skilful the non-native speakers are in using more and less complex - and varied - derivational constructions. In order to analyse the association between complexity and writing accuracy in word formation as well as interactive effects of task type, text register, and ...

Added: October 5, 2022

Proceedings of the 4th workshop on NLP for Computer Assisted Language Learning at NODALIDA 2015, Vilnius, 11th May, 2015

Linköping University Electronic Press, 2015.

The workshop series on Natural Language Processing (NLP) for Computer-Assisted Language Learning (CALL) – NLP4CALL – is a meeting place for researchers working on the integration of Natural Language Processing and Speech Technologies in CALL systems and exploring the theoretical and methodological issues arising in this connection. ...

Added: May 31, 2015

Sprachliche Interferenzen bei Russisch-Deutsch-Mehrsprachigen

[б.и.], 2013.

Im Laufe ihres Lebens erwerben und/oder lernen die meisten Menschen mehr als eine Sprache. Dies können zwei (oder mehrere) Erstsprachen, Dialekt- und Standardsprache, die Erstsprache und eine Fremdsprache oder mehrere Fremdsprachen (die man in der Schule oder in der neuen Sprachumgebung erlernt) sein. Aber in welchem Zusammenhang stehen die Sprachen bei einem bilingualen Individuum zueinander, ...

Added: November 13, 2019

The backyard of EFL teaching: issues behind L1 prosodic interference in Russian English

Popkova E., Journal of Language and Education 2015 Vol. 1 No. 4 P. 37–44

In modern EFL teaching in Russia, much attention is paid to making students aware of variations in the cultural schemata represented by their L1 and the target language, as well as behavioral patterns of their speakers. At the same time, researchers and teaching practitioners scarcely address certain linguistic issues of Russian L1 prosodic interference that ...

Added: August 31, 2015

Hedges in Russian EAP writing: A corpus-based study of research papers in management

Smirnova E. A., Стринюк С. А., Journal of English as a Lingua Franca 2020 Vol. 9 No. 1 P. 81–101

The fact that English has become a lingua franca of academic communication has led to increased attention to teaching English for academic purposes (EAP) at the academia. Academic discourse markers, such as hedges, have been an important topic in academic writing research whose prime aim is helping non-Anglophone researchers to present their research findings in ...

Added: October 14, 2020

Языковая рефлексия в современной Беларуси сквозь призму комментариев в интернет-СМИ

Somin A., Вестник РГГУ. Серия «Филологические науки. Языкознание»/ Московский лингвистический журнал 2015 Т. 17 (1) С. 62–86

The paper describes attitudes to the Belarusian language in modern Belarusian society examined through the prism of comments on news articles on the Web. Besides comments on the sociolinguistic news articles, comments on the articles that are neutral from the point of view of the content but are written in Belarusian are analyzed, where the ...

Added: March 14, 2016

A Searching Tool for Russian Error-Annotated Learner English Corpus

Fenogenova A., Kuzmenko E., / NRU HSE. Series WP BRP "Linguistics". 2016.

Learner corpora constitute an effective resource for specialists in fields of second language acquisition, foreign language teaching and corpus linguistics. They tend to get significant scholarly help from statistical tools of various kinds. However, for valuable usage of a corpus it should provide convenient and powerful tools for searching and manipulating data. In this paper ...

Added: December 14, 2016

Глаголы с фиксированным порядком дополнений в русском языке и свойства сентенциальных актантов

Letuchiy A., Русский язык в научном освещении 2018 № 2 (36) С. 180–198

The article focuses on the ordering of arguments in Russian. The central class of phenomena is represented by verbs having a clausal complement that has a fixed position with respect to other arguments and the matrix verb. I mostly analyze verbs like "ščitat’" ‘consider’, "rassmatrivat’ kak" ‘regard as’, "trebovat’" ‘request, require’, which require the clausal complement to be situated ...

Added: September 22, 2019

4th Learner Corpus Conference. LCR 2017. Book of Abstracts

Bozen: [б.и.], 2017.

The conference was organised under the aegis of the Learner Corpus Association and was hosted by Eurac Research Institute for Applied Linguistics. It was themed "Widening the scope of learner corpus research" and brought together researchers and language teachers, software developers and linguists from 23 countries around the world. ...

Added: November 7, 2017

Лексическая типология в ошибках изучающих русский как иностранный: анализ глагола падать в текстах инофонов

Vyrenkova A. S., Acta Linguistica Petropolitana. Труды института лингвистических исследований 2020 Т. XVI Ч.1 С. 368–385

This paper investigates the use of the Russian verb padat’ ‘to fall’ and its quasi-synonyms. Padat’ is dominant in the system of Russian predicates of falling — and therefore should be suitable for describing any type of uncontrolled downward motion. However, in a number of contexts a diff erent means of expression is required. These ...

Added: August 2, 2021

Widening the scope of learner corpus research

John Benjamins Publishing Company, 2020.

The first volume will focus on the theme of learner corpora in research of the CAF (complexity, accuracy and fluency) triad, as it seems to have been a recurrent theme in many conference presentations. ...

Added: October 28, 2019

АВТОМАТИЗИРОВАННАЯ ОЦЕНКА ЛЕКСИКОНА ОБУЧАЮЩИХСЯ ПРИ ПОМОЩИ УЧЕБНОГО КОРПУСА

Vinogradova O. I., ПОЛИЛИНГВИАЛЬНОСТЬ И ТРАНСКУЛЬТУРНЫЕ ПРАКТИКИ 2018 Vol. 15 No. 2018/3 P. 372–380

The role of access to a learner corpus has proved to increase efficiency of L2 acquisition for learners as well as teaching efficiency for EFL instructors. This paper presents a computer tool for a learner corpus designed at the School of Linguistics of the Higher School of Economics for both categories of users. REALEC, Russian ...

Added: November 8, 2017

Hedges in L2 Academic Writing: a Comparative Analysis of Learner and Expert Corpora

Smirnova E. A., Strinyuk S. A., RELC Journal 2019

The use of hedges has been an important topic in academic writing research. Much of this research has focused on L1 academic texts. This study investigates the use of the most frequent hedging devices in the corpus of 58 works written by Russian university students and compares it to the corpus of articles published in ...

Added: December 3, 2017

Russian in contact with Southern Tungusic languages: Evidence from the Contact Russian Corpus of Northern Siberia and the Russian Far East

Stoynova N., Slavica Helsingiensia 2019 No. 52 P. 9–36

The paper contains a description of the variety of Russian used by bilingual speakers of Southern Tungusic languages (some Nanai dialects and Ulch). The morphosyntactic contact-induced peculiarities of their speech are the focus of this paper, while cases of phonetic and lexical interference are discussed in less detail. The study is based on the data ...

Added: October 25, 2019

Использование русского учебного корпуса в преподавании РКИ: вид глагола

Olshevskaya M., Международный аспирантский вестник. Русский язык за рубежом 2018 № 1 С. 13–18

On the body material in the article, common errors in the use and construction of the verb form are considered - from the theoretical and typological points of view. The data of the RLC educational building containing texts of students of the Russian language as a foreign language are used. Identified "weaknesses" in the assimilation ...

Added: October 19, 2017

Авторитет в профессиональном сообществе. К 70-летию Владимира Вениаминовича Агеносова.

Pavlovets M., Литература в школе 2012 № 4 С. 30–31

Article is devoted to the anniversary of the famous literary critic, editor in chief of teaching methods in literature for school and university V.V. Aguenosoff ...

Added: March 9, 2015

Конвербное наращение -et в языке идиш

Andriyanets V., Тирош труды по иудаике 2016

В данной статье исследуется поведение конвербного наращения –et в устной и письменной речи носителей языка идиш. С помощью, в основном, корпуса устной речи EYDES и поисковой машины Google исследуется география феномена, а также стилистические особенности его употребления. ...

Added: December 10, 2015

"Рассуждение о подлинной и мнимой красоте" Пьера Николя в контексте идей французского классицизма

Al-Faradzh E. A., Вестник Русской христианской гуманитарной академии 2013 Т. 14 № 1 С. 179–187

В данной статье речь идет о малоизученном тексте янсенистского автора Пьера Николя, который посвящен вопросам эстетики. Этот текст оказывается моментом кристаллизации традиции перед ее решающим обновлением. Николь характеризует идею красоты с точки зрения понятий, которые также находятся в центре классицистических доктрин -- природа, разум, единство, истина. В данной статье проводится семантический анализ понятия красоты, которое ...

Added: November 18, 2013

Понятие «банкротство» в координатах правовой лингвистики: русско-англо-французские аппроксимации

Vlasenko S. V., Галимов А. Р., Вестник Тверского государственного университета. Серия: Филология 2012 Т. 10 № 2 С. 21–28

«Bankruptcy» Concept Within the Legal Linguistics Coordinates: Russian–English–French Approximations The article addresses the notion of bankruptcy as perceived by speakers of current Russian, English and French languages both lawyers and participants in professional communication from other trades. Semantic structure of the term is identified based on its lexicographic and regulatory definitions. ...

Added: October 4, 2012

Национальные коды в языке и литературе. Язык как культурно-историческое достояние народа

Изд-во ННГУ им. Н.И. Лобачевского, 2020.

Сборник содержит статьи, подготовленные по материалам докладов Международной научной конференции «Национальные коды в языке и литературе» (Нижегородский государственный университет им. Н.И. Лобачевского, Институт филологии и журналистики, 31 октября – 2 ноября 2019 г.). Рассматриваются актуальные проблемы функционирования русского языка в синхронии и диахронии в разных коммуникативных сферах. Особое внимание уделяется исследованию языка как фактора национальной ...

Added: September 30, 2020

Социально-политический дискурс (Лингводидактические аспекты обучения иностранным языкам). – М.: ИПК МГЛУ «Рема», 2010. – 192 с. (Вест. Моск. гос. лингвист. ун-та; вып. 8 (587). Сер. Педагогические науки). – С. 103 – 119. "Особенности медиадискурса и оптимизация обучения общественно-политической лексике"

Nikitina E. V., [б.и.], 2010.

Added: April 6, 2013

Пересказ как искусство историка: к вопросу о рукописной трансмиссии историописания в Древней Руси и Древней Скандинавии

Daria G., Славяноведение 2020 № 4 С. 30–49

До 1016 г. версии «Повести временных лет» (ПВЛ) по ее основным киевским спискам и по Новгородской первой летописи младшего извода (Н1Лмл.) обычно соотносятся либо как близкие копии, либо как расширенная/сжатая редакции одного текста. Это правило нарушается лишь в рассказе о конфликте Ярослава и новгородцев под 1015–1016 гг. Здесь ПВЛ и Н1Лмл. соотносятся скорее как пересказы ...

Added: September 28, 2020