MULTI-LEVEL STUDENT ESSAY FEEDBACK IN A LEARNER CORPUS

O. I. Vinogradova; O. Lyashevskaya; Irina Panteleeva

?

MULTI-LEVEL STUDENT ESSAY FEEDBACK IN A LEARNER CORPUS

P. 373-386.

Vinogradova O. I., Lyashevskaya O., Irina Panteleeva

The paper presents the results of using some computer tools and applications for the purposes of the automated and semi-automated syntactical, lexica, and error analysis of student essays in a learner corpus. The texts in the corpus were written in English by Russian learners of English. The experiment in the research consisted in comparing the parameters of different types and at different levels in the essays graded by professional examiners as the best and those graded the lowest in the pool of about 2000 essays. At the first stage in the experiment the authors applied a syntactical tool for parsing the sentences, then analyzed the results of lexical observations in those texts, and finally collected the statistics related to the errors pointed out in manual expert annotation. The parameters that had very different values for the “good” and for the “bad” essays are regarded by the authors as worthy parts of the feedback a student has to get for the text uploaded into the learner corpus.

Language: English

Full text

Text on another site

Keywords: corpus research computational linguistics learner corpus

Publication based on the results of:

Лексикологические исследования на базе учебного корпуса REALEC (Learner corpus REALEC: Lexicological observations) (2016)

In book

Computational Linguistics and Intellectual Technologies. International Conference "Dialogue 2017" Proceedings

Vol. 1. Issue 16 (23). , M. : -, 2017

Punctuation in L2 English: Computational Methods Applied in the Study of L1 Interference

Vinogradova O. I., Viklova A., Smilga V., , in : Emerging Writing Research from the Russian Federation. : WAC Clearinghouse, University Press, Colorado, 2021. Ch. 9. P. 211-233.

Added: February 4, 2020

Widening the scope of learner corpus research

John Benjamins Publishing Company, 2020

The first volume will focus on the theme of learner corpora in research of the CAF (complexity, accuracy and fluency) triad, as it seems to have been a recurrent theme in many conference presentations. ...

Added: October 28, 2019

THE ROLE AND APPLICATIONS OF EXPERT ERROR ANNOTATION IN A CORPUS OF ENGLISH LEARNER TEXTS

Vinogradova O. I., , in : Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог» (Москва,1–4 июля 2016 г.). Вып. 15.: М. : Изд-во РГГУ, 2016. P. 830-840.

The paper presents the rationale for the decisions that were taken in the set-up and further development of a learner corpus of student texts written in English by Russian learners of English, the only Russian learner corpus in the open access. The tool of manual expert annotation is in the focus of the present observations, ...

Added: May 18, 2016

TaLC 12 - Teaching and Language Corpora Conference

[б.и.], 2016

Various issues relating to the questions of learner corpus researches and their use in teaching are presented. These include the issue of a norm in corpora whether the norm should necessarily be native and what problems a native norm may present. Learners who behave differently from native speakers do not necessarily use language incorrectly as ...

Added: December 10, 2016

An Experimental Study of Hybrid Machine Learning Models for Extracting Named Entities

Lei J., Bolshakova E. I., , in : Proceedings of Third Workshop "Computational linguistics and language science". Issue 4.: Manchester : EasyChair, 2019. P. 50-60.

The paper describes two hybrid neural network models for named entity recognition (NER) in texts, namely Bi-LSTM-CRF and Gated-CNN-CRF, as well as results of experiments with them. ...

Added: November 3, 2019

Regular polysemy: from sense vectors to sense patterns

Lopukhina A., Лопухин К. А., , in : The 26th International Conference on Computational Linguistics (COLING 2016). : [б.и.], 2016. P. 19-23.

Regular polysemy was extensively investigated in lexical semantics, but this phenomenon has been very little studied in distributional semantics. We propose a model for regular polysemy detection that is based on sense vectors and allows to work directly with senses in semantic vector space. Our method is able to detect polysemous words that have the ...

Added: December 1, 2016

The 26th International Conference on Computational Linguistics (COLING 2016)

[б.и.], 2016

Added: December 1, 2016

Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог» (Москва, 17 июня — 20 июня 2020 г.). Дополнительный том материалов

M. : ., 2020

Дополнительный том включает доклады Международной конференции по компьютерной лингвистике и интеллектуальным технологиям «Диалог 2020», не вошедших в основной сборник. Доклады представляют широкий спектр теоретических и прикладных исследований в области описания естественного языка, моделирования языковых процессов, создания практически применимых компьютерных лингвистических технологий. Для специалистов в области теоретической и прикладной лингвистики и интеллектуальных технологий. ...

Added: July 3, 2020

The bimodal corpus of Russian-Turkic bilinguals' speech (RuTuBic)

Artemenko E., Резанова З. И., Темникова И. Г. et al., Компьютерная лингвистика и интеллектуальные технологии 2019 Vol. Suppl No. 18 P. 200-210

The paper presents Russian-Turkic Bilingual Corpus (RuTuBiC) design, its basic identifying features: the aim of producing a corpus, the types of texts it contains, metatextual markup and error annotation principles, technological (IT, digital) concepts. The current state and development trends of the corpus are discussed. The corpus started as an integral part of a research project ...

Added: May 4, 2022

Применение учебного корпуса в преподавании темы "Confusables"

Klimova M., Overnikova D., Смилга В. К., В кн. : Пространство научных интересов: иностранные языки и межкультурная коммуникация – современные векторы развития и перспективы: сборник статей по результатам VI научной межвузовской онлайн-конференции молодых ученых 22.04.2021 г. : М. : [б.и.], 2021.

The article is devoted to teaching error-prone lexical items with the help of the learner corpus REALEC (Russian Error-Annotated Learner English Corpus). The word groups under consideration included near-synonymous numerical nouns (amount, number, quantity), near-synonymous nouns related to possibility (possibility, opportunity, ability, potential), and a pair of paronyms note and notice. Due to being the ...

Added: October 31, 2021

Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог» (Москва, 17 июня — 20 июня 2020 г.)

М. : Изд-во РГГУ, 2020

Papers from the Annual International Conference “Dialogue” (2020). Issue 19 ...

Added: June 26, 2020

Digital Geography: Proceedings of the International Conference on Internet and Modern Society (IMS 2022)

Springer, 2024

Presents select papers from the International Conference on Internet and Modern Society (IMS 2022) Examines Smart Cities, Digital Sustainability, Digital Divides, and Social Media Movements Discusses cutting edge work on Digital Urbanism and Cyber Psychology ...

Added: December 10, 2022

Data Analytics and Management in Data Intensive Domains. 23rd International Conference, DAMDID/RCDL 2021, Moscow, Russia, October 26–29, 2021, Revised Selected Papers

Springer, 2022

“Data Analytics and Management in Data Intensive Domains” conference (DAMDID) is planned as a multidisciplinary forum of researchers and practitioners from various domains of science and research promoting cooperation and exchange of ideas in the area of data analysis and management in data intensive domains. Approaches to data analysis and management being developed in specific data intensive domains of X-informatics (such as X = astro, bio, chemo, geo, medicine, neuro, physics, ...

Added: August 30, 2021

Digital Transformation and Global Society. Third International Conference, DTGS 2018, St. Petersburg, Russia, 2018, Revised Selected Papers. Part II. Communications in Computer and Information Science 859

Springer, 2018

This two volume set (CCIS 858 and CCIS 859) constitutes the refereed proceedings of the Third International Conference on Digital Transformation and Global Society, DTGS 2018, held in St. Petersburg, Russia, in May/June 2018. The 75 revised full papers and the one short paper presented in the two volumes were carefully reviewed and selected from 222 submissions. ...

Added: November 15, 2018

Proceedings of the 8th Conference on Artificial Intelligence and Natural Language, AINL 2019. CCIS

Springer, 2019

Added: November 3, 2019

Clausal complexity of expert and student writing: a corpus-based analysis of papers in social sciences

Smirnova E. A., Language Learning in Higher Education 2022 Vol. 12 No. 2 P. 453-475

Syntactic complexity has been extensively approached in the fields of corpus linguistics and academic discourse studies. However, works focusing on disciplinary variation in terms of linguistic complexity and comparison of professional and novice academic writing are scarce. Addressing these issues is likely to have important implications for EAP/ESP practitioners in terms of selection of target ...

Added: December 7, 2022

USE OF LEARNER CORPUS IN GENERAL ENGLISH AND ACADEMIC ENGLISH COURSES AT THE HIGHER SCHOOL OF ECONOMICS

Vinogradova O. I., , in : Conference Proceedings. The Future of Education International Conference The Future of Education, 6th edition. : Padova : libreriauniversitaria, 2016. P. 310-314.

There have been many reports on advances in the development of learner corpora that have made it possible to effectively use these collections of texts for the benefit of the learning process. This paper lists all possible applications in English courses taught to Bachelor students of a middle-size learner corpus REALEC, which comprises student written ...

Added: March 1, 2017

Clausal complexity features in professional and student academic writing: A corpus-based analysis of texts in management and economics

Smirnova E. A., Journal of English for Academic Purposes 2020

The study is a quantitative analysis of the use of clausal complexity features in two kinds of corpora: expert corpora which comprise articles published in peer-reviewed journals in management and economics and learner corpora of students’ research papers in the same disciplines. The syntactic constructions selected for the analysis are taken from various guidebooks and ...

Added: October 20, 2019

Corpus of Russian student texts: design and prospects

Zevakhina N., Dzhakupova S., , in : Материалы 21-й Международной конференции по компьютерной лингвистике "Диалог". : М. : Изд-во РГГУ, 2015.

The Corpus of Russian Student Texts (CoRST) is a computational and research project started in 2013 at the Linguistic Laboratory for Corpora Research Technologies at HSE. It comprises a collection of Russian texts written by students from various Russian universities. Its main research goal is to examine language deviations viewed as markers of language change. ...

Added: May 20, 2015

Предсказания, большие данные и новые измерители: о возможности технологий компьютерной лингвистики в теоретических лингвистических исследованиях

Bonch-Osmolovskaya A. A., Вопросы языкознания 2016 № 2 С. 100-120

Статья посвящена обзору работ последних лет, в которых теоретическая исследовательская задача решается с помощью методов или инструментов, используемых в компьютерной лингвистике. В обзоре проводится подробный анализ того, как именно с помощью применения того или иного инструмента или метода можно получить новые знания о природе языка. В частности, выделяются два основных направления, развитие которых в рамках ...

Added: April 14, 2015

Corpus Methods in Pragmatics: The Case of English and Russian Emotions

Apresyan V., Intercultural Pragmatics 2013 Vol. 10 No. 4 P. 533-568

The present paper is a comparative corpus study of the verbal expression of emotional etiquette in American English and Russian. The study is conducted against the backdrop of certain assumptions regarding the cross-cultural centrality and marginality of emotions as formulated in the current research on cross-cultural pragmatics. The paper employs corpus-based methods to test the ...

Added: October 13, 2013

Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог» (Москва, 29 мая — 1 июня 2019 г.)

М. : Издательский центр «Российский государственный гуманитарный университет», 2019

The book includes 64 papers submitted to the International conference in computer linguistics and intellectual technologies Dialogue 2019 and presents a broad spectrum of theoretical and applied research of natural language description, language simulation, and creation of applied computer technologies. ...

Added: October 16, 2019

Corpus Linguistics 2015: Abstract Book

Lancaster : Lancaster University Press, 2015

The main trends and achievements in corpus linguistics are presented in this collection os abstracts of plenaries, papers and posters presented at the 8th internation conference Corpus Linguistics - 2015 (Lancaster University, UCREL, July 2015) ...

Added: October 17, 2015

4th Learner Corpus Conference. LCR 2017. Book of Abstracts

Bozen : [б.и.], 2017

The conference was organised under the aegis of the Learner Corpus Association and was hosted by Eurac Research Institute for Applied Linguistics. It was themed "Widening the scope of learner corpus research" and brought together researchers and language teachers, software developers and linguists from 23 countries around the world. ...

Added: November 7, 2017