• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Articles
  • Here We Go Again: Modern GEC Models Need Help with Spelling
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
May 15, 2026
Preserving Rationality in a Period of Turbulence
The HSE International Laboratory for Logic, Linguistics and Formal Philosophy studies logic and rationality in a transformed world characterised by a diversity of logical systems and rational agents. The laboratory supports and develops academic ties with Russian and international partners. The HSE News Service spoke with the head of the laboratory, Prof. Elena Dragalina-Chernaya, about its work.
May 15, 2026
‘All My Time Is Devoted to My Dissertation
Ilya Venediktov graduated from the Master’s programme at the HSE Tikhonov Moscow Institute of Electronics and Mathematics through the combined Master’s–PhD track and is currently studying at the HSE Doctoral School of Engineering Sciences. At present, he is undertaking a long-term research internship at the University of Science and Technology of China in Hefei, where he is preparing his dissertation. In this interview, he explains how an internship differs from an academic mobility programme, discusses his research topic, and describes the daily life of a Russian doctoral student in China.
May 15, 2026
‘What Matters Is Not What You Study, but Who You Study with
Katerina Koloskova began studying Arabic expecting to give it up after a year—now she cannot imagine her life without it. In an interview for the Young Scientists of HSE University project, she spoke about two translated books, an expedition to Socotra, and her love for Bethlehem.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Here We Go Again: Modern GEC Models Need Help with Spelling

Proceedings of the Institute for System Programming of the RAS. 2023. Vol. 35. No. 5. P. 215–228.
Starchenko V., Starchenko A.

The study focuses on how modern GEC systems handle character-level errors. We discuss the ways these errors effect the performance of models and test how models of different architectures handle them. We conclude that specialized GEC systems do struggle against correcting non-existent words, and that a simple spellchecker considerably improve overall performance of a model. To evaluate it, we assess the models over several datasets. In addition to CoNLL-2014 validation dataset, we contribute a synthetic dataset with higher density of character-level errors and conclude that, provided that models generally show very high scores, validation datasets with higher density of tricky errors are a useful tool to compare models. Lastly, we notice cases of incorrect treatment of non-existent words on experts' annotation and contribute a cleared version of this dataset. In contrast to specialized GEC systems, LLaMA model used for GEC task handles character-level errors well. We suggest that this better performance is explained by the fact that Alpaca is not extensively trained on annotated texts with errors, but gets as input grammatically and orthographically correct texts.

Research target: Philology and Linguistics Computer Science
Language: English
DOI
Text on another site
Keywords: validationвалидацияпредобработкаpreprocessingспеллчекерGECspellchecksynthetic datasetsисправление грамматических ошибоксинтетические датасеты
Publication based on the results of:
Constituent structure and constituents' interpretation in the grammar architecture of the languages of Russian (2023)
Similar publications
Лично-числовая асимметрия: согласование пассивных миративов в казымском диалекте хантыйского языка
Starchenko A., Toldova S., Типология морфосинтаксических параметров 2023 Т. 6 № 1 С. 130–148
The study focuses on a previously unrecorded model of split agreement in the mirative paradigm in Kazym Khanty. Split agreement is found when comparing active and passive mirative constructions, as well as in a limited set of uses of non-finite forms. In the passive voice, unlike the active voice, the 3rd person is unmarked and the ...
Added: May 14, 2026
QGKM: A Quantum Fidelity-Based Graph Clustering Framework for Robust Data Pattern Recognition in Education Social Networks QGKM: A Quantum Fidelity-Based Graph Clustering Framework for Robust Data Pattern Recognition in Education Social Networks
Neal N. X., Weiqing L., Dacheng H. et al., Algorithms 2026 Vol. 19 No. 5 P. 1–22
In the era of data-driven education, educational social networks generate large volumes of high-dimensional and complex-structured data through learner interactions, collaborative activities, and resource-sharing behaviors, posing significant challenges to traditional unsupervised learning methods. Such data often exhibit non-convex distributions, heterogeneity, and noise sensitivity, making conventional clustering approaches insufficient for capturing their intrinsic structural relationships. To ...
Added: May 13, 2026
Глаголы перемещения веществ в славянских языках
Fedorov D., Jezikoslovni Zapiski 2026 № 32(1) С. 23–52
This article describes verbs denoting motion of liquid and dry substances in Slavic langu­ages. The research explores how Slavic languages lexicalize different situations within the semantic field of substance motion and identifies the parameters that drive this lexicalization (e.g., type of substance, intensity and quantization of flow, and causation). Adjacent gram­matical phenomena such as argument ...
Added: May 13, 2026
Образ женщины сквозь года: диахронический анализ репрезентации женщин в российской агитационной рекламе
Gabrielova E., Максименко О. И., Социальные и гуманитарные науки на Дальнем Востоке 2026 Т. 23 № 1 С. 241–249
The article presents a diachronic analysis of the representation of women in Russian advertising, based on agitation posters from 1917-1990 and social and motivational advertising materials from 2000-2020. The aim of the study is to identify the evolution of verbal and visual strategies for constructing the image of women in the changing socio-political and cultural ...
Added: May 13, 2026
Proceedings of the 9th Student Research Workshop associated with the International Conference Recent Advances in Natural Language Processing
Velichkov B., Nikolova-Koleva I., Slavcheva M., Shumen: INCOMA Ltd, 2025.
The RANLP 2025 Student Research Workshop (RANLPStud’2025) is a special track of the established international conference Recent Advances in Natural Language Processing (RANLP’2025). The RANLPStud is being organised for the 9th time and this year is running in parallel with the other tracks of the main RANLP 2025 conference. The target of RANLPStud’25 is to be a ...
Added: May 12, 2026
Интегрированная среда моделирования для верификации и валидации программ управления подключенными и высокоавтоматизированными транспортными средствами
Stepanyants V., Долгов И. М., Хорошилов Г. С. et al., Труды Института системного программирования РАН 2026 Т. 38 № 3 С. 95–110
Highly automated and connected vehicles are gradually entering the market. Currently, solutions are being proposed that allow these technologies to be used for cooperative driving automation, which can significantly improve traffic safety. Such technologies and their software should be tested to ensure safety before being implemented in real systems. Verification and validation of vehicular control ...
Added: May 12, 2026
Connected and Automated Vehicle Scenario Manager Graphical User Interface
Tikhonov R., Efendiev M. T., Fedotenkov A. A., 2026 International Russian Smart Industry Conference (SmartIndustryCon) 2026 P. 542–547
High-fidelity simulation environments like CARLA and ROS are essential for connected and automated vehicle research. They allow researchers to verify and validate new software and technology without the time, financial, and safety overheads of real-world testing. However, their operation requires considerable expertise for creating platform-specific scenario configuration files, which complicates the research workflow. This paper ...
Added: May 11, 2026
«Плоский мир» Т. Пратчетта глазами русскоязычного фандома
Кульков А. Н., Tsvetkova M. V., Вестник Томского государственного университета. Филология 2026 № 100 С. 158–173
Впервые делается попытка рассмотреть особенности фанфикшн как акта продуктивной рецепции, возникшего на основе цикла романов Терри Пратчетта о Плоском мире в России. Проведенный анализ показывает, что прежде всего авторы фанфиков стремятся передать стилистику и комическое начало оригинального цикла Пратчетта, вне зависимости от жанра и формата создаваемых ими произведений. Фикрайтеры наиболее часто обращаются к таким форматам, ...
Added: May 10, 2026
Proceedings 2026 IEEE 11th International Conference on Smart Cloud SmartCloud 2026 8-10 May 2026
Los Alamitos: IEEE Computer Society, 2026.
It is a great pleasure for us to welcome you on behalf of the conference committees, to the 11th IEEE International Conference on Smart Cloud (IEEE SmartCloud 2026), we are glad that we can have this international conference in New York city, USA. Now, please allow us to introduce the IEEE SmartCloud 2026 conference. The ...
Added: May 10, 2026
От неизвестности к прозрачности: обзор технологий объяснимого ИИ (XAI)
Avdoshin S. M., Pesotskaya E. Y., Информационные технологии 2026 Т. 32 № 4 С. 185–194
With the rapid advancement of artificial intelligence, and deep learning in particular, models have emerged that are capable of delivering highly accurate predictions. However, the internal logic of such models remains difficult to interpret—an issue of critical importance, especially in domains where the correctness of an algorithm directly affects high-stakes decision-making. One promising avenue for ...
Added: May 8, 2026
Explainable AI for Industry 5.0: Shedding light on the black box
Avdoshin S. M., Pesotskaya E. Y., Business Informatics 2026 Vol. 20 No. 1 P. 7–28
The rapid development of artificial intelligence (AI) is accompanied by increasing computational complexity and decreasing model transparency, which significantly limits its adoption in critical domains that require a high level of trust, interpretability, and justification of decisions. Under these conditions, the field of Explainable Artificial Intelligence (XAI) has gained particular importance as it focuses on approaches and technologies that ...
Added: May 8, 2026
Русскоязычная версия Шкалы экотревожности Хогг (HEAS-RU)
Nartova-Bochaver S. K., Stakina Y., Тренина М. Е. et al., Клиническая и специальная психология 2026 Т. 15 № 1 С. 166–181
Context and relevance. Eco-anxiety is the anxiety arising in connection with real and possible natural changes and disasters. Eco-anxiety is a significant destabilizer of human activity and therefore needs to be monitored or intervened, which requires a tool to assess its severity. Objective. The present study is aimed at adapting the Hogg Eco-Anxiety Scale (HEAS) ...
Added: April 18, 2026
Statistically distinguishable rating scales
Pomazanov M. V., The Journal of Risk Model Validation 2026 Vol. 20 No. 1 P. 1–24
This paper proposes a method of designing a statistically distinguishable rating scale that is not excessive in relation to the existing observation statistics. This allows for more stable validation with a fixed maximum number of violations of the Wald criterion compared with the excess scales usually used by banks. The increased validation robustness will reduce the calibration probability of ...
Added: December 9, 2025
Model Risk for Acceptable, but Imperfect, Discrimination and Calibration in Basel PD and LGD Models
Penikas H. I., / Series Доклады Банка России "Серия докладов об экономических исследованиях". 2022. No. 92.
The Basel Internal-Ratings-Based (IRB) approach allows banks to use sufficiently good credit risk models for the daily computation of their capital adequacy ratio. However, being sufficiently good does not naturally mean being perfect. Conventionally, risk managers increase the mean probability of default (PD) and loss given default (LGD) values by some margin when developing a model. They expect that it is sufficient to offset for potential model risk. This add-on, ...
Added: November 10, 2025
InGrid: Towards a Simulation-Based Automated Decision-Making System for Transportation
Stepanyants V., , in: 2025 International Russian Automation Conference (RusAutoCon).: IEEE, 2025. P. 982–986.
Transportation systems are complicated and deal with significant problems. With the pool of possible solutions being wide, extensive transportation planning has to be involved. However, planning based on expert opinions is significantly limited in terms of rapidity, accuracy, and confidence. Computer-aided design and automated decision-making systems are the next step to ensure transportation system development ...
Added: October 3, 2025
Городская идентичность — как измерять?
Reznichenko S. I., Рожкова Н. А., Человек 2025 Т. 36 № 4 С. 58–73
Identification with the city not only supports the psychological wellbeing of the resident, but also becomes a resource for the development of the city, because committed citizens are more motivated to preserve and protect their city. Meanwhile, there are few ways to measure this phenomenon, and the available ones have limitations or have not been ...
Added: September 1, 2025
Психометрические свойства русскоязычной версии шкалы психологического дистресса Р. Кесслера
Kislitsyn D., Schapov D., Aleksandrova E., Психиатрия 2025 Т. 23 № 2 С. 65–77
Background: the effectiveness of a primary screening system for psychiatric disorders in the population can be enhanced by using short scales with high psychometric properties. Aim: to evaluate the psychometric properties of the Russian version of the Kessler Psychological Distress Scale. Participants and Methods: the data from three online surveys conducted as part of the study ...
Added: July 7, 2025
Statistically distinguishable rating scale
Pomazanov M. V., / arXiv. Серия q-fin.RM "Quantitative Finance > Risk Management". 2025.
The article proposes a method of designing a statistically distinguishable rating scale that is not excessive in relation to the existing observation statistics. This allows for more stable validation with a fixed maximum number of violations of the Wald criterion compared to an excess scale, which is usually used by banks. The increased robustness of ...
Added: March 19, 2025
WAM, SWAN and WAVEWATCH III in the Finnish archipelago–the effect of spectral performance on bulk wave parameters
Björkqvist J., Vähä-Piikkiö O., Alari V. et al., Journal of Operational Oceanography 2020 Vol. 13 No. 1 P. 55–70
WAM, SWAN and WAVEWATCH III® were implemented to the Finnish archipelago with a 0.1 nmi grid. A comparison with coastal wave buoy observations showed that the models agreed on the significant wave height, with biases and root-mean-square-errors (RMSE) differing at most 0.06 m. In a general sense, WAM propagated most long wave energy into the archipelago, while SWAN ...
Added: February 11, 2025
WAM, SWAN and WAVEWATCH III in the Finnish archipelago – the effect of spectral performance on bulk wave parameters
Björkqvist J. -., Vähä-Piikkiö O., Alari V. et al., Journal of Operational Oceanography (United Kingdom) 2020 Vol. 13 No. 1 P. 55–70
WAM, SWAN and WAVEWATCH III® were implemented to the Finnish archipelago with a 0.1 nmi grid. A comparison with coastal wave buoy observations showed that the models agreed on the significant wave height, with biases and root-mean-square-errors (RMSE) differing at most 0.06 m. In a general sense, WAM propagated most long wave energy into the archipelago, while SWAN ...
Added: December 10, 2024
Психометрические свойства русскоязычной версии Шкалы воспринимаемого стресса (версии PSS-4, 10, 14)
Zolotareva A., Клиническая и специальная психология 2023 Т. 12 № 1 С. 18–42
This study was aimed to adapt and analyze the psychometric properties of the Perceived Stress Scale (PSS) in its full (PSS-14) and two short versions (PSS-10, PSS-4). Psychometric analysis of the Russian versions of the PSS was performed on a sample of 558 Russianspeaking respondents, including 278 men and 280 women aged 18 to 78 ...
Added: April 18, 2023
Как увеличить годовую норму прибыльности розничного портфеля, оптимизируя уровень отказа и повышая силу дискриминации?
Pomazanov M. V., Риск-менеджмент в кредитной организации 2022 Т. 48 № 4 С. 31–39
Описанный подход к валидации риск-менеджмента розничного портфеля применялся в крупнейших банках и дал безупречно обос­нованный результат, ускоривший коррекцию риск-политик в существенных сегментах розничных продуктов вплоть до закрытия одних и расширения планов размещения других. На основе оценки экономической выгоды от усиления риск-менеджмента с учетом текущих и плановых объемов могут быть обоснованно увеличены бюджеты на расширение аналитического ...
Added: December 20, 2022
Second-order accuracy metrics for scoring models and their practical use
M. V. Pomazanov, , in: 9th International Conference on Information Technology and Quantitative ManagementIssue 214.: Elsevier, 2022. P. 565–572.
The paper proposes new second-order accuracy metrics for scoring/rating models, which show the target preference of the model - it is better to diagnose "good" objects or better to diagnose "bad" ones for a constant generally accepted predictive power determined by the first-order metric - the Gini index. Two metrics proposed, they have both an ...
Added: December 9, 2022
Validation of the effectiveness of the bank retail portfolio risk management procedure
Pomazanov M. V., , in: The 8th International Conference on Information Technology and Quantitative Management (ITQM 2020 & 2021): Developing Global Digital Economy after COVID-19Vol. 199: The 8th International Conference on Information Technology and Quantitative Management (ITQM 2020 & 2021): Developing Global Digital Economy after COVID-19.: Manchester: Elsevier, 2022. P. 798–805.
The article considers the issue of quantifying the quality of discrimination (approval) in the retail portfolio segments based on current statistical data on the level of refuse of customers who applied to the bank, the current level of defaults and market data (credit history bureaus). The analysis of the economic efficiency of the practiced level ...
Added: November 18, 2022
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit