• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • Transformers: “The End of History” for Natural Language Processing?
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
May 15, 2026
Preserving Rationality in a Period of Turbulence
The HSE International Laboratory for Logic, Linguistics and Formal Philosophy studies logic and rationality in a transformed world characterised by a diversity of logical systems and rational agents. The laboratory supports and develops academic ties with Russian and international partners. The HSE News Service spoke with the head of the laboratory, Prof. Elena Dragalina-Chernaya, about its work.
May 15, 2026
‘All My Time Is Devoted to My Dissertation
Ilya Venediktov graduated from the Master’s programme at the HSE Tikhonov Moscow Institute of Electronics and Mathematics through the combined Master’s–PhD track and is currently studying at the HSE Doctoral School of Engineering Sciences. At present, he is undertaking a long-term research internship at the University of Science and Technology of China in Hefei, where he is preparing his dissertation. In this interview, he explains how an internship differs from an academic mobility programme, discusses his research topic, and describes the daily life of a Russian doctoral student in China.
May 15, 2026
‘What Matters Is Not What You Study, but Who You Study with
Katerina Koloskova began studying Arabic expecting to give it up after a year—now she cannot imagine her life without it. In an interview for the Young Scientists of HSE University project, she spoke about two translated books, an expedition to Socotra, and her love for Bethlehem.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Transformers: “The End of History” for Natural Language Processing?

P. 677–693.
Chernyavskiy A., Ilvovsky D., Nakov P.
Language: English
Full text
DOI
Text on another site
Keywords: segmentationtransformerssequence classification
Publication based on the results of:
Синтез логических и статистических методов машинного обучения для междисциплинарных приложений (2021)

In book

Machine Learning and Knowledge Discovery in Databases. Research Track: European Conference, ECML PKDD 2021, Bilbao, Spain, September 13–17, 2021, Proceedings,
* 3. , Springer, 2021.
Similar publications
The Third Visual Object Tracking Segmentation VOTS2025 Challenge Results
Kristan M., Matas J., Tokmakov P. et al., , in: 2025 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW).: Honolulu: IEEE, 2025. P. 7472–7490.
The VOTS2025 is the third edition of the Visual Object Tracking Segmentation benchmark. Organised the VOT initiative, VOTS builds on 10 years of experience in organising VOT challenges. Building on the tracking setup introduced in VOTS2023, the challenge continues to integrate short-term and long-term tracking, as well as single-target and multi-target scenarios, using segmentation masks ...
Added: May 3, 2026
Определение фолликулярного резерва яичников по данным ультразвукового исследования на основе методов машинного обучения
Moshkin A., Лапутин Ф. А., Сидоров И. В., DIGITAL DIAGNOSTICS 2024 Т. 5 № S1 С. 40–42
BACKGROUND: Ovarian reserve reflects a woman's ability to successfully realize reproductive function. The assessment of ovarian reserve is an urgent task for clinical practice [1] and is important in scientific research. The use of computerized diagnostic image processing methods can accelerate and facilitate the performance of routine tasks in clinical practice. Their use in retrospective ...
Added: February 21, 2026
Multimodal graph, surface, and language-based model for protein protein interaction prediction
Arteaga Moreano B. D., Chervov N., Poptsova M., Scientific Reports 2026 Vol. 16 No. 1 Article 4772
Accurate prediction of protein-protein interactions (PPIs) is fundamental to understanding biological processes and disease mechanisms. While deep learning offers a powerful alternative to costly experimental methods, existing approaches often overlook critical protein-surface information and rely on simplistic feature fusion techniques, thereby limiting performance. To address this, we introduce GSMFormer-PPI, a novel multimodal framework that integrates ...
Added: February 4, 2026
Segmentation of Vertebral Arteries on the MR Images
Prikhodko R., Moshkin A., Romanov A., , in: 2025 International Russian Automation Conference (RusAutoCon).: IEEE, 2025. P. 273–278.
The vertebral arteries are one of the most important sources of blood supply to the brain, therefore any pathological changes in them can be the reason behind serious diseases. Magnetic Resonance Imaging (MRI) allows diagnosticians to examine main arteries, which is exceptionally important for effective diagnosis. However, because of the small size of arteries relative ...
Added: November 6, 2025
GraphTyper: Вывод типов из графовой репрезентации кода посредством нейронных сетей
Арутюнов Г. А., Avdoshin S. M., Труды Института системного программирования РАН 2024 Т. 36 № 4 С. 69–80
Although software development is mostly a creative process, there are many scrutiny tasks. As in other industries, there is a trend for automation of routine work. In many cases, machine learning and neural networks have become a useful assistant in that matter. Programming is not an exception: GitHub has stated that Copilot is already used ...
Added: November 1, 2024
Mass Higher Education and the Changing Labour Market for Graduates: Between Employability and Employment
Edward Elgar Publishing, 2024.
As higher education continues to expand and an increasing number of graduates enter the workforce, this insightful book considers the crucial social and economic questions raised by this societal shift. Fátima Suleman, Pedro Videira and Pedro Teixeira bring together an array of experts to illustrate the connections between higher education and the labour market across ...
Added: August 15, 2024
Analyzing the Robustness of Vision & Language Models
Shirnin A., Andreev N., Potapova S. et al., IEEE/ACM Transactions on Speech and Language Processing 2024 Vol. 32 P. 2751–2763
We present an approach to evaluate the robustness of pre-trained vision and language (V&L) models to noise in input data. Given a source image/text, we perturb it using standard computer vision (CV) / natural language processing (NLP) techniques and feed it to a V&L model. To track performance changes, we explore the problem of visual ...
Added: July 19, 2024
Grammar in Language Models: BERT Study
Chistyakova K., Kazakova Tatiana, / NRU HSE. Series WP BRP "Linguistics". 2023. No. 115.
The problem of language models’ interpretation is extensively inspected, but no universal answers have been found. Our study offers to combine widely accepted probing methods with a novel approach to a neural network under investigation. We propose to break grammatical forms on the pre-training step in order to get two "sibling" models, as it casts ...
Added: November 29, 2023
Segmentation of Prostate Cancer on TRUS Images Using ML
Zaev R., Romanov A., Solovyev R., , in: 2023 International Russian Smart Industry Conference (SmartIndustryCon), 27-31 March 2023.: Sochi: IEEE, 2023. P. 460–465.
Medical research has made tremendous progress in detecting various pathologies in the human body. There is still the problem of the speed of the process, and the lack of a sufficient number of trained professionals in this field. Detection of prostate cancer, in particular, without surgery is a very labor- intensive process. A neural network-based ...
Added: July 30, 2023
A Pipeline for Traffic Accident Dataset Development
Stepanyants V., Mantsa Andzhusheva, Romanov A., , in: 2023 International Russian Smart Industry Conference (SmartIndustryCon), 27-31 March 2023.: Sochi: IEEE, 2023. P. 621–626.
Many traffic accidents happen on the roads every day and a lot of them are captured on traffic or dashboard cameras. This data could be used to train machine learning models to predict dangerous situations so that they can be prevented. For that, it should be organized into datasets. Nowadays a limited amount of traffic ...
Added: July 30, 2023
CrowdChecked: Detecting Previously Fact-Checked Claims in Social Media
Hardalov M., Chernyavskiy A., Koychev I. et al., , in: Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 1: Long Papers).: Association for Computational Linguistics, 2022. P. 266–285.
While there has been substantial progress in developing systems to automate fact-checking, they still lack credibility in the eyes of the users. Thus, an interesting approach has emerged: to perform automatic fact-checking by verifying whether an input claim has been previously fact-checked by professional fact-checkers and to return back an article that explains their decision. ...
Added: May 21, 2023
Batch-Softmax Contrastive Loss for Pairwise Sentence Scoring Tasks
Chernyavskiy A., Ilvovsky D., Kalinin P. et al., , in: Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2022).: Association for Computational Linguistics, 2022. P. 116–126.
The use of contrastive loss for representation learning has become prominent in computer vision, and it is now getting attention in Natural Language Processing (NLP).Here, we explore the idea of using a batch-softmax contrastive loss when fine-tuning large-scale pre-trained transformer models to learn better task-specific sentence embeddings for pairwise sentence scoring tasks.We introduce and study ...
Added: October 4, 2022
Training Transformers Together
Borzunov A., Ryabinin M., Dettmers T. et al., , in: Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track.: PMLR, 2022. P. 335–342.
Added: July 27, 2022
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics
Association for Computational Linguistics, 2022.
Uncertainty estimation (UE) of model predictions is a crucial step for a variety of tasks such as active learning, misclassification detection, adversarial attack detection, out-of-distribution detection, etc. Most of the works on modeling the uncertainty of deep neural networks evaluate these methods on image classification tasks. Little attention has been paid to UE in natural ...
Added: May 17, 2022
Fine-Tuning Transformers: Vocabulary Transfer
Samenko I., Tikhonov A., Kozlovskii B. et al., / Series Computer Science "arxiv.org". 2021.
Transformers are responsible for the vast majority of recent advances in natural language processing. The majority of practical natural language processing applications of these models is typically enabled through transfer learning. This paper studies if corpus-specific tokenization used for fine-tuning improves the resulting performance of the model. Through a series of experiments, we demonstrate that ...
Added: January 17, 2022
It’s All in the Heads: Using Attention Heads as a Baseline for Cross-Lingual Transfer in Commonsense Reasoning
Tikhonov A., Ryabinin M., , in: Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.: Association for Computational Linguistics, 2021. P. 3534–3546.
Added: September 30, 2021
LIORI at SemEval-2021 Task 8: Ask Transformer for measurements
Davletov A., Gordeev D., Nikolay Arefyev et al., , in: Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021).: Association for Computational Linguistics, 2021. P. 1249–1254.
This work describes our approach for subtasks of SemEval-2021 Task 8: MeasEval: Counts and Measurements which took the official first place in the competition. To solve all subtasks we use multi-task learning in a question-answering-like manner. We also use learnable scalar weights to weight subtasks’ contribution to the final loss in multi-task training. We fine-tune ...
Added: September 23, 2021
Оттенки "зеленой" коммуникации в ретейле: экономический анализ учета экотренда
Lebedev A. V., Исраелян Е. А., Маркетинговые коммуникации 2021 № 02(114) С. 124–138
During the pandemic, health has become a core value. Grocery retailers realized the benefits of communicating with eco-consumers. The authors identified customer segments based on their attitude to healthy lifestyles. The article provides data on the gender and age structure of consumers with the average values of receipts, describes portraits of different types of eco-consumers ...
Added: June 10, 2021
Spatially intermixed objects of different categories are parsed automatically
Khvostov V., Lukashevich A., Utochkin I. S., Scientific Reports 2021 No. 11 P. 1–8
Added: January 26, 2021
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit