• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Articles
  • Multilingual hope speech detection: A Robust framework using transfer learning of fine-tuning RoBERTa model
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
June 25, 2026
HSE Researchers Make Aldehydes Perform Dual Function
Chemists from HSE University have discovered a way to carry out a reductive addition reaction without using an external reducing agent. Instead, the required 'resource' is supplied by the aldehyde itself, one of the reaction participants. This approach helps prevent unwanted side reactions, reduces toxicity, and simplifies the production and synthesis of organic molecules, including those used in the manufacture of medicines. The study has been published in Journal of Catalysis.
June 25, 2026
HSE Scientists Explain Why Findings in Autism Research Differ
Researchers from the Cognitive Health and Intelligence Centre at HSE University conducted the first-ever systematic review of studies on the specifics of emotion-from-motion perception in autism. The review showed that differences found between autistic and non-autistic individuals are largely associated with the experimental design and the types of tasks given to study participants. The review findings have been published in Research in Autism.
June 22, 2026
‘In Science, You Are Your Own Boss
Polina Nasledskova is interested in identifying gaps in linguistics and topics that have been overlooked by other researchers. In an interview for the  Young Scientists of HSE University project, she spoke about rare ordinal numerals in Nakh-Daghestanian languages, the benefits of knitting for concentration, and the beauty of the Patriarshy Bridge.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Multilingual hope speech detection: A Robust framework using transfer learning of fine-tuning RoBERTa model

Journal of King Saud University - Computer and Information Sciences. 2023. Vol. 35. No. 8. Article 101736.
Malik M. S., Nazarova A., Mona M. J., Ignatov D. I.

Hope Speech Detection (HSD) from social media is a new direction for promoting and supporting positive content to encourage harmony and positivity in society. As users of social media belong to different linguistic communities, hope speech detection is rarely studied as a multilingual task considering low-resource languages. Moreover, prior studies explored only monolingual techniques, and the Russian language is not addressed. This study tackles the issue of Multi-lingual Hope Speech Detection (MHSD) in English and Russian languages using the transfer learning paradigm with fine-tuning approach. We explore joint multi-lingual and translation-based approaches to tackle the task of multilingualism, where the latter approach adopts the translation mechanism to transform all content into one language and then classify them. The joint multi-lingual method handles it by designing a universal classifier for various languages. We explore the strengths of the Robustly Optimized BERT Pre-Training Approach (RoBERTa) that showed a benchmark in capturing the semantics and contextual information within the content. The proposed framework consists of several stages: 1) data preprocessing, 2) representation of data using RoBERTa models, 3) fine-tuning phase, and 4) classification of hope speech into two labels. A new Russian corpus for hope speech detection is built, containing YouTube comments. Several experiments are conducted in English and Russian languages by using semi-supervised bilingual English and Russian datasets. The findings show that the proposed framework demonstrated benchmark performance and outperformed the baselines. Furthermore, the translation-based approach (Russian-RoBERTa) offered the best performance by achieving 94% accuracy and 80.24% f1-score.

Research target: Computer Science
Language: English
Full text
DOI
Text on another site
Keywords: transfer learningthe Russian languageXLM-RoBERTaHope speechTranslation-basedMulti-lingual
Publication based on the results of:
Models and method for analysis of unstructured data, data mining and recommender systems (2023)
Similar publications
Incorporating Scientific Knowledge into Neural Network Density Functionals
Medvedev M., Journal of Chemical Theory and Computation 2026 Vol. 22 No. 9
Density functional theory (DFT) is the workhorse of modern reactions and materials modeling. While the exact functional remains unknown, many approximations to it have been constructed either by hand-crafting functional forms to satisfy exact constraints or by machine learning. In this work, we show how both of these approaches can be fused to build both ...
Added: June 26, 2026
Моделирование полностью роботизированного склада со стеллажами глубокого хранения
Gadzhimirzaev S., Хельвас А. В., Computer Research and Modeling 2026 Vol. 18 No. 2 P. 423–438
This article presents a model of a fully automated warehouse with deep storage racks designed for boxed goods storage. The study focuses on optimizing warehouse operations through discrete multiagent simulation of shuttle movements for pallet loading and unloading tasks. The authors investigate various product placement strategies, including the Nearest Channel Positioning Algorithm (NCPA), Most Empty ChannelGroup Placement (MECGP), and ...
Added: June 24, 2026
A machine learning dataset on winter roads of Krasnoyarsk Krai, Russia for the forestry and infrastructural projects
Podolskaia E., Sinitsina A., European Journal of Forest Engineering 2026 Vol. 12 No. 1 P. 7–21
Machine learning in transport modeling has become a trend in science and industry. In this paper, we observe its main directions and focus on a dataset of seasonal road creation. Seasonality as a parameter in transport modeling has a significant impact on transport scenarios but is underestimated worldwide and in Russia, despite modern data challenges. ...
Added: June 24, 2026
The state and prospects of using virtual reality technologies in sports: a brief review
Atlasov B., Selskiy A., Russian Journal of Information Technology in Sports 2025 Vol. 2 No. 1 P. 13–21
The article examines the current state of the global virtual and augmented reality (VR/AR) technology market in sports, noting its growth, although slower than previously expected. Special attention is paid to the Russian market, where the development of VR technologies in sports lags behind world leaders such as the United States, EU countries and China, ...
Added: June 23, 2026
AI & PDE: ICLR 2026 Workshop on AI and Partial Differential Equations
[б.и.], 2026.
Added: June 23, 2026
2025 9th International Conference on Information, Control, and Communication Technologies (ICCT-2025)
IEEE, 2026.
The 9th International Scientific Conference on Information, Control, and Communication Technologies (ICCT-2025) had been held October 7-11, 2025 in Gomel, Belarus. The main technical areas and applications covered by the proceedings are optoelectronics, acousto-optic, microwave technology, antenna systems, measuring technology, metamaterials, nanostructures, nanofilms, photonic crystals, biology and medicine, biophotonics, bioengineering, neural networks in communication technologies; ...
Added: June 23, 2026
Proceedings of the 4th Workshop on NLP for Music and Audio (NLP4MusA 2026)
Buzaev F., Mullakhmetov R., Bogachev R. et al., Association for Computational Linguistics, 2026.
Playlist generation based on textual queries using large language models (LLMs) is becoming an important interaction paradigm for music streaming platforms. User queries span a wide spectrum from highly personalized intent to essentially catalog-style requests. Existing systems typically rely on non-personalized retrieval/ranking or apply a fixed level of preference conditioning to every query, which can ...
Added: June 22, 2026
Zα and Zβ Localize ADAR1 to Flipons That Modulate Innate Immunity, Alternative Splicing, and Nonsynonymous RNA Editing
Herbert A., Cherednichenko O., Lybrand T. et al., International Journal of Molecular Sciences 2025 Vol. 26 No. 6 Article 2422
The double-stranded RNA editing enzyme ADAR1 connects two forms of genetic programming, one based on codons and the other on flipons. ADAR1 recodes codons in pre-mRNA by deaminating adenosine to form inosine, which is translated as guanosine. ADAR1 also plays essential roles in the immune defense against viruses and cancers by recognizing left-handed Z-DNA and ...
Added: June 22, 2026
Международная конференция «Математические идеи академика П.Л. Чебышёва, их приложения в естественных науках и технологи- ях искусственного интеллекта», приуроченная к 205-й годовщине со дня его рождения» : Материалы конференции. / (Обнинск, 14–16 мая 2026 г.): Материалы конференции. Под ред. акад. В.Б. Бетелина. — Калуга: Калужский печатный двор, 2026. — 232 с.
Калужский печатный двор, 2026.
Conference Proceedings INTERNATIONAL CONFERENCE “Mathematical Ideas of Academician P.L. Chebyshev, Their Applications in Natural Sciences and Artificial Intelligence Technologies” dedicated to the 205th anniversary of his birth ...
Added: June 20, 2026
ИНТЕГРАЦИЯ ТЕХНОЛОГИИ ГЕНЕРАТИВНОГО ИСКУССТВЕННОГО ИНТЕЛЛЕКТА В ОБРАЗОВАТЕЛЬНЫЙ ВИДЕОКОНТЕНТ
Stognieva O., Чеснокова Н. Е., Отечественная и зарубежная педагогика 2026 Т. 1 № 3 (115) С. 123–131
Integration of generative artificial intelligence tools into educational practice highlights the need for pedagogically grounded approaches to their use in the creation of educational video content, which is increasingly applied in language and professionally oriented instruction. The purpose of this article is to conduct a comparative analysis of educational video content created using generative AI tools ...
Added: June 20, 2026
Benchmarking DNA large language models on quadruplexes
Cherednichenko O., Herbert A., Poptsova M., Computational and Structural Biotechnology Journal 2025 Vol. 27 P. 992–1000
Large language models (LLMs) in genomics have successfully predicted various functional genomic elements. While their performance is typically evaluated using genomic benchmark datasets, it remains unclear which LLM is best suited for specific downstream tasks, particularly for generating whole-genome annotations. Current LLMs in genomics fall into three main categories: transformer-based models, long convolution-based models, and state-space models ...
Added: June 19, 2026
Kolmogorov–Arnold networks for genomic tasks
Poptsova M., Briefings in Bioinformatics 2025 Vol. 26 No. 2 P. 1–11
Kolmogorov–Arnold networks (KANs) emerged as a promising alternative for multilayer perceptrons (MLPs) in dense fully connected networks. Multiple attempts have been made to integrate KANs into various deep learning architectures in the domains of computer vision and natural language processing. Integrating KANs into deep learning models for genomic tasks has not been explored. Here, we ...
Added: June 19, 2026
Графовые паттерны в несогласованных декларативных моделях процессов
Анненков А. Н., Nesterov R., Моделирование и анализ информационных систем 2026 Т. 33 № 2 С. 176–205
Declarative process models are widely used in process mining to describe flexible process behavior through sets of constraints. However, models discovered automatically from event logs may contain inconsistent constraints, which can make them difficult to interpret and unusable for execution, conformance checking, or further analysis. Existing methods for consistency analysis either rely on automata-based constructions ...
Added: June 18, 2026
Advances in Information Retrieval: 48th European Conference on Information Retrieval, ECIR 2026, Delft, The Netherlands, March 29 – April 2, 2026, Proceedings, Part II. (LNCS, volume 16484)
Cham: Springer Publishing Company, 2026.
The four-volume set LNCS 16483-16486 constitutes the refereed conference proceedings of the 48th European Conference on Information Retrieval, ECIR 2026, held in Delft, The Netherlands, during March 29–April 2, 2026. The 46 full papers and 37 short papers presented together with 10 findings papers, 9 reproducibility papers, 17 resource papers, 11 workshop papers, 7 tutorial papers, ...
Added: June 18, 2026
Искусственный интеллект как роза научной деятельности: исследование Тимоти Гауэрса
Poddiakov A., Троицкий вариант. Наука 2026 № 12 С. 24–25
В научно-популярной заметке представлен обзор содержания поста филдсовского медалиста Тимоти Гауэрса о возможностях ИИ в математике и содержания комментариев под постом. Обзор сделан в основном чат-ботом DeepSeek. В заключение обсуждается возможность не только решения задач искусственным интеллектом, но и их постановки. ...
Added: June 18, 2026
Exploring New Frontiers in Vertical Federated Learning: the Role of Saddle Point Reformulation
Beznosikov A., Kormakov G., Grigorievskiy A. et al., Journal of Optimization Theory and Applications 2026 Vol. 209 Article 18
The objective of Vertical Federated Learning (VFL) is to collectively train a model using features available on different devices while sharing the same users. This paper focuses on the saddle point reformulation of the VFL problem via the classical Lagrangian function. We first demonstrate how this formulation can be solved using deterministic methods.More importantly, we explore various stochastic modifications to ...
Added: June 17, 2026
Supervised Learning in Critical Phenomena—Statistical and Systematic Accuracy
Chertenkov V. I., Shchur L., Lobachevskii Journal of Mathematics 2026 Vol. 47 No. 2 P. 720–727
Supervised machine learning is successfully applied to the study of critical phenomena and allows us to obtain a numerical estimate of the phase transition temperature and the correlation length exponent. We discuss the influence of possible systematic errors, as well as statistical errors, on the accuracy of such numerical estimates. Errors in the training and ...
Added: June 16, 2026
Enhancing Emotion Recognition in Speech Based on Self-Supervised Learning: Cross-Attention Fusion of Acoustic and Semantic Features
Deeb B., Andrey V. Savchenko, Makarov I., IEEE Access 2026 Vol. 13 P. 56283–56295
Speech Emotion Recognition has gained considerable attention in speech processing and machine learning due to its potential applications in human-computer interaction, mental health monitoring, and customer service. However, state-of-the-art models for speech emotion recognition use many parameters, which leads to computational complexity. In this paper, we introduce a novel deep-learning model to enhance the accuracy ...
Added: June 16, 2026
Automated detection of wolf howls using audio spectrogram transformers
Makarov N., Savchenko A., Zemtsova I. et al., Scientific Reports 2025 Vol. 15 Article 26641
The grey wolf (Canis lupus) is a pivotal species for ecological studies. As a key participant in ecosystem processes, it also serves as a model for investigating social structure formation and ecological adaptation. However, the species’ complex social behavior, spatial dynamics, and expansive habitats make monitoring and population assessments across large areas particularly challenging. In recent years, audio traps ...
Added: June 16, 2026
Artificial intelligence framework for multi-pathology risk assessment from retinal fundus images: deep learning approach to 15-disease screening
Vasilev R., Savchenko A., Blinov P. et al., Frontiers in Medicine 2026 Vol. 13
Automated disease screening systems face challenges when applied to multi-class medical image analysis, particularly under severe class imbalance inherent in clinical datasets. Retinal fundus imaging enables non-invasive screening for multiple ocular and systemic diseases simultaneously, yet current automated systems typically assess risk for only a single pathology or a limited disease range. We developed a ...
Added: June 16, 2026
From Data to Signs: A Foundation Model for Multilingual Sign Language Recognition
Novopoltsev M., Tulenkov A., Murtazin R. et al., IEEE Access 2025 Vol. 13 P. 188170–188181
Video-based Isolated Sign Language Recognition (ISLR) problem presents significant challenges in scaling across diverse languages due to data scarcity and the computational costs associated with training of language-specific models. In this paper, we introduce a novel training pipeline that leverages self-supervised learning on a large-scale sign language dataset. To obtain the foundation model, we utilize ...
Added: June 16, 2026
Extraction of properties of anisotropic spin model by deep transfer learning methods
D.D. Sukhoverkhova, L.N. Shchur, , in: Параллельные вычислительные технологии – XIX всероссийская конференция с международным участием, ПаВТ'2025. Короткие статьи и описания плакатов.: Издательский центр ЮУрГУ, 2025. P. 82–89.
We apply supervised deep machine learning techniques to extract properties of the anisotropic Ising model. We consider two cases of anisotropy: orthogonal and diagonal. From the predictions of the neural network, we obtained phase probability functions, from which we measured two quantities: the critical temperature and the critical exponent of the correlation length. We estimated ...
Added: December 4, 2025
Machine Learning Domain Adaptation in Spin Models with Continuous Phase Transitions
Chertenkov V., Shchur L., Physical Review E - Statistical, Nonlinear, and Soft Matter Physics 2025 Vol. 112 No. 3 Article 034104
The main question raised in the  article  is whether a neural network trained on a spin lattice model in one universality class   can be used to test a model in another universality class. The quantities of interest are the critical phase transition temperature and the correlation length exponent. In other words, the question of ...
Added: August 12, 2025
Сборник статей по результатам IX научной межвузовской конференции молодых ученых «Пространство научных интересов: иностранные языки и межкультурная коммуникация — современные векторы развития и перспективы»
Гаевская М. А., М.: НИУ ВШЭ, 2025.
The article focuses on the 2000s from the axiological linguistics perspective. The author explores how the decade functions as an axiogenic (creating or forming values) context. Using such methods as literature review, corpus analysis method, and the method of axiological analysis the author has found and described the axiological field formed within the first 10 ...
Added: May 12, 2025
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit