• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Articles
  • How to detect propaganda from social media? Exploitation of semantic and fine-tuned language models
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
April 28, 2026
Scientists Develop Algorithm for Accurate Financial Time Series Forecasting
Researchers at the HSE Faculty of Computer Science benchmarked more than 200,000 model configurations for predicting financial asset prices and realised volatility, showing that performance can be improved by filtering out noise at specific frequencies in advance. This technique increased accuracy in 65% of cases. The authors also developed their own algorithm, which achieves accuracy comparable to that of the best models while requiring less computational power. The study has been published in Applied Soft Computing.
April 27, 2026
Fair Division: How Mathematics Helps to Divide the Indivisible
How can items be allocated among participants so that no one feels short-changed? Alexander Karpov, Assistant Professor at the Faculty of Economic Sciences, and his Singaporean colleague, Prof. Warut Suksompong, set out to find a mathematical answer to this question. In this interview, they discuss how a model of rational preferences is constructed, why one cannot rely on a simple sum of values, and where an algorithm that asks a minimal number of questions can be useful.
April 24, 2026
Electronics of the Future: Why Superconductors and Spintronics Work Together
It was once believed that superconductivity and magnetism avoided each other like the devil avoids holy water. However, modern nanostructures prove the opposite. A Russian theoretical physicist and Indian experimentalists have joined forces to create the electronics of the future—free from energy losses. Nataliya Pugach, Professor at the School of Electronic Engineering at HSE MIEM and Leading Research Fellow at the Quantum Nanoelectronics Laboratory, explains how a long-standing acquaintance in Cambridge grew into a mirror laboratory project with the Indian Institute of Technology Bombay (IIT Bombay), how superconducting spintronics works, and what surprises a researcher in India beyond the university campus.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

How to detect propaganda from social media? Exploitation of semantic and fine-tuned language models

PeerJ Computer Science. 2023. Vol. 9. Article e1248.
Malik M. S., Imran T., Mona Mamdouh J.

 

Online propaganda is a mechanism to influence the opinions of social media users. It is a growing menace to public health, democratic institutions, and public society. The present study proposes a propaganda detection framework as a binary classification model based on a news repository. Several feature models are explored to develop a robust model such as part-of-speech, LIWC, word uni-gram, Embeddings from Language Models (ELMo), FastText, word2vec, latent semantic analysis (LSA), and char tri-gram feature models. Moreover, fine-tuning of the BERT is also performed. Three oversampling methods are investigated to handle the imbalance status of the Qprop dataset. SMOTE Edited Nearest Neighbors (ENN) presented the best results. The fine-tuning of BERT revealed that the BERT-320 sequence length is the best model. As a standalone model, the char tri-gram presented superior performance as compared to other features. The robust performance is observed against the combination of char tri-gram + BERT and char tri-gram + word2vec and they outperformed the two state-of-the-art baselines. In contrast to prior approaches, the addition of feature selection further improves the performance and achieved more than 97.60% recall, f1-score, and AUC on the dev and test part of the dataset. The findings of the present study can be used to organize news articles for various public news websites.

Research target: Computer Science
Language: English
Full text
DOI
Text on another site
Keywords: propagandasemantic analisysbinary modellinguisticword2vecnews mediaBERT
Publication based on the results of:
Models and method for analysis of unstructured data, data mining and recommender systems (2023)
Similar publications
Bioinspired Method of Agent Redistribution between Groups
Karpova Irina Petrovna, Pattern Recognition and Image Analysis 2025 Vol. 35 No. 4 P. 1138–1144
A solution to the problem of redistributing agents between groups based on simulating a form of social parasitism in ants known as slave-making is considered. To provide a comprehensive solution, the problem is integrated with a method of orientation based on visual landmarks and a compass, including route memorization and return. The models and mechanisms ...
Added: April 29, 2026
Natural hazard database from Internet publications: text mining with a large language model
Derkacheva A., Sakirkina M., Kraev G. et al., /. 2026.
Comprehensive data on natural hazards and their consequences are crucial for effective for risk assessment, adaptation planning, and emergency response. However, many countries face challenges with fragmented, inconsistent, and inaccessible data, particularly regarding local-scale events. To address this data gap in Russia, we developed an end-to-end processing pipeline that scrapes news from various online sources, ...
Added: April 28, 2026
Influence of the Normal Magnetic Component to Magnetotail Current Sheet Forma
Domrin V. I., Malova H. V., V. Yu. Popov et al., Cosmic Research 2026 Vol. 64 No. 2 P. 238–252
During magnetospheric perturbations a relatively thin current sheet with thickness about several proton gyroradii forms in the Earth’s magnetotail. In a framework of the kinetic model describing current sheet thinning in the magnetotail, the processes of its formation are investigated depending on the normal magnetic field magnitude which affects both the current sheet structure and particle dynamics within ...
Added: April 27, 2026
Asymmetric Equilibrium Structures of Superthin Current Sheets: The Asymmetry of Plasma Sources
Tsareva O. O., Malova H. V., V. Yu. Popov et al., Plasma Physics Reports 2026 Vol. 52 No. 2 P. 179–185
The influence of asymmetry of plasma sources on the structure and spatial localization of a superthin current sheet (STCS) supported by demagnetized electrons is studied using a self-consistent model. The simulation takes into account the presence of a single plasma source in the northern hemisphere, which makes the plasma flow asymmetric. It is demonstrated that the asymmetry of ...
Added: April 27, 2026
WWW '26: The ACM Web Conference 2026
NY: Association for Computing Machinery (ACM), 2026.
It is our great pleasure to welcome you to the 35th edition of the Web Conference to be held on June 29 – July 3, 2026, in Dubai, United Arab Emirates. Following discussions with our partners and key stakeholders, we have taken the decision to postpone the ACM Web Conference 2026, initially planned for April 2026. ...
Added: April 23, 2026
Разработка микросервиса ADP для идентификации источников выбросов на основе машинного обучения с подкреплением
Kychkin A., Chernitsin I., Прикладная информатика 2026 Т. 21 № 1 С. 40–58
The results of the development of a software microservice embedded in atmospheric air quality monitoring systems to support the identification of industrial pollution sources are presented. The emission and subsequent spread of harmful substances in the lower layers of the atmosphere is dynamic and characterized by high uncertainty due to the specific features of technological ...
Added: April 23, 2026
2026 International Conference on Artificial Intelligence, Computer, Data Sciences and Applications (ACDSA)
IEEE, 2026.
Added: April 21, 2026
What Drives Multi-Chain Crypto Forecasting: Model Choice, Feature Selection, and Transferability
Wang M., Xiao Y., Braslavski P. et al., Mathematics 2026 Vol. 14 No. 8 Article 1286
Increasingly shaped by heterogeneous on-chain activity rather than a single shared market process, this study investigates 7-day-ahead forecasting using 147 market and on-chain indicators across eight major blockchain ecosystems from October 2023 to April 2025. We benchmark statistical, deep-learning, and foundation-model baselines under multiple feature-selection pipelines using both error metrics and Diebold–Mariano tests. TiRex achieves ...
Added: April 20, 2026
Cross-influence of two societies in deterministic evolutionary game
Shchur L., Antonov D., Burovski E., International Journal of Bifurcation and Chaos in Applied Sciences and Engineering 2026 P. 1–9
We present a simple model that simulates the possible influence of one society on another. Specifically, two societies evolve deterministically according to the well-known Nowak-May spatial game with the addition of mutual influence through connections that reflect the current states of the societies. This may be related to the influence of a global information resource ...
Added: April 20, 2026
Проектирование сети Интернета вещей на основе многокритериальной оптимизации и информационного моделирования здания
Ebraheem A., Информационные процессы 2025 Т. 25 № 4 С. 787–798
The article proposes a method for planning the placement of access points and gateways inside buildings for constructing Internet of Things networks. The basis of the method is the use of information from a building information model, which makes it possible to easily take into account both the geometry and the physical and technical characteristics ...
Added: April 19, 2026
Modeling cosolvent effects on solubility in supercritical CO2 using data-driven approaches
Makarov D. M., Kalikin N., Gurikov P. et al., Journal of Supercritical Fluids 2026 Vol. 235 Article 106979
Supercritical CO2 (scCO2 ) is an environmentally friendly solvent, but its low polarity limits the solubility of polar compounds. Cosolvents are commonly used to enhance solvation capability, yet comprehensive datadriven studies are scarce. We compiled the largest dataset to date — 4401 experimental solubility records with 22 cosolvents for 93 nonionic solutes, plus 4855 records ...
Added: April 19, 2026
2026 28th International Conference on Digital Signal Processing and its Applications (DSPA)
IEEE, 2026.
A.S. Popov Russian Science and Technical Society with support from V. A. Trapeznikov Institute of Control Sciences, V.A. Kotelnikov Institute of Radio Engineering and Electronics, Autex Ltd. is leading the ХХVIII International Conference «Digital Signal Processing and its Applications — DSPA-2026» ...
Added: April 18, 2026
WWW '26: Proceedings of the ACM Web Conference 2026
NY: Association for Computing Machinery (ACM), 2026.
It is our great pleasure to welcome you to the 35th edition of the Web Conference to be held on June 29 – July 3, 2026, in Dubai, United Arab Emirates. Following discussions with our partners and key stakeholders, we have taken the decision to postpone the ACM Web Conference 2026, initially planned for April 2026. ...
Added: April 17, 2026
Сопоставление номенклатур товаров ресторанов и поставщиков с помощью LLM — Case Study для ресторанного холдинга
Jin S., Panfilov P., Сулейкин А. С., Труды Института системного программирования РАН 2025 Т. 37 № 6 С. 163–176
In the modern restaurant business, accurate mapping of product nomenclatures between restaurants and suppliers is a critical task. Effective inventory management and procurement optimization directly impact business profitability. With the increase in suppliers and product variety, traditional mapping methods become less efficient. This study proposes using large language models (LLM) to automate and improve the ...
Added: April 17, 2026
Немцы и османы в Афганистане в годы Первой мировой войны: османский джихад, пропаганда и слухи
Шерстюков С. А., Новая и новейшая история 2026 Т. 70 № 1 С. 79–92
Following the outbreak of the First World War, Germany and the Ottoman Empire sought to draw Afghanistan into the conflict on the side of the Central Powers by dispatching diplomatic and military missions to Kabul. Although these efforts failed, the Ottoman proclamation of jihad and the activities of German and Ottoman agents in Afghanistan produced ...
Added: March 6, 2026
Development of a Language Model for Automated Classification of English-Language Scientific Articles by SRSTI Codes
V. V. Zunin, A. I. Afonin, V. I. Anoshin et al., Automatic Documentation and Mathematical Linguistics 2025 Vol. 59 No. 5 P. 287–293
The development of an artificial intelligence-based language model for classifying English-language scientific articles by SRSTI codes is described. This improves the processes of reviewing and indexing scientific publications. A pre-processed dataset of scientific articles was used for training and testing the models. An architecture for cascade classification was developed, and the performance of models with ...
Added: February 11, 2026
Женщины нацистской Германии в зеркале советской карикатуры (на материале журнала «Крокодил»)
Рябов О. В., Уральский исторический вестник 2019 № 3 С. 84–92
The article deals with analysis of the images of the Nazi Germany women created by the Soviet propaganda during the Great Patriotic War by means of satirical graphics. The base of the research is the caricatures published in the “Crocodile” magazine in June 1941 - May 1945. The author demonstrates that the comic images of ...
Added: January 28, 2026
Уголовная ответственность за неоднократные пропаганду и демонстрирование запрещенной федеральным законодательством атрибутики и символики
Agapov P., Шевелева К. В., Вестник Владимирского юридического института 2025 № 2(75) С. 133–140
Abstract. The article is devoted to the study of criminal liability for the crime pro vided for in Article 282*4 of the Criminal Code of the Russian Federation. Based on the analysis of judicial and investigative practice, the authors identify important features of the crime in question from the point of view of criminal law, ...
Added: January 28, 2026
Internarrativity of Authoritarian Rule: A Discourse Analysis of Vladimir Putin’s Speeches between 2012 and 2019
Vershinin I., Canadian Journal of European and Russian Studies 2025 Vol. 18 No. 3 P. 28–64
This research examines how Russia, with a focus on Putin, shapes its messages by analyzing his interactions with foreign journalists between 2012 and 2019. To understand how these messages work together, the study employs the concept of ‘internarrativity,’ which explores how narratives are interlinked and reinforce one another. The author argues that it is essential ...
Added: December 13, 2025
Печатные агитационные материалы как фактор советской миграции в конце 1940-х — 1950-е гг. (на примере Молотовской области)
Glushkov A., Козлова К. О., Новые исследования Тувы 2025 № 4 С. 107–121
The analysis conducted in the study allowed the authors to trace the evolution of Soviet agitation in the first post-war decade, identify the main campaigning techniques that were notable for their diversity, and conclude that at the turn of the 1940s and 1950s, in the context of voluntary resettlement campaigns, the state was forced to ...
Added: December 1, 2025
Trans-paradigmatic syncretism in case form processing in Russian
Chernova D., Slioussar N., Alexeeva S. V. et al., Journal of Slavic Linguistics 2022 Vol. 30 No. 3 P. 1–16
Added: November 29, 2025
Shrink the Longest: Improving Latent Space Isotropy with Simplicial Geometry
Kudrjashov S., Karpik O., Klyshinskiy E., , in: Analysis of Images, Social Networks and Texts, 12th International Conference, AIST 2024, Bishkek, Kyrgyzstan, October 17–19, 2024, Revised Selected PapersVol. 15419.: Springer, 2024. P. 120–130.
Added: May 29, 2025
Высокоуровневая семантическая интерпретация структуры статических моделей для русского языка
Serikov O., Ganeeva V., Аксенова А. А. et al., Вестник Новосибирского государственного университета. Серия: Лингвистика и межкультурная коммуникация 2023 Т. 21 № 1 С. 67–82
Since its inception, the Word2vec vector space has become a universal tool both for scientific and practical activities. Over time, it became clear that there is a lack of new methods for interpreting the location of words in vector spaces. The existing methods included consideration of analogies or clustering of a vector space. In recent ...
Added: April 28, 2025
War of Patriotisms: Propaganda and Mass Sentiments in Russia during the Period of the Empire's Collapse. Moscow: New Literary Review
Nedopekina A., Laboratorium: Russian Review of Social Research 2024 Vol. 16 No. 1 P. 130–134
Vladislav Aksenov’s book whose title translates as The War of Patriotisms: Propaganda and Mass Moods in Russia during the Collapse of the Empire was published in 2023. This book was released in the popular science series What Is Russia? (Chto takoe Rossiia?) by Novoe literaturnoe obozrenie publishing house. Although this work is partly an adaptation ...
Added: April 10, 2025
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit