Detecting Ethnic Conflict in Social Media with Transformers and Augmented Data

?

Detecting Ethnic Conflict in Social Media with Transformers and Augmented Data

Procedia Computer Science. 2025. Vol. 258. P. 2382–2390.

Chest X-ray pathology prediction play a very important role in early disease detection, enabling timely intervention and improving patient outcomes. Detection of ethnic conflict mentioning, discussion, or verbal participation therein in user-generated content is a socially important task, as such content has been proven related to ethnic clashes on the ground. Yet this task has not been studied. One of the reasons is the lack of relevant datasets which calls for the usage of data augmentation techniques, still uncommon for NLP. We propose a solution for Russian language by fine-tuning a pretrained transformer encoder enhanced with several standard and novel data augmentation approaches. The highest quality of F1-macro = 0.8 is obtained with fine-tuned ROBERTA model combined with our novel augmentation technique which generates new training data by randomly swapping ethnonyms. This eliminates classification algorithms’ over-reliance on rare ethnonyms and prevents overfitting. Although the contribution of augmentation is modest, when exposed to a relevant adversarial attack, our model turns out to be the most sustainable with its quality advantage over the baseline reaching 0.05 on the target class. This advantage is achieved by training the model on the texts with randomly replaced ethnonyms which eliminates the model’s over-reliance on ethnonyms occurring exclusively or mostly in a single class in the training set. Thus our approach is expected to be useful for elimination of similar effects in the tasks such as aspect-based sentiment analysis with large numbers of aspects. We also conduct error analysis and conclude which categories of texts usually cause inaccurate prediction

Research target: Social Sciences Computer Science

Language: English

DOI

Publication based on the results of:

Analysis of Human Interaction with Information and Improvement of Algorithms of Information Processing (2025)

Local Fault-Tolerant Routing in 3D Mesh NoCs using Single-Hop Rollback

Edward R. Rzaev, Aleksandr Y. Romanov, Andrey M. Sukhov, IEEE Access 2026

This work presents a hierarchy of strictly local fault-tolerant routing algorithms for 3D mesh networks-on-chip, culminating in an algorithm that combines a live-neighbor selection rule with a bounded single-hop rollback mechanism. The proposed algorithms operate exclusively on immediate neighbor information, maintain O(1) per hop complexity, and require no global topology knowledge, additional virtual channels, or ...

Added: July 23, 2026

Long-range machine-learning potentials with environment-dependent charges enable predicting LO-TO splitting and dielectric constants

Korogod D., Shapeev A., Novikov I., Physical Review B: Condensed Matter and Materials Physics 2026 Vol. 114 No. 2 Article 024104

We present two models with explicit long-range electrostatics in the form of Coulomb interactions. Both models include point charges depending on their local atomic environments, and the second model also conserves a total charge of an atomic system. We combine the proposed long-range models with the local moment tensor potential (MTP) and demonstrate that they ...

Added: July 22, 2026

Global optimization of atomic clusters via physically constrained tensor train decomposition

Sozykin K., Rybin N., Chertkov A. et al., Physical Review B: Condensed Matter and Materials Physics 2026 Vol. 113 No. 22 Article 224111

The global optimization of atomic clusters represents a fundamental challenge in computational chemistry and materials science due to the exponential growth of local minima with system size (i.e., the curse of dimensionality). We introduce a framework that overcomes this limitation by exploiting the low-rank structure of potential energy surfaces through tensor train (TT) decomposition. Our ...

Added: July 22, 2026

Местоимения с фокусным антецедентом в русском языке: кореферентные и связанные употребления в корпусах

Tiskin D., Компьютерная лингвистика и интеллектуальные технологии 2026 No. 24 P. 656–665

Despite a lot of interest for the factors influencing the choice of pronoun (reflexive or personal) with an antecedent in Russian, the role of the anaphotic relation—coreference or semantic binding—has been understudied, including disagreements as to the acceptability of particular data points. To clarify things, I employ large corpora (Araneum and GICR) to study the ...

Added: July 19, 2026

WSI-GT: Pseudo-Label Guided Graph Transformer for Whole-Slide Histology

Михайлов И. А., Machine Learning and Knowledge Extraction 2026 Vol. 8 No. 1 Article 8

Whole-slide histology images (WSIs) can exceed 100 k × 100 k pixels, making direct pixel-level segmentation infeasible and requiring patch-level classification as a practical alternative for downstream WSI segmentation. However, most approaches either treat patches independently, ignoring spatial and biological context, or rely on deep graph models prone to oversmoothing and loss of local tissue ...

Added: July 16, 2026

On the construction of Barnes–Wall lattices and their application in cryptography

Kuninets A., Malygina E., Leevik A. G. et al., Journal of Computer Virology and Hacking Techniques 2026 No. 22 Article 62

In this work, we investigate the application of Barnes–Wall lattices in post-quantum cryptographic schemes. We survey and analyze several constructions of Barnes–Wall lattices, including subgroup chains, the generalized k-ing construction, and connections with Reed-Muller codes, highlighting their equivalence over both Z[i] and Z. Building on these structural insights, we introduce a new algorithm for efficient ...

Added: July 16, 2026

Tencent и Open Source. Как относится к открытому ПО самый дорогой бренд Китая?

Silakov D., Системный администратор 2026 № 5 С. 46–51

В предыдущей статье про Open Source в КНР [1] мы рассказали про Alibaba – крупную корпорацию, занимающую тридцатое место в рейтинге самых значимых мировых брэндов за 2025 год [2]. Место почетное, но не первое среди китайских компаний – на тринадцатом месте расположилась Tencent, разработчик WeChat и ряда других продуктов, широко используемых нашими восточными соседями. Tencent ...

Added: July 14, 2026

2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

IEEE, 2026.

Added: July 13, 2026

Neural-molecular signatures of insomnia: Insights from signed differential mapping and gene expression analysis

Wang R., He Y., Myachykov A. et al., Neuroimage 2026 Vol. 332 Article 121926

Insomnia disorder (ID) exhibits considerable heterogeneity in neuroimaging findings across studies, and whether functional brain alterations are consistent across resting and task states remains unclear. This study aimed to identify neural dysfunction across states in ID and explore its transcriptomic correlates. ...

Added: July 13, 2026

Integrative profiling of glymphatic dysfunction in adolescent subthreshold depression

Myachykov A., Qiwei G., Ruisi W. et al., Journal of Affective Disorders 2026 Vol. 412 Article 122110

Subthreshold depression (StD) in adolescence is clinically important, but its neurobiological substrates remain unclear. We examined whether adolescents with StD show multimodal MRI alterations related to glymphatic function. ...

Added: July 13, 2026

Mathematical Optimization Theory and Operations Research, 25th International Conference, MOTOR 2026 Irkutsk, Russia, July 6–11, 2026 Proceedings

Switzerland: Springer, 2026.

This volume contains the refereed proceedings of the 25th International Conference on Mathematical Optimization Theory and Operations Research (MOTOR 2026) 1 held during July 6–11 in a picturesque place near Lake Baikal, Irkutsk, Russia. The MOTOR conference is a direct successor and scientific inheritor of several prominent events on mathematical programming, combinatorial and stochastic optimization, ...

Added: July 12, 2026

Proceedings of the International Science Conference “APPLIED RESEARCH. GLOBAL SOLUTIONS” (May 6, 2026). Istanbul. Turkey. Part 2.

Scientific publishing house Infinity, 2026.

Science Conference Proceedings combine materials of the conference – research papers and thesis reports of scientifi c workers. They examine technical, juridical and sociological aspects of research issues. Some articles deal with theoretical and methodological approaches and principles of research questions of personality professionalization. ...

Added: July 12, 2026

Задачи бесконечной регулярной реализуемости

Шиманогов И. Н., Vyalyi M., Дискретный анализ и исследование операций 2025 Т. 32 № 4(166) С. 213–230

A well-studied class of algorithmic problems is that of regular realizability: checking the non-emptiness of the intersection of a regular language with a given language. This problem has a natural algebraic interpretation: verifying whether an element of a Boolean algebra belongs to the kernel of a certain homomorphism. This motivates the consideration of an analogous ...

Added: July 12, 2026

International Academic Conference. Proceedings of the Scientific Forum “Modern Science: Theory and Practice” (April 22, 2026). Belgrade, Serbia. Part 3.

Scientific publishing house Infinity, 2026.

Scientific Forum Proceedings combine materials of the conference – research papers and thesis reports of scientific workers. They examine technical, juridical and sociological aspects of research issues. Some articles deal with theoretical and methodological approaches and principles of research questions of personality professionalization. ...

Added: July 10, 2026

Improving Differential Equation Solving in Compact Language Models via Activation Steering and Reinforcement Learning

Surkov A., Ignatenko V., Koltcov Sergei, Computers, Materials and Continua 2026

Large language models have recently demonstrated promising capabilities in mathematical reasoning; however, their performance on tasks requiring strict symbolic manipulation, such as solving differential equations, remains limited, especially for compact models. In this work, we investigate whether activation steering combined with reinforcement learning can improve the quality of solutions generated by pretrained language models without ...

Added: July 8, 2026

Computational Science and Its Applications – ICCSA 2026 Workshops

Springer, 2027.

The series Lecture Notes in Computer Science (LNCS), including its subseries Lecture Notes in Artificial Intelligence (LNAI) and Lecture Notes in Bioinformatics (LNBI), has established itself as a medium for the publication of new developments in computer science and information technology research, teaching, and education. LNCS enjoys close cooperation with the computer science R & ...

Added: July 8, 2026

Conference Proceedings: 2026 IEEE Ural-Siberian Conference on Biomedical Engineering, Radioelectronics and Information Technology (USBEREIT), 14-15 May 2026

IEEE, 2026.

The purpose of the 2026 IEEE Ural-Siberian Conference on Biomedical Engineering, Radioelectronics and Information Technology (USBEREIT) is to bring together researchers and practitioners from multiple areas of radio science, including biomedical engineering, radioelectronics, microelectronics, information technology, smart energy, information security and others. ...

Added: July 8, 2026

Моделирование специализированных алгоритмов маршрутизации в сетях на кристалле, представленных сериями семейств циркулянтных топологий

Маликов М. А., Монахова Э. А., Rzaev E. et al., Ученые записки Казанского университета. Серия: Физико-математические науки 2026 Т. 168 № 2 С. 269–286

This article examines series of families of two-dimensional circulant networks with rectangular L -shapes, optimal in diameter, as network-on-chip topologies with a minimal number of crossings between the links and a bounded length of the maximum link that does not depend on the network size. New network-on-chip routing algorithms, which use the coordinates of three adjacent zeros in the ...

Added: July 8, 2026

Algorithmic overlaps as thermodynamic variables: From local to cluster Monte Carlo dynamics in critical phenomena

Pilé I., Deng Y., Shchur L., Physical Review B: Condensed Matter and Materials Physics 2026 Vol. 114 No. 1 Article 014101

We investigate the spatial overlap of successive spin configurations in Markov chain Monte Carlo simulations using the local Metropolis algorithm and the Swendsen-Wang and Wolff cluster algorithms. We examine the dynamics of these algorithms for models in different universality classes: Ising model, Potts model with three components, and four-state Potts model. The overlap of two ...

Added: July 6, 2026

Журнал Телекоммуникации №1 за 2026

М.: Наука и технологии, 2026.

«Телекоммуникации» ежемесячный рецензируемый производственный, информационно-аналитический и учебно-методический журнал выходит в свет с июля 2000 г. Для руководителей и работников промышленности, научно-исследовательских и проектно-конструкторских институтов, высших учебных заведений, аспирантов и студентов, а также для специалистов, разрабатывающих, выпускающих и эксплуатирующих средства телекоммуникаций. Новости разработок и производства, прогнозы развития, защита информации, Нормативные, справочные, аналитические и учебно-методические материалы. Переход к глобальному информационному ...

Added: July 4, 2026

"Труды МФТИ" Том 17, № 4 (68) (2025)

МФТИ, 2025.

абота редакции научного журнала «Труды Московского физико-технического института» (кратко «Труды МФТИ»), редакционной коллегии и редакционного совета осуществляется в соответствии с Положением, утвержденным ректором института. В состав редакционной коллегии входят руководители института, факультетов, институтских и факультетских кафедр. Главный редактор журнала —президент МФТИ, член-корр. РАН Кудрявцев Н.Н. Журнал «Труды МФТИ» входит в базу данных РИНЦ (Российский Индекс Научного Цитирования) и доступен в электронной ...

Added: July 4, 2026

Арт-резиденция в России: ролевая диспозиция в (пере)сборке поля современного искусства

Ryabkov Y., Леонтьева А. В., Abramov R., Социология власти 2026 Т. 38 № 2 С. 230–261

This study analyzes art residencies in Russia as institutions that bring together artists, local contexts, and stakeholders. It focuses on three key aspects: strategies for institutional positioning within the art field and in relation to stakeholders; residents' practices regarding the local context; and the nature of the residencies' interactions with local residents. The institutionalization of ...

Added: July 4, 2026

Modulation Recognition for Industrial Internet of Things Communication Signals Under Few-Shot Conditions Based on Attention Mechanism and Relation Network

Hualin M., Jie Z., Jerome Y. et al., Journal of Internet Technology 2026 Vol. 27 No. 3 P. 367–382

In open, interference-prone scenarios, the scarcity of precisely annotated signal samples limits the application of deep learning–based modulation identification, which generally relies on extensive labeled data for stability. Relation Networks, as an emerging class of deep learning models, exhibit rapid convergence in few-shot learning tasks. Motivated by the fast convergence property of relation-based learning and ...

Added: July 3, 2026

Тезисы докладов Пятнадцатых Шмелёвских чтений: (К 100-летию со дня рождения академика Дмитрия Николаевича Шмелева):Жизнь слова: Научное наследие академика Д. Н. Шмелева в контексте современности

М.: Институт русского языка им. В.В. Виноградова РАН, 2026.

Сборник тезисов Пятнадцатых Шмелёвских чтений (К 100-летию со дня рождения академика Дмитрия Николаевича Шмелева) Жизнь слова: Научное наследие академика Д. Н. Шмелева в контексте современности. Охватывает разные аспекты современной русистики: от исторической лексикологии до современных трансформаций прагматики и семантики слов. ...

Added: June 23, 2026