Ad Astra or Astray: Exploring Linguistic Knowledge of Multilingual BERT through NLI Task

Natural Language Engineering. 2022. P. 1–30.

Tikhonova M., Mikhailov V., Pisarevskaya D., Malykh V., Shavrina T.

Recent research has reported that standard fine-tuning approaches can be unstable due to being prone to various sources of randomness, including but not limited to weight initialization, training data order, and hardware. Such brittleness can lead to different evaluation results, prediction confidences, and generalization inconsistency of the same models independently fine-tuned under the same experimental setup. Our paper explores this problem in natural language inference, a common task in benchmarking practices, and extends the ongoing research to the multilingual setting. We propose six novel textual entailment and broad-coverage diagnostic datasets for French, German, and Swedish. Our key findings are that the mBERT model demonstrates fine-tuning instability for categories that involve lexical semantics, logic, and predicate-argument structure and struggles to learn monotonicity, negation, numeracy, and symmetry. We also observe that using extra training data only in English can enhance the generalization performance and fine-tuning stability, which we attribute to the cross-lingual transfer capabilities. However, the ratio of particular features in the additional training data might rather hurt the performance for model instances. We are publicly releasing the datasets, hoping to foster the diagnostic investigation of LMs in a cross-lingual scenario, particularly in terms of benchmarking, which might promote a more holistic understanding of multilingualism in LMs and cross-lingual knowledge transfer.

Research target: Computer Science

Language: English


The Linux Operating System. Software Distribution

Silakov D., Юрайт, 2025.

The course examines the Linux operating system as a platform for developing, building, and distributing software. It covers both classic approaches to application delivery using packages and modern alternatives based on containers. An interactive combination of theory, quizzes, and practical assignments provides effective and engaging immersion in the learning process both for students and for ...

Added: February 15, 2026

Software Code Quality. Take Care of the Long Life of Your Software Products

Silakov D., Системный администратор 2025 No. 10 P. 42–47

The notion of "software product quality" covers not only the completeness and correctness of the required functionality but also how easy the program is to maintain and modify. How can you protect yourself and your colleagues from the nightmare of maintaining unreadable code? ...

Added: February 15, 2026

Artificial Intelligence in Solving Pressing Social and Economic Problems of the 21st Century: Proceedings of the Tenth All-Russian Scientific and Practical Conference with International Participation

Yasnitsky L., Plotnikova E. G., Radionova M. V. et al., Пермский государственный национальный исследовательский университет, 2025.

Presents the proceedings of the Tenth All-Russian Scientific and Practical Conference with International Participation "Artificial Intelligence in Solving Pressing Social and Economic Problems of the 21st Century", held on October 9–10, 2025 in Perm at ПГНИУ. The collection is intended for researchers and educators, lecturers, postgraduate, master's, and undergraduate students, and anyone interested in or working on the development and application of artificial intelligence methods. ...

Added: February 15, 2026

Special Issue Sensing Technology for Smart Cities: Data, Analytics, and Visualizations

Kharlamov A. A., Pilgun M., [s.n.], 2024.

The analysis of large volumes of data collected from heterogeneous sources is increasingly important for the development of megacities, the advancement of smart city technologies, and ensuring a high quality of life for citizens. This study aimed to develop algorithms for analyzing and interpreting social media data to assess citizens’ opinions in real time and ...

Added: February 15, 2026

Software Tools for Developing Measures to Reduce Defects in Serial Production

Yasnitsky L., Голдобин М. А., Мезенцев А. С., Прикладная математика и вопросы управления 2025 No. 2 P. 99–116

An overview is given of modern methods, and of the software tools based on them, used for mathematical modeling of serial production processes in order to reduce defects and improve product quality. It lists groups of studies aimed at detecting and classifying defects, studies that address predicting defect formation and determining the significance of parameters, studies aimed at finding the optimal combination of manufacturing process parameters, ...

Added: February 15, 2026

Information Systems Life Cycle Management

Zaramenskikh E., Moscow: Образовательная платформа Юрайт, 2025.

The course covers the history and current state of information systems, as well as every stage of their life cycle, from the preparatory stage to disposal. It examines in detail the theory and practice of information systems life cycle management, a wide range of methodologies for structural analysis and business process modeling, classic and agile information systems development processes and the software tools designed for them, as well as ...

Added: February 15, 2026

Total conditional complexity of certain objects

Vereshchagin N., Information and Computation 2026 Vol. 308 P. 1–12

The fine approach to measure information dependence is based on the total conditional complexity CT(y|x), which is defined as the minimal length of a total program that outputs y on the input x. It is known that the total conditional complexity can be much larger than the plain conditional complexity. Such strings x, y are defined ...

Added: February 14, 2026

Diffusion models for synthetic tabular data generation

Hushchyn M., Telesheva E., Doklady Mathematics 2025 No. 527 P. 388–399

The problem of generating high-quality synthetic data is crucial for many data science tasks. A generated dataset can cut the costs on the augmentation of the existing data with additional instances, for example, in physics, or help with its privacy protection, for instance, in banking. However, generating a tabular dataset is challenging, as the data ...

Added: February 12, 2026

Real-Bogus Classification for ZTF Data Releases: Two Approaches

Semenikhin n., Kornilov M., Lavrukhina A. et al., Communications in Computer and Information Science 2026 Vol. 2641 P. 211–219

We considered two fundamentally different approaches to real-bogus classification within the Zwicky Transient Facility survey data. The first approach is based on neural networks that take sequences of object images as input. The second approach uses features extracted from light curves and classical machine learning methods. Several models for both approaches were tested. Quality metrics ...

Added: February 12, 2026

Problems of the Reliability of User Ratings and Reviews on Marketplaces: A Systems Approach

Полежаева Я. В., Popov V., Бизнес-информатика 2025 Vol. 19 No. 24 P. 26–41

User ratings and reviews on marketplaces are subject to systematic distortions, creating serious risks for e-commerce participants and reducing the efficiency of market mechanisms. This study presents a comprehensive analysis of information distortion problems, covering the process from rating formation to its systematic accounting. The aim of the work is to systematize factors of information distortion on marketplaces and ...

Added: February 11, 2026

Development of a Language Model for Automated Classification of English-Language Scientific Articles by SRSTI Codes

Zunin V., Afonin A. I., Anoshin V. I. et al., Automatic Documentation and Mathematical Linguistics 2025 Vol. 5 No. 59 P. 287–293

The development of an artificial intelligence-based language model for classifying English-language scientific articles by SRSTI codes is described. This improves the processes of reviewing and indexing scientific publications. A pre-processed dataset of scientific articles was used for training and testing the models. An architecture for cascade classification was developed, and the performance of models with ...

Added: February 11, 2026

Generation of Synthesizable Verilog Code From Natural Language Specifications

Yashchenko D. S., Romanov A., Ziazetdinov A.A. et al., IEEE Access 2026 Vol. 14 P. 4990–5001

This study presents a method for generating synthesizable Verilog code for digital integrated circuits directly from natural-language specifications. The approach combines large language models with parameter-efficient fine-tuning (specifically, Low-Rank Adaptation and Quantized Low-Rank Adaptation) together with a specialized corpus of specification-code pairs that covers common design patterns and varying task complexity. The pipeline includes automated ...

Added: February 11, 2026

Application of MIMO technology in wideband millimeter range wireless communications systems

Tiraspolsky S.A., Ermolayev V. T., Flaksman A. G. et al., Radioelectronics and Communications Systems 2011 Vol. 54 P. 219–226

A concept of using MIMO technology in millimeter range wireless communications systems with orthogonal frequency division multiplexing is considered. The concept is based on dividing transmitting and receiving multi-element antenna arrays into separate sub-arrays with analogue radiation pattern shaping and on using two most powerful space sub-channels for information transmission. Sequence and structure of transmitted ...

Added: February 10, 2026

mmWave SVD-based beamformed MIMO communication systems

Tiraspolsky S., Jeon B., Kim J. et al., Proceedings of the 7th IEEE conference on Consumer communications and networking (CCNC’2010) 2010 P. 834–838

This paper presents a concept of a data transmission protocol for millimeter wave (mmWave) wireless systems operating in a Non-Line-of-Sight environment. The concept is designed to provide effective and practical operation of the Multiple-Input Multiple-Output (MIMO) transmission mode, which exploits a combination of Singular Value Decomposition (SVD) of the channel matrix and non-adaptive beamforming. The proposed protocol reduces complexity of ...

Added: February 10, 2026

Selective interference cancellation using Kalman filtering

Tiraspolsky S., Rubtsov A., Pudeyev A. et al., Proceedings of the 2006 3rd International Symposium on Wireless Communication Systems, IEEE 2006 P. 21–24

In the present paper we investigate a co-channel interference cancellation technique based on tracking only a limited number of the strongest interferers. Under the assumption of synchronous base station operation with overlapping but different training signals (pilots), Kalman filtering may be used for estimating the interfering channels and then calculating the interference correlation matrix. This correlation ...

Added: February 10, 2026

Mobile WiMAX - Deployment Scenarios Performance Analysis

Tiraspolsky S., Maltsev A., Rubtsov A. et al., Proceedings of the 2006 3rd International Symposium on Wireless Communication Systems, IEEE 2006 P. 353–357

In this paper, dynamic system level simulation methodology of mobile WiMAX (IEEE Std 802.16e) is described. The system level simulations scenarios (channel models, pathloss and shadow fading, sectorization, frequency reuse planning, system loading, etc) will be introduced. Evaluated performance of mobile WiMAX system such as signal-to-interference + noise ratio distributions, spectral efficiency and system outage ...

Added: February 10, 2026

Effectiveness of the Grassmannian Beamforming Scheme in MIMO Communication Systems

Тираспольский С.А., Червяков А. В., Труды Научной конференции по радиофизике, ННГУ, 2004 2004 P. 169–171

Beamforming (BF) in MIMO (multiple-input multiple-output) systems, which simultaneously use several transceivers at both ends of the communication link, is a fairly simple way to increase throughput and raise the SNR at the receiving end. Most previously proposed methods required the transmitter to know the channel matrix or part of its SVD decomposition, which places a significant load on ...

Added: February 10, 2026

High-resolution capability of adaptive antenna arrays for communication systems

Tiraspolsky S. A., Serebryakov G. V., Журнал радиоэлектроники 2002 No. 7

In this paper we investigate comparison methods of different geometric configurations of adaptive antenna arrays for communications on purpose to estimate directions-of-arrival (DOA) of several external signals. The investigated antenna configurations have four elements and eleven wavelengths array size. The best high-resolution algorithm and the best array configuration are defined by numerical simulations. ...

Added: February 10, 2026

Application of Adaptive Antenna Arrays to Increase the Data Transmission Rate

С.А.Тираспольский, Ермолаев В. Т., Флаксман А. Г. et al., Труды Научной конференции по радиофизике, ННГУ, 2002 2002 P. 22–28

This paper considers the principle of information transmission, theoretically investigates the capacity of a MIMO system over a random radio propagation channel, and discusses various algorithms for distributing transmitter power across parallel orthogonal spatial subchannels. ...

Added: February 10, 2026

Multiple adaptive recursive array for multipath environment

Tiraspolsky S., Sellone F., Serebryakov G., Proceedings of the International Conference on Electromagnetics in Advanced Applications (ICEAA 01) 2001 P. 691–696

In a wireless communication system, signals sent into the channel interact with the environment in a very complex way. Thereby transmitted signals may be subject to many forms of degradation among which there are causes of multipath propagation: • Reflections due to obstacles with the size greater than a wavelength; • Refractions due to the ...

Added: February 10, 2026

Efficiency of Linear Signal Processing in Communication Systems over a Multipath Ionospheric Channel in the Decameter Band

Тираспольский С.А., Флаксман А. Г., Ермолаев В. Т. et al., Известия высших учебных заведений. Радиоэлектроника 2016 No. 1 P. 8–14

Decameter-band communication systems operating over a multipath ionospheric spatial channel are considered. Using physical-layer simulation, the main system characteristics (bit and block error probability, throughput) are studied. It is shown that over a frequency-selective channel in a 3 kHz band, a linear equalization algorithm suppresses intersymbol interference highly effectively for all data rates except the highest. ...

Added: February 10, 2026

UVIP: Model-Free Approach to Evaluate Reinforcement Learning Algorithms

Belomestny D., Levin I., Naumov A. et al., Journal of Optimization Theory and Applications 2026 Vol. 208 Article 89

Policy evaluation is an important instrument for the comparison of different algorithms in Reinforcement Learning (RL). However, even a precise knowledge of the value function Vπ corresponding to a policy π does not provide reliable information on how far the policy π is from the optimal one. We present a novel model-free upper value iteration ...

Added: February 10, 2026

Fundamentals of Computer Graphics

Korolev D., St. Petersburg: Лань, 2026.

The textbook consists of four sections covering physical foundations, analog-to-digital conversion of graphics, graphics and video compression, and devices for input and output of graphic information; the book follows the structure and content of the theoretical part of the course. The main approach is to systematize school-level knowledge and build a coherent picture of working with graphics and video "from the inside". Various examples show elegant engineering solutions in ...

Added: February 7, 2026

Functional models of elementary discursive units in Russian eSports commentary

Микулинский А. Д., in: Синергия языков и культур 2022: междисциплинарные исследования. St. Petersburg, 2023. P. 335–351.

The paper is devoted to modeling the local structure of the eSports commentary spoken genre, using the Dota 2 computer discipline as an example. ESports commentary is spontaneous and creative speech aimed at describing what is happening on the computer-gaming field. The main factors that force us to study it ...

Added: May 12, 2024