Cross-Domain Limitations of Neural Models on Biomedical Relation Classification

Alimova I.; E. Tutubalina; S. I. Nikolenko

doi:10.1109/ACCESS.2021.3135381

Publications

?

Cross-Domain Limitations of Neural Models on Biomedical Relation Classification

IEEE Access. 2022. Vol. 10. P. 1432–1439.

Alimova I., Tutubalina E., Nikolenko S. I.

Relation extraction (RE) aims to extract relational facts from plain text, which is essential to the biomedical research field with the rapid growth of biomedical literature and generally large volumes of biomedicine-related text coming from various sources. Numerous annotated corpora and state-of-the-art models have been introduced in the past five years. However, there are no general guidelines about evaluating models on these corpora in single- and cross-domain settings with diverse entities and relation types. We aim to fill this gap for the task of detecting whether a relation holds between two biomedical entities given a text span. In this work, we present a fine-grained evaluation intended to perform a comparative evaluation of four biomedical benchmarks and understand the efficiency of state-of-the-art neural architectures based on Long Short-Term Memory (LSTM) with cross-attention and Bidirectional Encoder Representations from Transformers (BERT) for relation extraction across two main domains, namely scientific abstracts and electronic health records. We present a comparative evaluation of biomedical RE datasets, including the PHAEDRA, i2b2/VA, BC5CDR, and MADE corpora. Our evaluation of BioBERT and LSTM for binary classification shows significant divergence in in-domain and out-of-domain performance, finding an average drop in F1-measure of 34.2% for BioBERT. The cross-attention LSTM model developed in this work exhibits better cross-domain performance, with a drop of only 27.6% in F-measure. © 2013 IEEE.

Research target: Computer Science

Language: English

DOI

Keywords: natural language processing bioinformatics relation extraction

Publication based on the results of:

Development of mathematical models and methods for natural language processing, knowledge discovery in data and recommender systems (2022)

Современные проблемы науки

Данилевич Т. В., Yasnitsky L., М.: Юрайт, 2025.

This course examines both the historical aspects of science and current philosophical issues related to its contemporary development and impact on society. It describes the emergence of science and its progressive advancement, its adoption by society, and the introduction and dominance of scientific achievements in all areas of human activity. Particular attention is paid to ...

Added: December 19, 2025

Modeling Pruning as a Phase Transition: A Thermodynamic Analysis of Neural Activations

- Р. М., Koltcov Sergei, Surkov A. et al., Computers, Materials and Continua 2025 P. 1–24

Activation pruning reduces neural network complexity by eliminating low-importance neuron activations, yet identifying the critical pruning threshold—beyond which accuracy rapidly deteriorates—remains computationally expensive and typically requires exhaustive search. We introduce a thermodynamics-inspired framework that treats activation distributions as energy-filtered physical systems and employs the free energy of activations as a principled evaluation metric. Phase-transition–like phenomena ...

Added: December 19, 2025

Распределённые компьютерные и телекоммуникационные сети: управление, вычисление, связь (DCCN-2023)

-, 2023.

В научном электронном издании представлены материалы XXVI Международной научной конференции «Распределенные компьютерные и телекоммуникационные сети: управление, вычисление, связь» по следующим направлениям: - Алгоритмы и протоколы телекоммуникационных сетей - Управление в компьютерных и инфокоммуникационных системах - Анализ производительности, оценка QoS / QoE и эффективность сетей - Аналитическое и имитационное моделирование коммуникационных систем последующих поколений - Эволюция беспроводных сетей в направлении 5G; - Технологии сантиметрового и миллиметрового ...

Added: December 18, 2025

Безопасность России. Правовые, социально-экономические и научно-технические аспекты. Стратегические приоритеты национальной безопасности. Защита традиционных Российских духовно-нравственных ценностей, культуры и исторической памяти как восьмой стратегический приоритет национальной безопасности.

Istratov A., Махутов Н. А., Звягин А. А. et al., АКВАРИУС г.Тула, 2025.

Рассматриваются вопросы формирования и реализации концепций, стратегий и методической базы национальной безопасности, а также научно-методологического обеспечения защиты традиционных российских духовно-нравственных ценностей и культурно-исторической памяти в аспекте достижения целей и решения задач восьмого стратегического приоритета национальной безопасности ...

Added: December 18, 2025

Nonlinear Structure of Super‐Thin Current Sheets With Guide Field: Equilibrium or Dynamic?

Tsareva O., Leonenko M., Grigorenko E. et al., JOURNAL OF GEOPHYSICAL RESEARCH-SPACE PHYSICS 2025 Vol. 130 P. 1–15

1D self‐consistent model of super‐thin current sheet (STCS) based both on a quasi‐adiabatic approach for the demagnetized proton and electron motion is generalized to the case of configuration with nonzero guide field. The part of electron population is supposed to be magnetized (described via guiding center approximation). The magnetic field configuration includes three components: self‐consistent Bx(z) and By(z) components ...

Added: December 18, 2025

Эффективность рынка искусственного интеллекта: ожидания и реальность

Kuzminov Y., Kruchinskaia E., Форсайт 2025 Т. 19 № 4 С. 6–16

The development of Artificial Intelligence (AI) is significantly impacting the global economy, transforming corporate strategies and enhancing operational efficiency. This study aims to analyze the relative efficiency of the Generative AI (GenAI) market, considering the market size of chips, servers, and data center infrastructure required for its operation, and comparing these market sizes with the ...

Added: December 17, 2025

Глубокая нейронная сеть с графовым вниманием для выявления поддельных изображений лица

Pikul A. S., Лепендин А. А., Труды молодых ученых Алтайского государственного университета 2024 № 20 С. 190–193

Представлен новый подход для выявления атак презентации на системы распознавания по лицу. Он основан на использовании механизма графового внимания, применяемого к промежуточным картам характеристик изображений лица, вычисленным сверточной сетью ResNet18. Показано, что предложенный подход позволил добиться высокого качества распознавания поддельных изображений при лицевой биометрической верификации, сравнимого с имеющимися в настоящее время альтернативными решениями. ...

Added: December 12, 2025

Ансамбль современных моделей компьютерного зрения для задачи обнаружения дипфейков

Pikul A. S., Безопасность информационных технологий 2024 Т. 31 № 4 С. 116–127

This article explores the potential use of modern computer vision architectures for the task of deepfake detection. The following architectures are considered: EfficientNet, Vision Transformer (ViT), VisionLSTM (ViL), Vision KAN, and Mamba Vision. The novelty of the approach lies in the application and comparison of these architectures, as well as their combination into paired ensembles ...

Added: December 12, 2025

Enhancing explainability in deepfake detection with graph attention networks

Pikul A. S., Popov I. Y., Безопасность информационных технологий 2025 Vol. 32 P. 73–82

Understanding how artificial intelligence models make decisions is important, especially for difficult tasks like detecting deepfakes, where it's not enough to just get a result – it needs to know why the model made that choice. Many current methods, like Shapley additive explanations (SHAP) and Gradient-weighted Class Activation Mapping (Grad-CAM), help explain these decisions, but ...

Added: December 12, 2025

Российская модель использования ИИ в цифровых экосистемах медиакоммуникационной индустрии

Vartanov S., Tyshetskaya A., Вестник Московского университета. Серия 10: Журналистика 2025 № 5 С. 23–53

Media has been at the forefront of digital transformation in recent years: not only have the methods of creating, selling, storing, and consuming media content and media services changed, but also the structure of the media communication industry (MCI) itself. Considering its new structure and subjectivity, one cannot help but pay attention to artificial intelligence (AI) technologies that manifest ...

Added: December 11, 2025

ComputAgeBench: Epigenetic Aging Clocks Benchmark

Kriukov D., Efimov E., Kuzmina E. et al., ACM Transactions on Knowledge Discovery from Data 2025 Vol. - No. - P. 5560–5570

The success of clinical trials of longevity drugs relies heavily on identifying integrative health and aging biomarkers, such as biological age. Epigenetic aging clocks predict the biological age of individuals using their DNA methylation profiles, commonly retrieved from blood samples. However, there is no standardized methodology to validate and compare epigenetic clock models. We propose ComputAgeBench, ...

Added: December 10, 2025

О базовых математических определениях цифровых технологий и искусственного интеллекта

Semenov A., Доклады Российской академии наук. Математика, информатика, процессы управления (ранее - Доклады Академии Наук. Математика) 2025 Т. 527 № S С. 7–12

The paper proposes a system of definitions for the basic concepts of computability theory that underlie the mathematics of the digital world: algorithm, computability, calculus, object complexity, close to modern undertnding. Hierarchies of the finite and the problem of consistency are considered. ...

Added: December 6, 2025

2025 Systems of Signal Synchronization, Generating and Processing in Telecommunications (SYNCHROINFO)

IEEE, 2025.

The international scientific and engineering conference “Systems of Signal Synchronization, Generating and Processing in Telecommunications” has been held since 1974. For 50 years of work the conference has become a widely known forum for specialists of the field. The papers which are discussed at the conference can be divided into the following chapters: 1) Synchronization Systems and Devices 2) Signal ...

Added: December 6, 2025

Comparative Analysis of Requirements Prioritization Methods for Personalized Nutrition Web Applications

Mozhegova A. S., V.V. Lanin, Proceedings of the Institute for System Programming of the RAS 2025 Vol. 37 No. 5 P. 225–240

This study investigates the application of five requirements prioritization methods – MoSCoW, Kano Model, Weighted Scoring, RICE, and Cost of Delay (CoD) – in the development of a web application for personalized nutrition. The research addresses the challenge of managing limited resources (time, financial, and human) while maximizing user value and ensuring safety in a ...

Added: December 4, 2025

17th International Conference, SCSM 2025, Held as Part of the 27th HCI International Conference, HCII 2025, Gothenburg, Sweden, June 22–27, 2025, Proceedings, Part II. Social Computing and Social Media. LNCS, volume 15787

Fedyanin D., Switzerland: Springer, 2025.

The 17th International Conference on Social Computing and Social Media (SCSM 2025) was an affiliated conference of the HCI International (HCII) conference. It provided an established international forum for the exchange and dissemination of scientific information related to social computing and social media, addressing a broad spectrum of issues expanding our understanding of current and future issues in ...

Added: December 3, 2025

A polynomial-time algorithm recognizing exact cubes of trees

Manuylenko N., Beaudou L., Echeverría H. et al., Procedia Computer Science 2025 Vol. 273 P. 86–93

We prove that the recognition of exact cubes of trees can be done in polynomial time. More precisely, the exact distance power of a graph is a refinement of the more usual notion of graph power. Given a graph G and a positive integer p, the exact distance pth power of G is the graph G#p on the same vertex set where two vertices ...

Added: December 3, 2025

КОМПЕНСАЦИЯ ВЛИЯНИЯ КАПЕЛЬ ДОЖДЯ НА ПРОИЗВОДИТЕЛЬНОСТЬ БЕСПРОВОДНОЙ СИСТЕМЫ С РЕКОНФИГУРИРУЕМОЙ ИНТЕЛЛЕКТУАЛЬНОЙ ПОВЕРХНОСТЬЮ

Тярин А. С., Тронин С. С., Kureev A. et al., Проблемы передачи информации 2025 Т. 61 № 2 С. 83–95

Реконфигурируемая интеллектуальная поверхность (РИП) представляет собой одну из перспективных технологий для повышения пропускной способности и расширения покрытия существующих и будущих беспроводных сетей. Предполагается, что РИП будет активно применяться в сценариях вне помещений, где она будет подвержена влиянию погодных условий, таких как дождь. Дождь, в свою очередь, повлияет на амплитудно- и фазо-частотные характеристики элементарных ячеек (ЭЯ), ...

Added: December 2, 2025

Experimental study of a User-Centric RIS in existing cellular systems

Arseny Poyda, Andrey Tyarin, Kirill Glinskiy et al., Computer Networks 2025 Vol. 263 Article 111219

Reconfigurable Intelligent Surfaces (RISs) are promising for increasing the capacity and coverage of cellular systems by adaptively changing the phase of the reflected signals based on the information about the channel between a Base Station (BS) and a Mobile Station (MS). According to a popular BS-centric approach, the BS controls the RIS via a BS-RIS ...

Added: December 2, 2025

A Unified Framework for Segmentation, Scaling, and Indexing of Contactless Fingerprints

IEEE Transactions on Biometrics, Behavior, and Identity Science 2025 P. 668– 680

Contactless fingerprint identification is gaining prominence due to its convenience, accessibility, security, and hygiene benefits over traditional contact-based methods. However, achieving accurate identification and efficiently searching large-scale databases remain challenging. In this paper, we introduce CFinSegNet, an encoder-decoder network for fingerprint segmentation that operates on preprocessed fingerprints and multiple image representations from diverse color spaces ...

Added: December 2, 2025

Development of Knowledge-based Intelligence for Sustainability Assessment of Russian Regions

Fedoseev D. S., Neroslov A. D., V.V. Lanin, Proceedings of the Institute for System Programming of the RAS 2025 Vol. 37 No. 4-2 P. 207–218

The paper presents the development of a Knowledge-based Intelligence for Sustainability Assessment (KISA) system for the comprehensive assessment of the sustainability of Russian regions, which uses a large language model (LLM) with retrieval-augmented generation (RAG) technology and Rosstat data. KISA automatically selects relevant indicators based on users’ textual queries, determines their weights, and calculates regional ...

Added: December 2, 2025

Utilizing the VirIdAl Pipeline to Search for Viruses in the Metagenomic Data of Bat Samples

Budkina A., Korneenko E., Kotov I. et al., Viruses 2021 No. 10 P. 2006

According to various estimates, only a small percentage of existing viruses have been discovered, naturally much less being represented in the genomic databases. High-throughput sequencing technologies develop rapidly, empowering large-scale screening of various biological samples for the presence of pathogen-associated nucleotide sequences, but many organisms are yet to be attributed specific loci for identification. This ...

Added: September 19, 2025

Transcriptomic Maps of Colorectal Liver Metastasis: Machine Learning of Gene Activation Patterns and Epigenetic Trajectories in Support of Precision Medicine

KUDRYAVTSEVA A., Cancers 2023

Liver metastasis is a significant factor contributing to mortality associated with colorectal cancer. Establishing the biological mechanisms of metastasis is crucial for refining diagnostics and identifying therapeutic windows for interventions. Currently, little is known of the processes that govern the development of liver metastases, the role of the tumor microenvironment, the role of epigenetics, and ...

Added: July 1, 2025

Automatic Morpheme Segmentation for Russian: Can an Algorithm Replace Experts?

Morozov D., Garipov T., Lyashevskaya O. et al., Journal of Language and Education 2024 Vol. 10 No. 4 P. 71–84

Introduction: Numerous algorithms have been proposed for the task of automatic morpheme segmentation of Russian words. Due to the differences in task formulation and datasets utilized, comparing the quality of these algorithms is challenging. It is unclear whether the errors in the models are due to the ineffectiveness of algorithms themselves or to errors and inconsistencies ...

Added: January 7, 2025

Genome-wide association studies of ischemic stroke based on interpretable machine learning

Stefan Nikolić, Ignatov D. I., Khvorykh G. et al., PeerJ Computer Science 2024 Vol. 10 Article e2454

Despite the identification of several dozen genetic loci associated with ischemic stroke (IS), the genetic bases of this disease remain largely unexplored. In this research we present the results of genome-wide association studies (GWAS) based on classical statistical testing and machine learning algorithms (logistic regression, gradient boosting on decision trees, and tabular deep learning model ...

Added: December 11, 2024