Bimodal Cross-Validation Approach for Recommender Systems Diagnostics

D. I. Ignatov; J. Poelmans

doi:10.4018/978-1-4666-1900-5.ch008

Publications

?

Bimodal Cross-Validation Approach for Recommender Systems Diagnostics

Ch. 8. P. 185–195.

Ignatov D. I., Poelmans J.

Recommender systems are becoming an inseparable part of many modern Internet web sites and web shops. The quality of recommendations made may significantly influence the browsing experience of the user and revenues made by web site owners. Developers can choose between a variety of recommender algorithms; unfortunately no general scheme exists for evaluation of their recall and precision. In this chapter, the authors propose a method based on cross-validation for diagnosing the strengths and weaknesses of recommender algorithms. The method not only splits initial data into a training and test subsets, but also splits the attribute set into a hidden and visible part. Experiments were performed on a user-based and item-based recommender algorithm. These algorithms were applied to the MovieLens dataset, and the authors found classical user-based methods perform better in terms of recall and precision.

Keywords: машинное обучение информационный поиск information retrieval рекомендательные системы machine learning Recommender Systems

In book

Diagnostic Test Approaches to Machine Learning and Commonsense Reasoning Systems

Naidenova X., Ignatov D. I. Hershey: IGI Global, 2012.

Three Algorithms for Merging Hierarchical Navigable Small World Graphs

Ponomarenko A., / Series Computer Science "arxiv.org". 2025.

This paper addresses the challenge of merging hierarchical navigable small world (HNSW) graphs, a critical operation for distributed systems, incremental indexing, and database compaction. We propose three algorithms for this task: Naive Graph Merge (NGM), Intra Graph Traversal Merge (IGTM), and Cross Graph Traversal Merge (CGTM). These algorithms differ in their approach to vertex selection ...

Added: July 30, 2026

Сравнение методов автоматической разметки речевых формул в русскоязычном интернет-дискурсе: пилотное исследование

Попова Т. И., Масленикова А. С., В кн.: Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог». Выпуск 24.Issue 24.: M.: Max press, 2026. С. 420–429.

This study focuses on developing and comparing methods for automatic annotation of speech formulas in a corpus of Russian internet comments. Speech formulas are a class of multiword expressions that convey emotional reactions in dialogue. The research material consisted of a corpus of 10,000 comments (157,261 tokens) collected from five Telegram channels. Dictionary-based formal search ...

Added: June 29, 2026

The Use of the Missing Sample Simulation Modeling to Create a Classification Model for Three or More Classes by the Example of the Carbohydrate Metabolism Disorder Degree Detection Problem

Новиков Р. С., Novopashin M., Pozin B., Programming and Computer Software 2026 Vol. 52 No. 1 P. 28 – 38

Added: June 26, 2026

К ранжированию значимости факторов дестабилизации в странах Азии и Африки методами машинного обучения

Korotayev A., Chernomorchenko I., Медведев И. А., Восток. Афро-азиатские общества: история и современность 2026 № 3 С. 117–130

This study employs machine learning methods to rank factors contributing to large-scale armed and unarmed destabilization across Asian and African countries. Analysis reveals that African nations demonstrate greater vulnerability to armed destabilization (up to full-scale civil wars), whereas Asian countries are more prone to less violent unarmed forms (mass antigovernment demonstrations, riots, general strikes and ...

Added: June 21, 2026

Pre-trained LLMs Meet Sequential Recommenders: Efficient User-Centric Knowledge Distillation

Severin N., Kartushov D., Urzhumov V. et al., , in: Advances in Information Retrieval: 48th European Conference on Information Retrieval, ECIR 2026, Delft, The Netherlands, March 29 – April 2, 2026, Proceedings, Part II. (LNCS, volume 16484).: Cham: Springer Publishing Company, 2026. P. 508–517.

Sequential recommender systems have achieved significant success in modeling temporal user behavior but remain limited in cap-turing rich user semantics beyond interaction patterns. Large Language Models (LLMs) present opportunities to enhance user understanding with their reasoning capabilities, yet existing integration approaches cre-ate prohibitive inference costs in real time. To address these limitations, we present a ...

Added: June 18, 2026

Advances in Information Retrieval: 48th European Conference on Information Retrieval, ECIR 2026, Delft, The Netherlands, March 29 – April 2, 2026, Proceedings, Part II. (LNCS, volume 16484)

Cham: Springer Publishing Company, 2026.

The four-volume set LNCS 16483-16486 constitutes the refereed conference proceedings of the 48th European Conference on Information Retrieval, ECIR 2026, held in Delft, The Netherlands, during March 29–April 2, 2026. The 46 full papers and 37 short papers presented together with 10 findings papers, 9 reproducibility papers, 17 resource papers, 11 workshop papers, 7 tutorial papers, ...

Added: June 18, 2026

Artificial intelligence and digital twins for failure prediction in data center cooling systems: a comprehensive literature review (2018–2026)

Butorova A., Bobakov V., Sergeev A. et al., European Physical Journal: Special Topics 2026 P. 1–19

This paper presents a review of artificial intelligence (AI) methods for failure prediction in data center cooling systems, with a focus on the integration of digital twins (DTs), physics-informed learning, and graph-based models. Positioned within complex network science, this review addresses a limitation of conventional graph approaches—their reliance on pairwise connectivity—whereas real-world failures often arise ...

Added: June 10, 2026

Влияние шизофрении на лексический уровень языка

Untila K., Tasenko O., В кн.: Современная лингвистика: ключ к диалогу. Труды и материалы IV Казанского международного лингвистического саммита.Т. 1: СОВРЕМЕННАЯ ЛИНГВИСТИКА: КЛЮЧ К ДИАЛОГУ.: Каз.: Издательство Казанского университета, 2024. С. 221–224.

Шизофрения – это хроническое психическое расстройство, которое выражается как комбинация психотических симптомов – таких как галлюцинации, бред и дезорганизация когнитивных функций. У многих пациентов с диагнозом шизофрения обнаруживаются нарушения речи. Для исследования были отобраны рассказы об истории из жизни из корпуса 3D. В качестве личных историй были собраны ответы на вопросы «Какой самый лучший или запоминающийся ...

Added: June 8, 2026

Proceedings of the 43rd International Conference on Machine Learning (ICML 2026)

Seul: PMLR, 2026.

Added: June 4, 2026

От неизвестности к прозрачности: обзор технологий объяснимого ИИ (XAI)

Avdoshin S. M., Pesotskaya E. Y., Информационные технологии 2026 Т. 32 № 4 С. 185–194

With the rapid advancement of artificial intelligence, and deep learning in particular, models have emerged that are capable of delivering highly accurate predictions. However, the internal logic of such models remains difficult to interpret—an issue of critical importance, especially in domains where the correctness of an algorithm directly affects high-stakes decision-making. One promising avenue for ...

Added: May 8, 2026

Современные методы анализа временных рядов в мониторинге и прогнозировании состояния оборудования для механизированной добычи

Neznanov A., Glushko A., Овчинников С. et al., В кн.: Интеллектуальный анализ данных в нефтегазовой отрасли.: М.: ООО «Геомодель Развитие», 2024. С. 140–143.

With the development of monitoring systems, now we have the opportunity to collect key performance indicators of devices in the process of artificial lift. Every day a huge amount of telemetry is generated by our devices, which can be used to forecast the working mode and health state of the equipment after the process of ...

Added: April 29, 2026

Machine Learning Approach to Anticancer Activity Prediction of Transition-Metal Complexes Based on a Large-Scale Experimental Database

Krasnov L., Malikov D., Kiseleva M. et al., Journal of Medicinal Chemistry 2026 Vol. 69 No. 8 P. 8838–8851

In this work, we developed a straightforward data-driven approach to predict the cytotoxicity of metal complexes based entirely on their (metal + ligands) composition. To this end, we have manually curated MetalCytoToxDB─a comprehensive experimental database comprising 26,500 IC50 values for 7050 metal complexes against 754 cell lines from 1921 articles. Based on these, machine learning ...

Added: April 23, 2026

LSTM-модель потребления тепловой энергии в многоэтажном жилом здании

Ершов И. А., Системная инженерия и инфокоммуникации 2025 № 4 С. 11–14

The heat consumption of residential buildings is a stochastic series. It is necessary for the design of thermal energy regulators the creation of a neural network model. In the paper, the model is carried out based on Long Short-Term Memory (LSTM). The high accuracy of reproducing the series was achieved by training the model on ...

Added: April 22, 2026

Алгоритм анализа новостной информации для принятия экономических решений

Чудинова О. С., Первицкая Л. А., Ramenskaya A., Индустриальная экономика 2026 № 1 С. 65–78

This article is devoted to the development of an algorithm for analyzing news information using machine learning methods implemented in Python libraries. The choice of tools used at each stage of the algorithm is justified by calculating metrics for the quality of the solution to the corresponding machine learning problems. The algorithm’s results are presented ...

Added: April 20, 2026

Modeling cosolvent effects on solubility in supercritical CO2 using data-driven approaches

Makarov D. M., Kalikin N., Gurikov P. et al., Journal of Supercritical Fluids 2026 Vol. 235 Article 106979

Supercritical CO2 (scCO2 ) is an environmentally friendly solvent, but its low polarity limits the solubility of polar compounds. Cosolvents are commonly used to enhance solvation capability, yet comprehensive datadriven studies are scarce. We compiled the largest dataset to date — 4401 experimental solubility records with 22 cosolvents for 93 nonionic solutes, plus 4855 records ...

Added: April 19, 2026

Эффективность применения прогнозов волатильности в активных торговых стратегиях институциональных инвесторов на российском рынке акций

Lysenok N., Фундаментальная и прикладная математика 2026 Т. 26 № 3 С. 33–42

This study examines the impact of realized volatility forecasts on the performance of active trading strategies in the Russian equity market. Using a sample of 17 liquid stocks over the period 2014–2026, a hybrid forecasting model is developed that combines HAR-J with gradient boosting; its superiority over the baseline HAR-J specification is confirmed by the ...

Added: April 17, 2026

Особые экономические зоны Российской Федерации: моделирование решений потенциальных резидентов и процесса их генерации

Plesovskikh A., Journal of Applied Economic Research 2023 Т. 22 № 2 С. 323–354

Modern studies widely discuss the role of special economic zones in stimulating the economic growth and development of Russia, generating the necessary investment flows and increasing the country's innovative potential by expanding production in high-tech sectors of the economy with high added value. The purpose of the study is to model the process of generating ...

Added: April 13, 2026

Опыт генерации оценок эмоциональной валентности и возбуждения слов на основе символьно-уровневой CNN

Lyusin D., Валуева Е. А., Sysoeva T., В кн.: Психология познания: Материалы Всероссийской научной конференции, ЯрГУ, Институт психологии РАН, 5–6 декабря 2025 г.: Институт психологии РАН, 2026. С. 310–314.

Эмоциональная окраска слов широко используются в различных академических и прикладных исследованиях, от анализа текстов до понимания когнитивных процессов. Актуальной задачей является создание объёмных датасетов с оценками слов по ряду эмоциональных параметров. Современные методы машинного обучения, основанные на семантической близости слов, извлекаемой из текстовых корпусов, демонстрируют высокие корреляции с человеческими оценками, однако иногда наблюдаются существенные расхождения. ...

Added: April 10, 2026