Multi-label Image Set Recognition in Visually-Aware Recommender Systems

Demochkin K., Savchenko A.

doi:10.1007/978-3-030-37334-4_26

Ch. 26. P. 291–297.

In this paper we focus on the problem of multi-label image recognition for visually-aware recommender systems. We propose a two-stage approach. First, a deep convolutional neural network is fine-tuned on a part of the training set. Second, an attention-based aggregation network is trained to compute the weighted average of visual features in an input image set. Our approach is implemented as a mobile fashion recommender system application. Experiments on the Amazon Fashion dataset show that our approach achieves an F1-measure of 0.58 for 15 recommendations, more than twice the 0.25 F1-measure obtained with conventional averaging of feature vectors.
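The difference between conventional averaging and attention-based aggregation of an image set can be illustrated with a minimal NumPy sketch. It is not the authors' network: the single `query` vector, the dot-product scoring, and the toy array sizes are assumptions made for illustration, and in the paper the attention parameters are trained rather than sampled at random.

```python
import numpy as np


def mean_pool(features: np.ndarray) -> np.ndarray:
    """Conventional aggregation: unweighted average of the feature vectors."""
    return features.mean(axis=0)


def attention_pool(features: np.ndarray, query: np.ndarray) -> np.ndarray:
    """Attention-style aggregation: softmax-weighted average of the feature vectors.

    `query` is a hypothetical stand-in for learned attention parameters;
    the paper's aggregation network may be parameterized differently.
    """
    scores = features @ query            # one relevance score per image
    scores = scores - scores.max()       # shift scores for a numerically stable softmax
    weights = np.exp(scores) / np.exp(scores).sum()
    return weights @ features            # weighted average, shape (dim,)


rng = np.random.default_rng(0)
features = rng.normal(size=(5, 8))  # 5 images, 8-dimensional CNN features (toy sizes)
query = rng.normal(size=8)          # assumed: would be learned in the second stage

set_descriptor = attention_pool(features, query)
```

With an all-zero `query` every image receives the same weight, so `attention_pool` reduces to `mean_pool`. Training the attention parameters lets the more informative images in the set dominate the resulting descriptor.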

Keywords: mobile technologies; recommender systems; deep convolutional neural networks; convolutional neural networks; fashion recommendation; image set recognition

Publication based on the results of:

Efficient methods of multimedia data recognition for analyzing the preferences of mobile device users (2019)

In book

Analysis of Images, Social Networks and Texts. 8th International Conference, AIST 2019, Lecture Notes in Computer Science, Revised Selected Papers

Vol. 11832. Cham: Springer, 2019.

Pre-trained LLMs Meet Sequential Recommenders: Efficient User-Centric Knowledge Distillation

Severin N., Kartushov D., Urzhumov V. et al., in: Advances in Information Retrieval: 48th European Conference on Information Retrieval, ECIR 2026, Delft, The Netherlands, March 29 – April 2, 2026, Proceedings, Part II (LNCS, volume 16484). Cham: Springer Publishing Company, 2026. P. 508–517.

Sequential recommender systems have achieved significant success in modeling temporal user behavior but remain limited in capturing rich user semantics beyond interaction patterns. Large Language Models (LLMs) present opportunities to enhance user understanding with their reasoning capabilities, yet existing integration approaches create prohibitive inference costs in real time. To address these limitations, we present a ...

Added: June 18, 2026

Teaching emotion recognition with the mobile application «ТРОПЭМО»

Shadrina E. V., Мохова В. О., Загоскин В. А. et al., Нижегородский психологический альманах 2024 № 2

The article considers the problem of learning to recognize emotions from pictures. Domestic and foreign research on emotional intelligence was reviewed and analyzed. Its formation, its influence on human activity, and existing variants of its structure were considered, along with common features in the understanding of emotional ...

Added: April 9, 2026

Efficient Incorporation of New Interactions in Graph Recommenders via Folding-In

Yusupov V., Sukhorukov N., Frolov E., User Modelling and User-Adapted Interaction 2026 Vol. 36 Article 2

Graph-based recommender systems have emerged as a powerful paradigm for personalized recommendations. However, their reliance on full model retraining to incorporate new users or new interactions creates scalability barriers. The task becomes infeasible in real-life recommender systems due to excessive time and resource costs involved. To address this limitation, we propose a fast and efficient ...

Added: March 15, 2026

An Analysis of Sequential Patterns in Datasets for Evaluation of Sequential Recommendations

Klenitskiy A., Anna Volodkevich, Pembek A. et al., ACM Transactions on Recommender Systems 2026

Sequential recommender systems are an important and in-demand area of research. These systems aim to use the order of interactions in a user’s history to predict future interactions. The premise is that the order of interactions and sequential patterns play an essential role. Therefore, it is crucial to use datasets that exhibit a sequential structure ...

Added: January 28, 2026

Autoregressive generation strategies for Top-K sequential recommendations

Anna Volodkevich, Danil Gusak, Klenitskiy A. et al., User Modelling and User-Adapted Interaction 2025 Vol. 35 Article 13

The goal of modern sequential recommender systems is often formulated in terms of next-item prediction. In this paper, we explore the applicability of transformer-based generative models for the Top-K sequential recommendation task, where the goal is to predict items that a user is likely to interact with in the “near future.” This goal aligns with ...

Added: January 26, 2026

Encode Me If You Can: Learning Universal User Representations via Event Sequence Autoencoding

Klenitskiy A., Fatkulin A., Denisova D. et al., in: RecSysChallenge '25: Proceedings of the Recommender Systems Challenge 2025. Association for Computing Machinery (ACM), 2025. P. 26–30.

Building universal user representations that capture the essential aspects of user behavior is a crucial task for modern machine learning systems. In real-world applications, a user’s historical interactions often serve as the foundation for solving a wide range of predictive tasks, such as churn prediction, recommendations, or lifetime value estimation. Using a task-independent user representation ...

Added: January 26, 2026

Benefiting from Negative yet Informative Feedback by Contrasting Opposing Sequential Patterns

Ivanova V., Frolov E., Vasilev A., in: RecSys '25: Proceedings of the Nineteenth ACM Conference on Recommender Systems. ACM, 2025. P. 1142–1147.

We consider the task of learning from both positive and negative feedback in a sequential recommendation scenario, as both types of feedback are often present in user interactions. Meanwhile, conventional sequential learning models usually focus on considering and predicting positive interactions, ignoring that reducing items with negative feedback in recommendations improves user satisfaction with the ...

Added: January 26, 2026

Let It Go? Not Quite: Addressing Item Cold Start in Sequential Recommendations with Content-Based Initialization

Pembek A., Fatkulin A., Klenitskiy A. et al., in: RecSys '25: Proceedings of the Nineteenth ACM Conference on Recommender Systems. ACM, 2025. P. 626–631.

Many sequential recommender systems suffer from the cold start problem, where items with few or no interactions cannot be effectively used by the model due to the absence of a trained embedding. Content-based approaches, which leverage item metadata, are commonly used in such scenarios. One possible way is to use embeddings derived from content features ...

Added: January 26, 2026

A method for improving the detection of presentation attacks on a face recognition biometric system using a convolutional network with an attention mechanism

Pikul A. S., in: Almanac of Scientific Works of Young Scientists of ITMO University. Proceedings of the Fifty-Third (LIII) Scientific and Educational-Methodical Conference, Vol. 1. St. Petersburg: ITMO University, 2024. P. 338–342.

A new approach is proposed for improving the recognition of presentation attacks on a face recognition biometric system using a convolutional network with an attention mechanism. The central hypothesis tested was that an attention mechanism can improve the results of the original convolutional neural network. The experiments confirmed the hypothesis. The largest gain in quality was achieved on the dataset ...

Added: December 13, 2025

A deep neural network with graph attention for detecting fake face images

Pikul A. S., Лепендин А. А., Труды молодых ученых Алтайского государственного университета 2023 No. 20 P. 190–193

A new approach is presented for detecting presentation attacks on face recognition systems. It is based on a graph attention mechanism applied to intermediate feature maps of face images computed by the ResNet18 convolutional network. The proposed approach is shown to achieve high quality in recognizing fake images in facial biometric verification, comparable to currently available alternative solutions. ...

Added: December 12, 2025

An ensemble of modern computer vision models for the deepfake detection task

Pikul A. S., Безопасность информационных технологий 2024 Vol. 31 No. 4 P. 116–127

This article explores the potential use of modern computer vision architectures for the task of deepfake detection. The following architectures are considered: EfficientNet, Vision Transformer (ViT), VisionLSTM (ViL), Vision KAN, and Mamba Vision. The novelty of the approach lies in the application and comparison of these architectures, as well as their combination into paired ensembles ...

Added: December 12, 2025

Scaling Recommender Transformers to One Billion Parameters

Khrylchenko K., in: 32nd SIGKDD Conference on Knowledge Discovery and Data Mining. Vol. 1. Association for Computing Machinery (ACM), 2026. P. 1–10.

While large transformer models have been successfully used in many real-world applications such as natural language processing, computer vision, and speech processing, scaling transformers for recommender systems remains a challenging problem. Recently, the Generative Recommenders framework was proposed to scale beyond typical Deep Learning Recommendation Models (DLRMs). Reformulating recommendation as a sequential transduction task led to ...

Added: November 25, 2025

32nd SIGKDD Conference on Knowledge Discovery and Data Mining

Association for Computing Machinery (ACM), 2026.

KDD is the premier Data Science and AI conference, hosting both a Research and an Applied Data Science Track. The conference will take place from August 9 to 13, 2026, in Jeju, Korea. ...

Added: November 25, 2025

Blending Sequential Embeddings, Graphs, and Engineered Features: 4th Place Solution in RecSys Challenge 2025

Makeev S., Andreev A., Baikalov V. et al., in: RecSysChallenge '25: Proceedings of the Recommender Systems Challenge 2025. Association for Computing Machinery (ACM), 2025. P. 21–25.

This paper describes the 4th-place solution by team ambitious for the RecSys Challenge 2025, organized by Synerise and ACM RecSys, which focused on universal behavioral modeling. The challenge objective was to generate user embeddings effective across six diverse downstream tasks. Our solution integrates (1) a sequential encoder to capture the temporal evolution of user interests, (2) a ...

Added: November 19, 2025

RecSysChallenge '25: Proceedings of the Recommender Systems Challenge 2025

Association for Computing Machinery (ACM), 2025.

Added: November 19, 2025

Correcting the LogQ Correction: Revisiting Sampled Softmax for Large-Scale Retrieval

Khrylchenko K., Baikalov V., Makeev S. et al., in: RecSys '25: Proceedings of the Nineteenth ACM Conference on Recommender Systems. ACM, 2025. P. 545–550.

Added: November 19, 2025

Ultra Fast Warm Start Solution for Graph Recommendations

Yusupov V., Rakhuba M., Frolov E., in: CIKM '25: Proceedings of the 34th ACM International Conference on Information and Knowledge Management. ACM, 2025. Ch. 1 P. 5469–5473.

In this work, we present a fast and effective linear approach for updating recommendations in the scalable graph-based recommender system UltraGCN. Solving this task is extremely important for keeping recommendations relevant when large amounts of new data arrive and user preferences change. To address this issue, we adapt the ...

Added: October 3, 2025

Leveraging Geometric Insights in Hyperbolic Triplet Loss for Improved Recommendations

Yusupov V., Rakhuba M., Frolov E., in: RecSys '25: Proceedings of the Nineteenth ACM Conference on Recommender Systems. ACM, 2025. Ch. 1 P. 1217–1221.

Recent studies have demonstrated the potential of hyperbolic geometry for capturing complex patterns from interaction data in recommender systems. In this work, we introduce a novel hyperbolic recommendation model that uses geometrical insights to improve representation learning and increase computational stability at the same time. We reformulate the notion of hyperbolic distances to unlock additional ...

Added: October 3, 2025