?
Detection and Recognition of Food in Photo Galleries for Analysis of User Preferences
Ch. 9. P. 83–94.
Miasnikov E., Savchenko A.
Food analysis is one of the most important parts of user preference prediction engines for recommendation systems in the travel domain. In this paper, we describe and study the neural network method that allows you to recognize food in a gallery of photos taken with mobile devices. The described method consists of three main stages, including the classification of scenes, food detection, and subsequent classification. An essential feature of the developed method is the use of lightweight neural network models, which allows its usage on mobile devices. The development of the method was carried out using both known open data and a proprietary data set.
In book
Vol. 12131. , Cham: Springer, 2020.
Shadrina E. V., Мохова В. О., Загоскин В. А. et al., Нижегородский психологический альманах 2024 № 2
The article considers the problem of learning of recognizing emotions from pictures. A review and analysis of domestic and foreign works of scientists dealing with the problem of emotional intelligence was carried out. Its formation, influence on human activity and existing variants of its structure were considered, and common features in the understanding of emotional ...
Added: April 9, 2026
Pikul A. S., В кн.: Альманах научных работ молодых ученых университета ИТМО. Материалы Пятьдесят третьей (LIII) научной и учебно-методической конференции Том 1.: СПб.: Университет ИТМО, 2024. С. 338–342.
Предложен новый подход для улучшения распознавания атак презентации на биометрическую систему распознавания лиц с помощью сверточной сети с механизмом внимания. Проверена центральная гипотеза, которая заключалась в том, что с помощью механизма внимания возможно улучшить результаты работы исходной сверточной нейронной сети. В ходе экспериментов гипотеза была подтверждена. Наибольший прирост по качеству был достигнут на наборе данных ...
Added: December 13, 2025
Pikul A. S., Лепендин А. А., Труды молодых ученых Алтайского государственного университета 2023 № 20 С. 190–193
Представлен новый подход для выявления атак презентации на системы распознавания по лицу. Он основан на использовании механизма графового внимания, применяемого к промежуточным картам характеристик изображений лица, вычисленным сверточной сетью ResNet18. Показано, что предложенный подход позволил добиться высокого качества распознавания поддельных изображений при лицевой биометрической верификации, сравнимого с имеющимися в настоящее время альтернативными решениями. ...
Added: December 12, 2025
Pikul A. S., Безопасность информационных технологий 2024 Т. 31 № 4 С. 116–127
This article explores the potential use of modern computer vision architectures for the task of deepfake detection. The following architectures are considered: EfficientNet, Vision Transformer (ViT), VisionLSTM (ViL), Vision KAN, and Mamba Vision. The novelty of the approach lies in the application and comparison of these architectures, as well as their combination into paired ensembles ...
Added: December 12, 2025
Morozov D., Garipov T., Lyashevskaya O. et al., Journal of Language and Education 2024 Vol. 10 No. 4 P. 71–84
Introduction: Numerous algorithms have been proposed for the task of automatic morpheme segmentation of Russian words. Due to the differences in task formulation and datasets utilized, comparing the quality of these algorithms is challenging. It is unclear whether the errors in the models are due to the ineffectiveness of algorithms themselves or to errors and inconsistencies ...
Added: January 7, 2025
Derkacheva A., Frost G., Ermokhina K. et al., , in: 2023 International Conference on Machine Intelligence for GeoAnalytics and Remote Sensing (MIGARS).: IEEE, 2023. P. 1–4.
Many studies using convolutional neural networks, including in the field of satellite images, are aimed at recognizing clearly defined objects, such as cars or individual trees. Here we present the first results on mapping the growth stage of tundra shrubs, which is a ‘‘fuzzy’’ target for a network: there are no obvious geometric boundaries of ...
Added: March 22, 2023
Karpova I. P., Управление большими системами: сборник трудов 2022 № 96 С. 69–117
The paper describes a bioinspired method of mobile robots navigation, similar to the navigation mechanism of social insects. The model species is the red forest ant Formica rufa. The scout red forest ant remembers the route to food and can transmit information about the food location to foraging ants. Foragers can walk to the food ...
Added: December 8, 2022
Карпова И.П., В кн.: Муравьи и защита леса: Материалы XVI Всероссийского мирмекологического симпозиума, Москва, 27–31 августа 2022 года.: Товарищество научных изданий КМК, 2022. С. 217–222.
The paper proposes a method of using visual landmarks for memorizing the traversed path by a mobile robot (animate), based on the navigation mechanism of ants. The model of route presentation and the rules of its interpretation are described. This allows scouting robot to remember and repeat the route, and pass the route description to ...
Added: September 1, 2022
Savchenko A., , in: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).: IEEE, 2022. P. 2358–2365.
In this paper, we consider the problem of real-time video-based facial emotion analytics, namely, facial expression recognition, prediction of valence and arousal and detection of action unit points. We propose the novel frame-level emotion recognition algorithm by extracting facial features with the single EfficientNet model pre-trained on Affect-Net. The predictions for sequential frames are smoothed ...
Added: August 29, 2022
Makarov I., Bakhanova M., Nikolenko S. et al., PeerJ Computer Science 2022 Vol. 8 Article e865
Depth estimation has been an essential task for many computer vision applications, especially in autonomous driving, where safety is paramount. Depth can be estimated not only with traditional supervised learning but also via a self-supervised approach that relies on camera motion and does not require ground truth depth maps. Recently, major improvements have been introduced ...
Added: February 1, 2022
Krinitskiy M., Alexandrova M., Verezemskaya P. et al., Remote Sensing 2021 Vol. 13 No. 2 Article 326
Total Cloud Cover (TCC) retrieval from ground-based optical imagery is a problem that has been tackled by several generations of researchers. The number of human-designed algorithms for the estimation of TCC grows every year. However, there has been no considerable progress in terms of quality, mostly due to the lack of systematic approach to the ...
Added: September 24, 2021
Karpova I. P., Мехатроника, автоматизация, управление 2021 Т. 22 № 10 С. 537–546
A biologically-inspired approach to robot route following is presented. The ant of the genus Formica rufa (a red forest ant) is used as a model species. These ants actively use collective foraging, unlike many other ant species. The scout ant remembers the route to food and can transmit information about the food location to foraging ...
Added: September 21, 2021
Dmitrii Maslov, Makarov I., , in: Advances in Computational Intelligence: 16th International Work-Conference on Artificial Neural Networks, IWANN 2021, Virtual Event, June 16–18, 2021, Proceedings, Part I* 1. Vol. 12861.: Springer, 2021. Ch. 38 P. 456–467.
In this paper, we study depth reconstruction via RGB-based, Sparse-Depth, and RGBd approaches. We showed that combination of RGB and Sparse Depth approach in RGBd scenario provides the best results. We also proved that the models performance can be further tuned via proper selection of architecture blocks and number of depth points guiding RGB-to-depth reconstruction. ...
Added: September 1, 2021
Kudriavtseva P., Kashkinov M., Kertész-Farkas A., Journal of Proteome Research 2021 Vol. 20 No. 10 P. 4708–4717
Spectrum annotation is a challenging task due to the presence of unexpected peptide fragmentation ions as well as the inaccuracy of the detectors of the spectrometers. We present a deep convolutional neural network, called Slider, which learns an optimal feature extraction in its kernels for scoring mass spectrometry (MS)/MS spectra to increase the number of ...
Added: August 30, 2021
Savchenko A., Demochkin K., Grechikhin I., Pattern Recognition 2022 Vol. 121 Article 108248
In this paper, a user modeling task is examined by processing mobile device gallery of photos and videos. We propose a novel engine for preferences prediction based on scene recognition, object detection and facial analysis. At first, all faces in a gallery are clustered, and all private photos and videos with faces from large clusters ...
Added: August 19, 2021