Detection and Recognition of Food in Photo Galleries for Analysis of User Preferences

Miasnikov E., Savchenko A.

doi:10.1007/978-3-030-50347-5_9

Ch. 9. P. 83–94.

Food analysis is one of the most important parts of user preference prediction engines for recommendation systems in the travel domain. In this paper, we describe and study a neural network method for recognizing food in a gallery of photos taken with mobile devices. The method consists of three main stages: scene classification, food detection, and subsequent classification. An essential feature of the method is its use of lightweight neural network models, which makes it suitable for use on mobile devices. The method was developed using both publicly available data and a proprietary data set.
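The three stages named in the abstract can be sketched as a simple pipeline. This is only an illustrative sketch, not the authors' implementation: the function and type names (`analyze_gallery`, `FoodItem`, `classify_scene`, `detect_food`, `classify_food`, `food_scenes`) are hypothetical, the models are replaced by toy stand-ins, and the use of scene classification as a filter that skips non-food photos is an assumption that the abstract does not state.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Set, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom) in pixels


@dataclass(frozen=True)
class FoodItem:
    """One recognized food region in one photo (hypothetical result type)."""
    photo_id: str
    box: Box
    label: str


def analyze_gallery(
    photo_ids: Iterable[str],
    classify_scene: Callable[[str], str],
    detect_food: Callable[[str], List[Box]],
    classify_food: Callable[[str, Box], str],
    food_scenes: Set[str],
) -> List[FoodItem]:
    """Run the three stages over a gallery and collect labeled food regions."""
    items: List[FoodItem] = []
    for photo_id in photo_ids:
        # Stage 1: scene classification. Assumption: photos whose scene is
        # not food-related are skipped before the detector runs.
        if classify_scene(photo_id) not in food_scenes:
            continue
        # Stage 2: food detection returns candidate bounding boxes.
        for box in detect_food(photo_id):
            # Stage 3: classification of each detected region.
            items.append(FoodItem(photo_id, box, classify_food(photo_id, box)))
    return items


# Toy stand-ins for the three models, used only to exercise the control flow.
SCENES = {"a.jpg": "restaurant", "b.jpg": "beach"}
BOXES = {"a.jpg": [(10, 20, 110, 120)], "b.jpg": [(0, 0, 50, 50)]}

found = analyze_gallery(
    ["a.jpg", "b.jpg"],
    classify_scene=lambda photo_id: SCENES[photo_id],
    detect_food=lambda photo_id: BOXES[photo_id],
    classify_food=lambda photo_id, box: "pizza",
    food_scenes={"restaurant"},
)
print(found)
```

Passing the three models as callables keeps the sketch independent of any particular framework; in practice each callable would wrap a lightweight network running on the device.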

Keywords: deep convolutional neural networks; convolutional neural networks; scene recognition; food recognition; food recognition in images

In book

Proceedings of International Conference on Image Analysis and Recognition (ICIAR 2020)

Vol. 12131. Cham: Springer, 2020.

Teaching Emotion Recognition with the Mobile Application "ТРОПЭМО"

Shadrina E. V., Мохова В. О., Загоскин В. А. et al., Нижегородский психологический альманах 2024 No. 2

The article considers the problem of learning to recognize emotions from pictures. Domestic and foreign research on emotional intelligence is reviewed and analyzed. Its formation, its influence on human activity, and existing variants of its structure are considered, and common features in the understanding of emotional ...

Added: April 9, 2026

A Method for Improving Presentation Attack Detection in a Face Recognition Biometric System Using a Convolutional Network with an Attention Mechanism

Pikul A. S., in: Альманах научных работ молодых ученых университета ИТМО. Материалы Пятьдесят третьей (LIII) научной и учебно-методической конференции. Vol. 1. St. Petersburg: ITMO University, 2024. P. 338–342.

A new approach is proposed for improving the detection of presentation attacks on a face recognition biometric system using a convolutional network with an attention mechanism. The central hypothesis, that an attention mechanism can improve the results of the original convolutional neural network, was tested and confirmed experimentally. The largest gain in quality was achieved on the data set ...

Added: December 13, 2025

A Deep Neural Network with Graph Attention for Detecting Fake Face Images

Pikul A. S., Лепендин А. А., Труды молодых ученых Алтайского государственного университета 2023 No. 20 P. 190–193

A new approach to detecting presentation attacks on face recognition systems is presented. It is based on a graph attention mechanism applied to intermediate feature maps of face images computed by the ResNet18 convolutional network. The proposed approach is shown to achieve high quality in recognizing fake images in facial biometric verification, comparable to currently available alternative solutions. ...

Added: December 12, 2025

An Ensemble of Modern Computer Vision Models for Deepfake Detection

Pikul A. S., Безопасность информационных технологий 2024 Vol. 31 No. 4 P. 116–127

This article explores the potential use of modern computer vision architectures for the task of deepfake detection. The following architectures are considered: EfficientNet, Vision Transformer (ViT), VisionLSTM (ViL), Vision KAN, and Mamba Vision. The novelty of the approach lies in the application and comparison of these architectures, as well as their combination into paired ensembles ...

Added: December 12, 2025

Automatic Morpheme Segmentation for Russian: Can an Algorithm Replace Experts?

Morozov D., Garipov T., Lyashevskaya O. et al., Journal of Language and Education 2024 Vol. 10 No. 4 P. 71–84

Introduction: Numerous algorithms have been proposed for the task of automatic morpheme segmentation of Russian words. Due to the differences in task formulation and datasets utilized, comparing the quality of these algorithms is challenging. It is unclear whether the errors in the models are due to the ineffectiveness of algorithms themselves or to errors and inconsistencies ...

Added: January 7, 2025

Neural networks and satellite images-based shrub tundra landscape study: phenomena with fuzzy geometric and categorical boundaries

Derkacheva A., Frost G., Ermokhina K. et al., in: 2023 International Conference on Machine Intelligence for GeoAnalytics and Remote Sensing (MIGARS). IEEE, 2023. P. 1–4.

Many studies using convolutional neural networks, including in the field of satellite images, are aimed at recognizing clearly defined objects, such as cars or individual trees. Here we present the first results on mapping the growth stage of tundra shrubs, which is a "fuzzy" target for a network: there are no obvious geometric boundaries of ...

Added: March 22, 2023

On a Bioinspired Approach to Robot Navigation, or a True "Ant" Algorithm

Karpova I. P., Управление большими системами: сборник трудов 2022 No. 96 P. 69–117

The paper describes a bioinspired method of mobile robot navigation, similar to the navigation mechanism of social insects. The model species is the red forest ant Formica rufa. The scout red forest ant remembers the route to food and can transmit information about the food location to foraging ants. Foragers can walk to the food ...

Added: December 8, 2022

On a Bioinspired Approach to Robot Navigation

Karpova I. P., in: Муравьи и защита леса: Материалы XVI Всероссийского мирмекологического симпозиума, Москва, 27–31 августа 2022 года. Товарищество научных изданий КМК, 2022. P. 217–222.

The paper proposes a method of using visual landmarks to memorize the path traversed by a mobile robot (animat), based on the navigation mechanism of ants. The model of route representation and the rules of its interpretation are described. This allows a scout robot to remember and repeat the route, and pass the route description to ...

Added: September 1, 2022

Video-based Frame-level Facial Analysis of Affective Behavior on Mobile Devices using EfficientNets

Savchenko A., in: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). IEEE, 2022. P. 2358–2365.

In this paper, we consider the problem of real-time video-based facial emotion analytics, namely, facial expression recognition, prediction of valence and arousal, and detection of action unit points. We propose a novel frame-level emotion recognition algorithm that extracts facial features with a single EfficientNet model pre-trained on AffectNet. The predictions for sequential frames are smoothed ...

Added: August 29, 2022

Self-supervised recurrent depth estimation with attention mechanisms

Makarov I., Bakhanova M., Nikolenko S. et al., PeerJ Computer Science 2022 Vol. 8 Article e865

Depth estimation has been an essential task for many computer vision applications, especially in autonomous driving, where safety is paramount. Depth can be estimated not only with traditional supervised learning but also via a self-supervised approach that relies on camera motion and does not require ground truth depth maps. Recently, major improvements have been introduced ...

Added: February 1, 2022

On the generalization ability of data-driven models in the problem of total cloud cover retrieval

Krinitskiy M., Alexandrova M., Verezemskaya P. et al., Remote Sensing 2021 Vol. 13 No. 2 Article 326

Total Cloud Cover (TCC) retrieval from ground-based optical imagery is a problem that has been tackled by several generations of researchers. The number of human-designed algorithms for the estimation of TCC grows every year. However, there has been no considerable progress in terms of quality, mostly due to the lack of a systematic approach to the ...

Added: September 24, 2021

Organizing an Animat's Route Based on Visual Landmarks and Scene Recognition

Karpova I. P., Мехатроника, автоматизация, управление 2021 Vol. 22 No. 10 P. 537–546

A biologically inspired approach to robot route following is presented. The red forest ant, Formica rufa, is used as a model species. These ants actively use collective foraging, unlike many other ant species. The scout ant remembers the route to food and can transmit information about the food location to foraging ...

Added: September 21, 2021

Fast Depth Reconstruction Using Deep Convolutional Neural Networks

Maslov D., Makarov I., in: Advances in Computational Intelligence: 16th International Work-Conference on Artificial Neural Networks, IWANN 2021, Virtual Event, June 16–18, 2021, Proceedings, Part I. Vol. 12861. Springer, 2021. Ch. 38. P. 456–467.

In this paper, we study depth reconstruction via RGB-based, Sparse-Depth, and RGBd approaches. We showed that combining the RGB and Sparse-Depth approaches in the RGBd scenario provides the best results. We also proved that the models' performance can be further tuned via proper selection of architecture blocks and the number of depth points guiding RGB-to-depth reconstruction. ...

Added: September 1, 2021

Deep Convolutional Neural Networks Help Scoring Tandem Mass Spectrometry Data in Database-Searching Approaches

Kudriavtseva P., Kashkinov M., Kertész-Farkas A., Journal of Proteome Research 2021 Vol. 20 No. 10 P. 4708–4717

Spectrum annotation is a challenging task due to the presence of unexpected peptide fragmentation ions as well as the inaccuracy of the detectors of the spectrometers. We present a deep convolutional neural network, called Slider, which learns an optimal feature extraction in its kernels for scoring mass spectrometry (MS)/MS spectra to increase the number of ...

Added: August 30, 2021

Preference prediction based on a photo gallery analysis with scene recognition and object detection

Savchenko A., Demochkin K., Grechikhin I., Pattern Recognition 2022 Vol. 121 Article 108248

In this paper, a user modeling task is examined by processing a mobile device's gallery of photos and videos. We propose a novel engine for preference prediction based on scene recognition, object detection, and facial analysis. First, all faces in a gallery are clustered, and all private photos and videos with faces from large clusters ...

Added: August 19, 2021