Gender and Age Recognition from Facial Video Based on Convolutional Neural Networks

A. S. Kharchevnikova; A. V. Savchenko

P. 864–869.

Kharchevnikova A., Savchenko A.

Language: Russian

Keywords: committee machine; deep convolutional neural networks; convolutional neural networks; age and gender recognition

Publication based on the results of:

Development and validation of efficient classification methods for large multimedia databases (2017)

In book

Proceedings of the XXIII International Scientific and Technical Conference "Information Systems and Technologies 2017"

[s.n.], 2017.

Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet

Savchenko A., PeerJ Computer Science 2019 Vol. 5:e197 P. 1–26

This paper focuses on the automatic extraction of persons and their attributes (gender, year of birth) from an album of photos and videos. A two-stage approach is proposed in which, first, the convolutional neural network simultaneously predicts age/gender from all photos and additionally extracts facial representations suitable for face identification. Here the MobileNet is modified ...

Added: June 12, 2019

The Video-Based Age and Gender Recognition with Convolution Neural Networks

Savchenko A., Kharchevnikova Angelina S., in: Computational Aspects and Applications in Large-Scale Networks. Springer Proceedings in Mathematics & Statistics, Vol. 247. Springer, 2018. P. 37–46.

The paper reviews the problem of age and gender recognition methods for video data using modern deep convolutional neural networks. We present the comparative analysis of classifier fusion algorithms to aggregate decisions for individual frames. We implemented the video-based recognition system with several aggregation methods to improve the age and gender identification accuracy. The experimental ...

Added: September 2, 2018

Convolutional Neural Networks in the Problem of Video-Based Gender and Age Recognition

Kharchevnikova A., Savchenko A., in: Proceedings of the IV International Conference and Youth School "Information Technologies and Nanotechnologies" (ITNT 2018). Samara: Predpriyatie "Novaya Tekhnika", 2018. Ch. 124 P. 916–924.

In this paper we examine the age and gender video-based recognition problem using deep convolutional neural networks. The comparative analysis of classifier fusion algorithms to aggregate decisions for individual frames is presented. In order to improve the age and gender identification accuracy we implement the video-based recognition system with several aggregation methods. We provide the ...

Added: October 18, 2018

Event Recognition Based on Classification of Generated Image Captions

Savchenko A., Miasnikov E., in: Advances in Intelligent Data Analysis XVIII (IDA 2020), Vol. 12080. Cham: Springer, 2020. Ch. 33 P. 418–430.

In this paper, we consider the problem of event recognition on single images. In contrast to conventional fine-tuning of convolutional neural networks (CNN), we proposed to use image captioning, i.e., a generative model that converts images to textual descriptions. The motivation here is the possibility to combine conventional CNNs with a completely different approach in ...

Added: May 17, 2020

Neural networks in video-based age and gender recognition on mobile platforms

A.S. Kharchevnikova, Savchenko A., Optical Memory and Neural Networks (Information Optics) 2018 Vol. 27 No. 4 P. 246–259

The paper considers the use of convolutional neural networks for the concurrent recognition of the gender and age of a person by video records of his face. The emphasis is on the incorporation of the approach into mobile video-recording software. We have investigated the fusion of decisions obtained during the processing of each video frame, ...

Added: November 5, 2018

Detection and Recognition of Food in Photo Galleries for Analysis of User Preferences

Miasnikov E., Savchenko A., in: Proceedings of International Conference on Image Analysis and Recognition (ICIAR 2020), Vol. 12131. Cham: Springer, 2020. Ch. 9 P. 83–94.

Food analysis is one of the most important parts of user preference prediction engines for recommendation systems in the travel domain. In this paper, we describe and study the neural network method that allows you to recognize food in a gallery of photos taken with mobile devices. The described method consists of three main stages, ...

Added: October 1, 2020

Extraction of User Preferences Based on Methods for Automatic Generation of Textual Descriptions of Photo Album Images

Kharchevnikova A., Savchenko A., Computer Optics 2020 Vol. 44 No. 4 P. 618–626

The paper considers the problem of extracting user preferences from the user's photo album. A new approach is proposed based on automatic generation of textual descriptions of photographs and subsequent classification of these descriptions. Known methods for creating image annotations based on convolutional and recurrent (Long short-term memory) neural networks are analyzed. Using the Google’s Conceptual Captions dataset, new models are trained in which ...

Added: September 16, 2020

Multi-label Image Set Recognition in Visually-Aware Recommender Systems

Demochkin K., Savchenko A., in: Analysis of Images, Social Networks and Texts. 8th International Conference, AIST 2019, Lecture Notes in Computer Science, Revised Selected Papers, Vol. 11832. Cham: Springer, 2019. Ch. 26 P. 291–297.

In this paper we focus on the problem of multi-label image recognition for visually-aware recommender systems. We propose a two stage approach in which a deep convolutional neural network is firstly fine-tuned on a part of the training set. Secondly, an attention-based aggregation network is trained to compute the weighted average of visual features in ...

Added: December 22, 2019

Event Recognition with Automatic Album Detection based on Sequential Grouping of Confidence Scores and Neural Attention

Savchenko A., in: Proceedings of International Joint Conference on Neural Networks 2020 (IJCNN 2020). Piscataway: IEEE, 2020. P. 1–8.

In this paper a new formulation of event recognition task is examined: it is required to predict event categories given a gallery of images, for which albums (groups of photos corresponding to a single event) are unknown. The novel two-stage approach is proposed. At first, features are extracted in each photo using the pre-trained convolutional ...

Added: October 15, 2020

Cluster Analysis of Facial Video Data in Video Surveillance Systems Using Deep Learning

Savchenko A., Sokolova Anastasiia D., in: Computational Aspects and Applications in Large-Scale Networks. Springer Proceedings in Mathematics & Statistics, Vol. 247. Springer, 2018. P. 113–120.

In this paper, we propose the approach of structuring information in video surveillance systems by grouping the videos, which contain identical faces. First, the faces are detected in each frame and features of each facial region are extracted at the output of preliminarily trained deep convolution neural networks. Second, the tracks that contain identical faces ...

Added: September 2, 2018

A New Sport Teams Logo Dataset for Detection Tasks

Kuznetsov A., Savchenko A., in: Proceedings of the International Conference on Computer Vision and Graphics (ICCVG 2020), Vol. 12334. Cham: Springer, 2020. Ch. 8 P. 87–97.

In this research we introduce a new labelled SportLogo dataset, that contains images of two kinds of sports: hockey (NHL) and basketball (NBA). This dataset presents several challenges typical for logo detection tasks. A huge number of occlusions and logo view changes during playing games lead to an ambiguity of a straightforward detection approach use. ...

Added: October 1, 2020

Organizing Multimedia Data in Video Surveillance Systems Based on Face Verification with Convolutional Neural Networks

Sokolova Anastasiia, Kharchevnikova Angelina, Savchenko A., Lecture Notes in Computer Science 2018 Vol. 10716 P. 223–230

In this paper we propose the two-stage approach of organizing information in video surveillance systems. At first, the faces are detected in each frame and a video stream is split into sequences of frames with face region of one person. Secondly, these sequences (tracks) that contain identical faces are grouped using face verification algorithms and ...

Added: October 24, 2017

Organizing Multimedia Data in Video Surveillance Systems Based on Face Verification with Convolutional Neural Networks

Anastasiia D. Sokolova, Angelina S. Kharchevnikova, Savchenko A., in: Analysis of Images, Social Networks and Texts. 6th International Conference, 2017, Revised Selected Papers, Vol. 10716. Cham: Springer, 2018. P. 223–230.

Added: May 2, 2018

Clustering of Video Sequences in Video Surveillance Systems Based on Convolutional Neural Networks

Sokolova A. D., Savchenko A., in: Proceedings of the XXIII International Scientific and Technical Conference "Information Systems and Technologies 2017". [s.n.], 2017. P. 870–875.

The paper considers the problem of structuring information in video surveillance software by grouping video data that contain identical faces. The emphasis is on efficient clustering of video sequences using convolutional neural networks for feature extraction. A new algorithm for clustering video fragments is developed based on deep learning technologies and a statistical approach. Preliminary results of an experimental study of the accuracy and speed of the proposed ...

Added: October 24, 2017

Russian Q&A Method Study: From Naive Bayes to Convolutional Neural Networks

Nikolaev K., Malafeev A., in: Analysis of Images, Social Networks and Texts. 7th International Conference AIST 2018. Springer, 2018. Ch. 12 P. 121–126.

This paper deals with automatic classification of questions in the Russian language. In contrast to previously used methods, we introduce a convolutional neural network for question classification. We took advantage of an existing corpus of 2008 questions, manually annotated in accordance with a pragmatic 14-class typology. We modified the data by reducing the typology to ...

Added: February 15, 2019

Deep Convolutional Neural Networks Help Scoring Tandem Mass Spectrometry Data in Database-Searching Approaches

Kudriavtseva P., Kashkinov M., Kertész-Farkas A., Journal of Proteome Research 2021 Vol. 20 No. 10 P. 4708–4717

Spectrum annotation is a challenging task due to the presence of unexpected peptide fragmentation ions as well as the inaccuracy of the detectors of the spectrometers. We present a deep convolutional neural network, called Slider, which learns an optimal feature extraction in its kernels for scoring mass spectrometry (MS)/MS spectra to increase the number of ...

Added: August 30, 2021

On the generalization ability of data-driven models in the problem of total cloud cover retrieval

Krinitskiy M., Alexandrova M., Verezemskaya P. et al., Remote Sensing 2021 Vol. 13 No. 2 Article 326

Total Cloud Cover (TCC) retrieval from ground-based optical imagery is a problem that has been tackled by several generations of researchers. The number of human-designed algorithms for the estimation of TCC grows every year. However, there has been no considerable progress in terms of quality, mostly due to the lack of systematic approach to the ...

Added: September 24, 2021

Gender and Tourism: Challenges and Entrepreneurial Opportunities

Bingley: Emerald Publishing Limited, 2021.

Gender and Tourism: Challenges and Entrepreneurial Opportunities adopts a multi-disciplinary approach, building on a historically informed, future-focused research agenda that accounts for the needs and concerns of contemporary policy makers and practitioners in the tourism field. The collection is structured in two parts, with the first part collecting chapters that analyze the key factors of female entrepreneurship ...

Added: September 21, 2021

Organizing Data in Video Surveillance Systems Based on Deep Learning Technologies

Sokolova A. D., Savchenko A., in: Proceedings of the IV International Conference and Youth School "Information Technologies and Nanotechnologies" (ITNT 2018). Samara: Predpriyatie "Novaya Tekhnika", 2018. Ch. 128 P. 946–952.

The task of organizing information in video surveillance systems is implemented by grouping the video tracks, which contain identical faces. We examine aggregation methods for the features of individual frames extracted using deep convolutional neural networks. The tracks with identical faces are grouped based on known face verification algorithms and clustering methods. Experimental study on ...

Added: October 18, 2018

Preference prediction based on a photo gallery analysis with scene recognition and object detection

Savchenko A., Demochkin K., Grechikhin I., Pattern Recognition 2022 Vol. 121 Article 108248

In this paper, a user modeling task is examined by processing mobile device gallery of photos and videos. We propose a novel engine for preferences prediction based on scene recognition, object detection and facial analysis. At first, all faces in a gallery are clustered, and all private photos and videos with faces from large clusters ...

Added: August 19, 2021

Deep convolutional neural networks capabilities for binary classification of polar mesocyclones in satellite mosaics

Krinitskiy M. A., Verezemskaya P., Grashchenkov K. V. et al., Atmosphere 2018 Vol. 9 No. 426 P. 1–23

Polar mesocyclones (MCs) are small marine atmospheric vortices. The class of intense MCs, called polar lows, are accompanied by extremely strong surface winds and heat fluxes and thus largely influencing deep ocean water formation in the polar regions. Accurate detection of polar mesocyclones in high-resolution satellite data, while challenging, is a time-consuming task, when performed ...

Added: November 26, 2020

Sequential Analysis with Specified Confidence Level and Adaptive Convolutional Neural Networks in Image Recognition

Savchenko A., in: Proceedings of International Joint Conference on Neural Networks 2020 (IJCNN 2020). Piscataway: IEEE, 2020. P. 1–8.

In this paper the problem of high computational complexity of deep convolutional nets in image recognition is considered. An existing framework of adaptive neural networks is extended by appending the separate classifier to intermediate layers. The hierarchical representations of the input image are sequentially analyzed. If the first classifier returns rather high confidence score, the ...

Added: October 15, 2020

Video-based Frame-level Facial Analysis of Affective Behavior on Mobile Devices using EfficientNets

Savchenko A., in: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). IEEE, 2022. P. 2358–2365.

In this paper, we consider the problem of real-time video-based facial emotion analytics, namely, facial expression recognition, prediction of valence and arousal and detection of action unit points. We propose the novel frame-level emotion recognition algorithm by extracting facial features with the single EfficientNet model pre-trained on Affect-Net. The predictions for sequential frames are smoothed ...

Added: August 29, 2022

Pronunciation Training System Based on Convolutional Neural Networks and the Information Theory of Speech Perception

Savchenko L., Information Technologies 2019 Vol. 25 No. 5 P. 313–318

We consider a problem of computer assisted language and pronunciation learning based on the deep learning methods and the information theory of speech perception. In order to improve the efficiency of testing of pronunciation quality, we propose to train a convolutional neural network using the best reference utterances from the user. The experimental results proved ...

Added: May 29, 2019