Convolutional neural networks in the problem of gender and age recognition from video

A. S. Kharchevnikova; A. V. Savchenko


Ch. 124. P. 916–924.

Kharchevnikova A., Savchenko A.

In this paper we examine the problem of video-based age and gender recognition using deep convolutional neural networks. We present a comparative analysis of classifier fusion algorithms that aggregate the decisions obtained for individual frames. To improve the accuracy of age and gender identification, we implement a video-based recognition system with several aggregation methods. We provide an experimental comparison on the IJB-A, Indian Movies and Kinect datasets. The most accurate decisions are obtained using the geometric mean of the softmax-layer outputs of the convolutional neural networks for gender recognition and their mathematical expectation for age prediction.
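The two aggregation rules named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the sample probabilities and the age values are hypothetical, and the sketch assumes that the "mathematical expectation" for age is taken over frame-averaged softmax outputs, which the abstract does not specify.

```python
import math


def geometric_mean_fusion(frame_probs, eps=1e-12):
    """Fuse per-frame softmax vectors with a normalized geometric mean."""
    n_frames = len(frame_probs)
    n_classes = len(frame_probs[0])
    # Average the log-probabilities per class; eps guards against log(0).
    log_means = [
        sum(math.log(p[c] + eps) for p in frame_probs) / n_frames
        for c in range(n_classes)
    ]
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(log_means)
    fused = [math.exp(v - top) for v in log_means]
    total = sum(fused)
    return [v / total for v in fused]


def expected_age(frame_probs, ages):
    """Expected age under the frame-averaged posterior (an assumption)."""
    n_frames = len(frame_probs)
    mean_probs = [
        sum(p[c] for p in frame_probs) / n_frames for c in range(len(ages))
    ]
    return sum(a * q for a, q in zip(ages, mean_probs))


# Hypothetical softmax outputs for three frames of one video track.
gender_frames = [[0.6, 0.4], [0.9, 0.1], [0.3, 0.7]]
gender_fused = geometric_mean_fusion(gender_frames)
gender_label = max(range(len(gender_fused)), key=gender_fused.__getitem__)

# Hypothetical age classes (in years) and their per-frame posteriors.
age_frames = [[0.2, 0.5, 0.3], [0.1, 0.6, 0.3], [0.0, 0.7, 0.3]]
age_estimate = expected_age(age_frames, ages=[20, 30, 40])
```

Working in log space keeps the geometric mean stable when a track contains many frames, since a direct product of small probabilities would otherwise underflow.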

Language: Russian


Keywords: classifier fusion; deep learning; convolutional neural networks; age and gender recognition

Publication based on the results of:

Development and validation of efficient classification methods for large multimedia databases (2017)

In book

Proceedings of the IV International Conference and Youth School "Information Technology and Nanotechnology" (ITNT 2018)

Samara: Predpriyatie "Novaya Tekhnika", 2018.

Neural networks in video-based age and gender recognition on mobile platforms

A.S. Kharchevnikova, Savchenko A., Optical Memory and Neural Networks (Information Optics) 2018 Vol. 27 No. 4 P. 246–259

The paper considers the use of convolutional neural networks for the concurrent recognition of a person's gender and age from video recordings of the face. The emphasis is on the incorporation of the approach into mobile video-recording software. We have investigated the fusion of decisions obtained during the processing of each video frame, ...

Added: November 5, 2018

Isolated word recognition based on weighted voting of speaker-dependent neural network models

Savchenko L., Информационные технологии 2020 Vol. 26 No. 5 P. 290–296

The article deals with the problem of isolated word recognition based on deep convolutional neural networks. The practical use of existing recognition systems is limited by their insufficient reliability under intense acoustic noise, such as street noise, sounds from passing vehicles, etc. Nowadays, the most accurate recognition methods are characterized by ...

Added: September 2, 2020

The Video-Based Age and Gender Recognition with Convolution Neural Networks

Savchenko A., Kharchevnikova Angelina S., in: Computational Aspects and Applications in Large-Scale Networks. Springer Proceedings in Mathematics & Statistics Vol. 247. Springer, 2018. P. 37–46.

The paper reviews the problem of age and gender recognition methods for video data using modern deep convolutional neural networks. We present the comparative analysis of classifier fusion algorithms to aggregate decisions for individual frames. We implemented the video-based recognition system with several aggregation methods to improve the age and gender identification accuracy. The experimental ...

Added: September 2, 2018

A pronunciation training system based on convolutional neural networks and the information theory of speech perception

Savchenko L., Информационные технологии 2019 Vol. 25 No. 5 P. 313–318

We consider a problem of computer assisted language and pronunciation learning based on the deep learning methods and the information theory of speech perception. In order to improve the efficiency of testing of pronunciation quality, we propose to train a convolutional neural network using the best reference utterances from the user. The experimental results proved ...

Added: May 29, 2019

Deep convolutional neural networks capabilities for binary classification of polar mesocyclones in satellite mosaics

Krinitskiy M. A., Verezemskaya P., Grashchenkov K. V. et al., Atmosphere 2018 Vol. 9 No. 426 P. 1–23

Polar mesocyclones (MCs) are small marine atmospheric vortices. The class of intense MCs, called polar lows, is accompanied by extremely strong surface winds and heat fluxes and thus largely influences deep ocean water formation in the polar regions. Accurate detection of polar mesocyclones in high-resolution satellite data, while challenging, is a time-consuming task, when performed ...

Added: November 26, 2020

A Deep Learning Method Study of User Interest Classification

Malafeev A., Nikolaev K., in: Analysis of Images, Social Networks and Texts. 8th International Conference, AIST 2019, Kazan, Russia, July 17–19, 2019, Revised Selected Papers. Communications in Computer and Information Science Vol. 1086. Springer, 2020. P. 154–159.

In this paper, a deep learning method study is conducted to solve a new multiclass text classification problem, identifying user interests by text messages. We used an original dataset of almost 90 thousand forum text messages, labeled for ten interests. We experimented with different modern neural network architectures: recurrent and convolutional, as well as simpler ...

Added: November 7, 2019

Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet

Savchenko A., PeerJ Computer Science 2019 Vol. 5:e197 P. 1–26

This paper is focused on the automatic extraction of persons and their attributes (gender, year of birth) from an album of photos and videos. A two-stage approach is proposed in which, firstly, the convolutional neural network simultaneously predicts age/gender from all photos and additionally extracts facial representations suitable for face identification. Here the MobileNet is modified ...

Added: June 12, 2019

Gender and age recognition from facial video based on convolutional neural networks

Kharchevnikova A., Savchenko A., in: Proceedings of the XXIII International Scientific and Technical Conference "Information Systems and Technologies-2017". [s.n.], 2017. P. 864–869.

The paper considers the problem of building intelligent contextual advertising systems that automatically adapt to the potential preferences of the user. An analytical review is given of recent publications on gender and age recognition from facial video, including those based on deep convolutional neural networks. A comparative analysis is carried out of methods for aggregating the decisions obtained when recognizing each video frame. The results of an experimental study of their accuracy and speed are presented. ...

Added: October 24, 2017

Russian Q&A Method Study: From Naive Bayes to Convolutional Neural Networks

Nikolaev K., Malafeev A., in: Analysis of Images, Social Networks and Texts. 7th International Conference AIST 2018. Springer, 2018. Ch. 12 P. 121–126.

This paper deals with automatic classification of questions in the Russian language. In contrast to previously used methods, we introduce a convolutional neural network for question classification. We took advantage of an existing corpus of 2008 questions, manually annotated in accordance with a pragmatic 14-class typology. We modified the data by reducing the typology to ...

Added: February 15, 2019

Deep learning approach for predicting functional Z-DNA regions using omics data

Beknazarov N., Jin S., Poptsova M., Scientific Reports 2020 Vol. 10 P. 19134

Computational methods to predict Z-DNA regions are in high demand to understand the functional role of Z-DNA. The previous state-of-the-art method Z-Hunt is based on statistical mechanical and energy considerations about B- to Z-DNA transition using sequence information. Z-DNA CHiP-seq experiment results showed little overlap with Z-Hunt predictions implying that sequence information only is not ...

Added: December 11, 2020

Deep Machine Learning Investigation of Phase Transitions

Chertenkov V., Burovskiy E., Shchur L., in: Supercomputing: 8th Russian Supercomputing Days, RuSCDays 2022, Moscow, Russia, September 26–27, 2022, Revised Selected Papers Vol. 13708. Springer, 2022. P. 397–408.

We explore the possibilities of using neural networks to study phase transitions. The main question is the level of accuracy which can be achieved for the estimates of the critical point and critical exponents of statistical physics models. We generate data for two spin models in two dimensions for which analytical solutions exist, the Ising ...

Added: March 31, 2023

Landscape Patterns of Shrubification in Low Arctic Landscapes: A Machine Learning Perspective

Derkacheva A., Frost G., Epstein H. et al., Journal of Ecology 2024

Tundra shrub expansion is one of the primary vegetation changes being observed in Arctic ecosystems, but the pace of shrubification is highly variable across multiple spatial scales, complicating efforts to understand its drivers and consequences. Here we apply Convolutional Neural Networks (CNNs) to very-high resolution (VHR) commercial satellite imagery acquired 10–15 years apart to identify ...

Added: November 30, 2023

Can artificial intelligence predict court decisions? A systematic review of international research

Kazun A., Мониторинг общественного мнения: Экономические и социальные перемены 2024 No. 5 P. 100–122

Advancements in artificial intelligence technologies and the emergence of open databases containing judicial decisions have led to rapid improvements in algorithms capable of classifying legal documents and forecasting decisions made by judges. This article examines a body of international research dedicated to the question of how accurately AI can predict judges’ decisions, and consequently, whether ...

Added: November 29, 2024

Deep neural networks and maximum likelihood search for approximate nearest neighbor in video-based image recognition

Savchenko A., Optical Memory and Neural Networks (Information Optics) 2017 Vol. 26 No. 2 P. 129–136

We analyzed the way to increase computational efficiency of video-based image recognition methods with matching of high dimensional feature vectors extracted by deep convolutional neural networks. We proposed an algorithm for approximate nearest neighbor search. At the first step, for a given video frame the algorithm verifies a reference image obtained when recognizing the previous ...

Added: June 30, 2017

Methods of obtaining geospatial data using satellite communications and their processing using convolutional neural networks

Tsvetkovskaya I. I., Tekutieva N. V., Prokofyeva E. N. et al., in: 2020 Moscow Workshop on Electronic and Networking Technologies (MWENT). IEEE, 2020. P. 1–5.

The availability of high-resolution satellite images obtained through space radio communications offers the opportunity to use the most advanced technologies and techniques for analyzing remote sensing data. The paper discusses the data obtained with the use of ground-based, airborne or space-based filming equipment, which makes it possible to obtain images in one or several sections ...

Added: June 23, 2020

Emotion detection in multimedia content

A. S. Popova, A. G. Rassadin, A. A. Ponomarenko, in: Proceedings of the XXIII International Scientific and Technical Conference "Information Systems and Technologies-2017". [s.n.], 2017. P. 852–857.

In this paper we consider the automatic emotion recognition problem, especially the case of digital audio signal processing. We consider and verify an approach in which the classification of a sound fragment is reduced to the problem of image recognition. The waveform and spectrogram are used as a visual representation of the image. The computational ...

Added: October 18, 2017

Emotion detection in speech using long short-term memory

Popova A. S., Rassadin A. G., Ponomarenko A. A., in: Proceedings of the XXIV International Scientific and Technical Conference "Information Systems and Technologies-2018". [s.n.], 2018. P. 1083–1089.

The paper considers the problem of automatic emotion classification in a digital audio signal. It examines and verifies an approach in which a sound fragment is classified using a recurrent neural network with long short-term memory. Mel-frequency cepstral coefficients were used as features. A numerical experiment was carried out on the open Ravdess dataset, which includes 8 different emotions: "neutral", "calm", "happy", "sad", "angry", "fearful", "disgust", "surprise" ...

Added: October 21, 2018

Deep learning based methods for estimating distribution of coalescence rates from genome-wide data

Khomutov E., Arzymatov K., Shchur V., Journal of Physics: Conference Series 2021 Vol. 1740 Article 012031

Demographic and population structure inference is one of the most important problems in genomics. Population parameters such as effective population sizes, population split times and migration rates are of high interest both themselves and for many applications, e.g. for genome-wide association studies. Hidden Markov Model (HMM) based methods, such as PSMC, MSMC, coalHMM etc., proved ...

Added: May 17, 2021

Star-Shaped Denoising Diffusion Probabilistic Models

Andrey Okhotin, Dmitry Molchanov, Arkhipkin V. et al., in: Advances in Neural Information Processing Systems 36 (NeurIPS 2023). Curran Associates, Inc., 2023. P. 10038–10067.

Added: February 15, 2024

Proceedings of International Joint Conference on Neural Networks 2020 (IJCNN 2020)

Piscataway: IEEE, 2020.

2020 International Joint Conference on Neural Networks (IJCNN) held virtually, as part of the IEEE World Congress on Computational Intelligence (IEEE WCCI) 2020. IJCNN 2020 is jointly organized by the IEEE Computational Intelligence Society (CIS) and the International Neural Network Society (INNS). For IJCNN 2020 (and when WCCI is organized in even-numbered years) IEEE CIS ...

Added: October 15, 2020

Data organization in video surveillance systems using deep learning

A.D. Sokolova, A.V. Savchenko, in: CEUR Workshop Proceedings Vol. 2210: Proceedings of the International Conference Information Technology and Nanotechnology. Session Image Processing and Earth Remote Sensing. [s.n.], 2018. P. 243–250.

In this paper we propose to organize information in video surveillance systems by grouping the video tracks that contain identical faces. Aggregation of the features of individual frames extracted using deep convolutional neural networks is used to obtain a descriptor of a video track. The tracks with identical faces are grouped using the known ...

Added: November 5, 2018

Unet-boosted classifier: a multi-task architecture for small samples, illustrated by the classification of brain MRI images

Sobyanin K., Kulikova S., Информатика и автоматизация (Труды СПИИРАН) 2024 Vol. 23 No. 4 P. 1022–1046

The problem of training deep neural networks on small samples is especially relevant for medical problems. The paper examines the impact of pixel-wise marking of significant objects in the image, over the true class label, on the quality of the classification. To achieve better classification results on small samples, we propose a multitasking architecture -- ...

Added: June 29, 2024

Emotion Recognition in Sound

Popova A. S., Alexandr G. Rassadin, Alexander A. Ponomarenko, in: Advances in Neural Computation, Machine Learning, and Cognitive Research. Selected Papers from the XIX International Conference on Neuroinformatics, October 2-6, 2017, Moscow, Russia Vol. 736. Cham: Springer, 2017. P. 117–124.

In this paper we consider the automatic emotion recognition problem, especially the case of digital audio signal processing. We consider and verify a straightforward approach in which the classification of a sound fragment is reduced to the problem of image recognition. The waveform and spectrogram are used as a visual representation of the image. ...

Added: October 18, 2017

Refining the ONCE Benchmark With Hyperparameter Tuning

Maksim Golyadkin, Alexander Gambashidze, Nurgaliev I. et al., IEEE Access 2024 Vol. 12 P. 3805–3814

In response to the growing demand for 3D object detection in applications such as autonomous driving, robotics, and augmented reality, this work focuses on the evaluation of semi-supervised learning approaches for point cloud data. The point cloud representation provides reliable and consistent observations regardless of lighting conditions, thanks to advances in LiDAR sensors. Data annotation ...

Added: March 13, 2024