?
Сверточные нейронные сети в задаче распознавания пола и возраста по видеоизображению
Гл. 124. С. 916-924.
Kharchevnikova A., Savchenko A.
In this paper we examine the age and gender video-based recognition problem using deep convolutional neural networks. The comparative analysis of classifier fusion algorithms to aggregate decisions for individual frames is presented. In order to improve the age and gender identification accuracy we implement the video-based recognition system with several aggregation methods. We provide the experimental comparison for IJB-A, Indian Movies and Kinect datasets. It is demonstrated that the most accurate decisions are obtained using the geometric mean and mathematical expectation of the outputs at softmax layers of the convolutional neural networks for gender recognition and age prediction, respectively.
Keywords: коллектив решающих правилdeep learningconvolutional neural networksage and gender recognitionсверточные нейронные сетираспознавание пола и возрастаглубокое обучениеclassifier fusion
Publication based on the results of:
In book
Самара : Предприятие "Новая техника", 2018
A.S. Kharchevnikova, Savchenko A., Optical Memory and Neural Networks (Information Optics) 2018 Vol. 27 No. 4 P. 246-259
The paper considers the use of convolutional neural networks for the concurrent recognition of the gender and age of a person by video records of his face. The emphasis is on the incorporation of the approach into mobile video-recording software. We have investigated the fusion of decisions obtained during the processing of each video frame, ...
Added: November 5, 2018
Криницкий М. А., Verezemskaya P., Гращенков К. В. et al., Atmosphere 2018 Vol. 9 No. 426 P. 1-23
Polar mesocyclones (MCs) are small marine atmospheric vortices. The class of intense MCs, called polar lows, are accompanied by extremely strong surface winds and heat fluxes and thus largely influencing deep ocean water formation in the polar regions. Accurate detection of polar mesocyclones in high-resolution satellite data, while challenging, is a time-consuming task, when performed ...
Added: November 26, 2020
Savchenko L., Информационные технологии 2019 Т. 25 № 5 С. 313-318
We consider a problem of computer assisted language and pronunciation learning based on the deep learning methods and the information theory of speech perception. In order to improve the efficiency of testing of pronunciation quality, we propose to train a convolutional neural network using the best reference utterances from the user. The experimental results proved ...
Added: May 29, 2019
Savchenko A., Kharchevnikova Angelina S., , in : Computational Aspects and Applications in Large-Scale Networks. Springer Proceedings in Mathematics & Statistics. Vol. 247.: Springer, 2018. P. 37-46.
The paper reviews the problem of age and gender recognition methods for video data using modern deep convolutional neural networks. We present the comparative analysis of classifier fusion algorithms to aggregate decisions for individual frames. We implemented the video-based recognition system with several aggregation methods to improve the age and gender identification accuracy. The experimental ...
Added: September 2, 2018
Malafeev A., Nikolaev K., , in : Analysis of Images, Social Networks and Texts. 8th International Conference, AIST 2019, Kazan, Russia, July 17–19, 2019, Revised Selected Papers. Communications in Computer and Information Science. Vol. 1086.: Springer, 2020. P. 154-159.
In this paper, a deep learning method study is conducted to solve a new multiclass text classification problem, identifying user interests by text messages. We used an original dataset of almost 90 thousand forum text messages, labeled for ten interests. We experimented with different modern neural network architectures: recurrent and convolutional, as well as simpler ...
Added: November 7, 2019
Savchenko L., Информационные технологии 2020 Т. 26 № 5 С. 290-296
article deals with the problem of isolated words recognition based on deep convolutional neural networks. The use of
existing recognition systems in practice is limited by an insufficiently high degree of their reliability functioning in conditions of intense acoustic noise, such as street noise, sounds from passing vehicles, etc. Nowadays, the most accurate recognition methods are characterized by ...
Added: September 2, 2020
Kharchevnikova A., Savchenko A., В кн. : Материалы XXIII международной научно-технической конференции «Информационные системы и технологии-2017». : [б.и.], 2017. С. 864-869.
Рассматривается задача построения интеллектуальных систем контекстной рекламы с автоматической настройкой на потенциальные предпочтения пользователя. Выполнен аналитический обзор современных публикаций, посвященных распознаванию пола и возраста по видеоизображению лица, в том числе на основе глубоких сверточных нейронных сетей. Проведен сравнительный анализ способов агрегации решений, полученных при распознавании каждого видеокадра. Приведены результаты экспериментального исследования их точности и быстродействия. ...
Added: October 24, 2017
Savchenko A., PeerJ Computer Science 2019 Vol. 5:e197 P. 1-26
This paper is focused on the automatic extraction of persons and their attributes (gender, year of born) from album of photos and videos. A two-stage approach is proposed in which, firstly, the convolutional neural network simultaneously predicts age/gender from all photos and additionally extracts facial representations suitable for face identification. Here the MobileNet is modified ...
Added: June 12, 2019
Nikolaev K., Malafeev A., , in : Analysis of Images, Social Networks and Texts. 7th International Conference AIST 2018. : Springer, 2018. Ch. 12. P. 121-126.
This paper deals with automatic classification of questions in the Russian language. In contrast to previously used methods, we introduce a convolutional neural network for question classification. We took advantage of an existing corpus of 2008 questions, manually annotated in accordance with a pragmatic 14-class typology. We modified the data by reducing the typology to ...
Added: February 15, 2019
Babenko A., Slesarev A., Chigorin A. et al., , in : Lecture Notes in Computer Science. Proceedings of the 13th European Conference on Computer Vision (ECCV 2014). * 1. Vol. 8689.: Zürich : Springer, 2014. P. 584-599.
It has been shown that the activations invoked by an image within the top layers of a large convolutional neural network provide a high-level descriptor of the visual content of the image. In this paper, we investigate the use of such descriptors (neural codes) within the image retrieval application. In the experiments with several standard ...
Added: October 1, 2014
Chertenkov V., Burovskiy E., Shchur L., , in : Supercomputing: 8th Russian Supercomputing Days, RuSCDays 2022, Moscow, Russia, September 26–27, 2022, Revised Selected Papers. Vol. 13708.: Springer, 2022. P. 397-408.
We explore the possibilities of using neural networks to study phase transitions. The main question is the level of accuracy which can be achieved for the estimates of the critical point and critical exponents of statistical physics models. We generate data for two spin models in two dimensions for which analytical solutions exist, the Ising ...
Added: March 31, 2023
Derkacheva A., Frost G., Epstein H. et al., Journal of Ecology 2024
Tundra shrub expansion is one of the primary vegetation changes being observed in Arctic ecosystems, but the pace of shrubification is highly variable across multiple spatial scales, complicating efforts to understand its drivers and consequences. Here we apply Convolutional Neural Networks (CNNs) to very-high resolution (VHR) commercial satellite imagery acquired 10–15 years apart to identify ...
Added: November 30, 2023
Savchenko A., Optical Memory and Neural Networks (Information Optics) 2017 Vol. 26 No. 2 P. 129-136
We analyzed the way to increase computational efficiency of video-based image recognition methods with matching of high dimensional feature vectors extracted by deep convolutional neural networks. We proposed an algorithm for approximate nearest neighbor search. At the first step, for a given video frame the algorithm verifies a reference image obtained when recognizing the previous ...
Added: June 30, 2017
Beknazarov N., Jin S., Poptsova M., Scientific Reports 2020 Vol. 10 P. 19134
Computational methods to predict Z-DNA regions are in high demand to understand the functional role of Z-DNA. The previous state-of-the-art method Z-Hunt is based on statistical mechanical and energy considerations about B- to Z-DNA transition using sequence information. Z-DNA CHiP-seq experiment results showed little overlap with Z-Hunt predictions implying that sequence information only is not ...
Added: December 11, 2020
Romanyuk K., , in : 2018 Fifth HCT Information Technology Trends (ITT). : IEEE, 2018. P. 1-6.
The law of accelerating returns can be viewed as a concept that describes acceleration of technological progress. The idea is that tools are used for developing more advanced tools that are applied for creating even more advanced tools etc. A similar idea has been implemented in algorithms for advancing artificial intelligence. In this paper, the ...
Added: February 28, 2019
Попова А. С., Рассадин А. Г., Пономаренко А. А., В кн. : Материалы XXIV международной научно-технической конференции «Информационные системы и технологии-2018. : [б.и.], 2018. С. 1083-1089.
Рассматривается задача автоматической классификации эмоций в цифровом аудио сигнале. В работе рассматривается и верифицируется подход, в котором классификация звукового фрагмента производится с помощью рекуррентной нейронной сети c долговременно-кратковременной памятью. В качестве признаков использовались мел-кепстральные коэффициенты. Произведен численный эксперимент на открытом наборе данных Ravdess, включающий 8 различных эмоций: “нейтральный”, “спокойный”, “счастливый”, “грустный”, “злой”, “испуганный”, “отвращение”, “удивление” ...
Added: October 21, 2018
Piscataway : IEEE, 2020
2020 International Joint Conference on Neural Networks (IJCNN) held virtually, as part of the IEEE World Congress on Computational Intelligence (IEEE WCCI) 2020. IJCNN 2020 is jointly organized by the IEEE Computational Intelligence Society (CIS) and the International Neural Network Society (INNS). For IJCNN 2020 (and when WCCI is organized in even-numbered years) IEEE CIS ...
Added: October 15, 2020
A.D. Sokolova, A.V. Savchenko, , in : CEUR Workshop Proceedings. Vol. 2210: Proceedings of the International Conference Information Technology and Nanotechnology. Session Image Processing and Earth Remote Sensing .: [б.и.], 2018. P. 243-250.
In this paper we propose to organize information in video surveillance systems by grouping the video tracks, which contain identical faces. Aggregation of the features of individual frames extracted using deep convolutional neural networks are used in order to obtain a descriptor of video track. The tracks with identical faces are grouped using the known ...
Added: November 5, 2018
Khomutov E., Arzymatov K., Shchur V., Journal of Physics: Conference Series 2021 Vol. 1740 Article 012031
Demographic and population structure inference is one of the most important problems in genomics. Population parameters such as effective population sizes, population split times and migration rates are of high interest both themselves and for many applications, e.g. for genome-wide association studies. Hidden Markov Model (HMM) based methods, such as PSMC, MSMC, coalHMM etc., proved ...
Added: May 17, 2021
А. С. Попова, А. Г. Рассадин, А. А. Пономаренко, В кн. : Материалы XXIII международной научно-технической конференции «Информационные системы и технологии-2017». : [б.и.], 2017. С. 852-857.
In this paper we consider the automatic emotions recognition problem, especially the case of digital audio signal processing. We consider and verify an approach in which the classification of a sound fragment is reduced to the problem of image recognition. The waveform and spectrogram are used as a visual representation of the image. The computational ...
Added: October 18, 2017
Baklanov A., Khachay M., Pasynkov M., , in : Proceedings of Analysis of Images, Social Networks and Texts – 7th International Conference, AIST 2018, Moscow, Russia, July 5-7, 2018, Revised Selected Papers. Lecture Notes in Computer Science. Vol. 11179.: Berlin : Springer, 2018. P. 155-167.
This research is motivated by sustainability problems of oil palm expansion. Fast-growing industrial Oil Palm Plantations (OPPs) in the tropical belt of Africa, Southeast Asia and parts of Brazil lead to significant loss of rainforest and contribute to the global warming by the corresponding decrease of carbon dioxide absorption. We propose a novel approach to ...
Added: January 23, 2019
Tsvetkovskaya I. I., Tekutieva N. V., Prokofyeva E. N. et al., , in : 2020 Moscow Workshop on Electronic and Networking Technologies (MWENT). : IEEE, 2020. P. 1-5.
The availability of high-resolution satellite images obtained through space radio communications offers the opportunity to use the most advanced technologies and techniques for analyzing remote sensing data. The paper discusses the data obtained with the use of ground-based, airborne or space-based filming equipment, which makes it possible to obtain images in one or several sections ...
Added: June 23, 2020
Popova A. S., Alexandr G. Rassadin, Alexander A. Ponomarenko, , in : Advances in Neural Computation, Machine Learning, and Cognitive Research. Selected Papers from the XIX International Conference on Neuroinformatics, October 2-6, 2017, Moscow, Russia. Vol. 736.: Cham : Springer, 2017. P. 117-124.
In this paper we consider the automatic emotions recognition problem, especially the case of digital audio signal processing. We consider and verify an straight forward approach in which the classification of a sound fragment is reduced to the problem of image recognition. The waveform and spectrogram are used as a visual representation of the image. ...
Added: October 18, 2017
Maksim Golyadkin, Gambashidze A., Nurgaliev I. et al., IEEE Access 2024 Vol. 12 P. 3805-3814
In response to the growing demand for 3D object detection in applications such as autonomous driving, robotics, and augmented reality, this work focuses on the evaluation of semi-supervised learning approaches for point cloud data. The point cloud representation provides reliable and consistent observations regardless of lighting conditions, thanks to advances in LiDAR sensors. Data annotation ...
Added: March 13, 2024