Video-based Frame-level Facial Analysis of Affective Behavior on Mobile Devices using EfficientNets

A. Savchenko

doi:10.1109/CVPRW56347.2022.00263

Publications

?

Video-based Frame-level Facial Analysis of Affective Behavior on Mobile Devices using EfficientNets

P. 2358–2365.

Savchenko A.

In this paper, we consider the problem of real-time video-based facial emotion analytics, namely, facial expression recognition, prediction of valence and arousal and detection of action unit points. We propose the novel frame-level emotion recognition algorithm by extracting facial features with the single EfficientNet model pre-trained on Affect-Net. The predictions for sequential frames are smoothed using mean or median filters. It is demonstrated that our approach may be implemented even for video analytics on mobile devices. Experimental results for the large scale AffWild2 database from the third Affective Behavior Analysis in-the-wild Competition demonstrate that our simple model is significantly better when compared to the VggFace baseline. In particular, our method is characterized by 0.1-0.5 higher performance measures for test sets in the uni-task Expression Classification, Valence-Arousal Estimation, Action Unit Detection and Multi-Task Learning. Our team took the 3rd place in the multi-task learning challenge and 4th places in Valence-Arousal and Expression challenges. Due to simplicity, the proposed approach may be considered as a new baseline for all four sub-challenges.

Keywords: мобильные устройства распознавание эмоций emotion recognition mobile devices сверточные нейронные сети Facial Expression Recognition affective computing Convolutional neural network (CNN)обработка изображений лиц

In book

2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

IEEE, 2022.

HSE-NN Team at the 4th ABAW Competition: Multi-task Emotion Recognition and Learning from Synthetic Images

Savchenko A., / Series Computer Science "arxiv.org". 2022.

In this paper, we present the results of the HSE-NN team in the 4th competition on Affective Behavior Analysis in-the-wild (ABAW). The novel multi-task EfficientNet model is trained for simultaneous recognition of facial expressions and prediction of valence and arousal on static photos. The resulting MT-EmotiEffNet extracts visual features that are fed into simple feed-forward ...

Added: October 21, 2022

Classifying emotions and engagement in online learning based on a single facial expression recognition neural network

Savchenko A., Savchenko L., Makarov I., IEEE Transactions on Affective Computing 2022 Vol. 13 No. 4 P. 2132–2143

In this paper, behaviour of students in the e-learning environment is analyzed. The novel pipeline is proposed based on video facial processing. At first, face detection, tracking and clustering techniques are applied to extract the sequences of faces of each student. Next, a single efficient neural network is used to extract emotional features in each ...

Added: July 14, 2022

Determinants of Football Fans’ Happiness: Evidence from Facial Emotion Recognition

Naidenova I. N., Parshakov P., Sofiia Paklina, Journal of Happiness Studies 2020 Vol. 21 P. 1103–1116

We analyse the determinants of football fans’ happiness in the Russian Premier League using facial emotion recognition. We propose a new way of measuring subjective well-being and provide its empirical validation using sports data. Our sample consists of about 10,000 photos from football matches uploaded on the most popular social network in Russia during the ...

Added: October 16, 2019

Мобильные устройства как способ установления баланса между работой и личной жизнь: оборотная сторона

Сон Х. И., Chernova Z. V., Мониторинг общественного мнения: Экономические и социальные перемены 2018 № 6 С. 201–215

Modern corporate culture in the context of Bauman’s liquid modernity is greatly defined by the level of freedom, in particular, flexibility, mobility, new technologies and mass communications. Staying connected 24/7 both in professional and private life known as ‘hyperconnectivity’ becomes commonplace. Hyperconnectivity entails not only positive but also negative consequences regarding the effectiveness of an individual’s ...

Added: May 24, 2019

Fast Emotion Recognition Neural Network for IoT Devices

Mikhaylevskiy S., Chernyavskiy V., Pavlishen V. et al., , in: 2021 International Seminar on Electron Devices Design and Production (SED). IEEE, 2021. P. 1–6.

The last decades have witnessed rapid IoT technologies development, which provided ubiquitous human-computer interactions. Building intelligent systems of various types, among which emotion recognition systems, is important challenge nowadays. Especially pressing problem is to build a real-time portable system which can be embedded in low performance hardware. We propose a high accuracy emotion recognition system, ...

Added: August 8, 2021

Мобильные экосистемы

Avdoshin S. M., Pesotskaya E. Y., Открытые системы. СУБД 2014 № 2 С. 32–34

Mobile Ecosystems have been related to products, or to a community of developers around a product and gives the certain advantages to the platform owners and participants of the ecosystem. The paper answers the question -- what are the existing approaches to build mobile ecosystems, who are the participants and what are their benefits? ...

Added: April 10, 2014

Подходы к распознаванию эмоций в интеллектуальных системах

Карташева А. А., Технологос 2020 № 2 С. 15–24

The article examines approaches to emotion recognition in intelligent systems from the point of view of methodological grounds. In interdisciplinary research, where it is necessary to combine approaches from different fields, we face terminological uncertainty, since the problem of describing the emotional sphere is solved by different researchers in line with several main approaches. First, the ...

Added: October 20, 2023

Emotion Recognition of a Group of People in Video Analytics Using Deep Off-the-Shelf Image Embeddings

Tarasov Alexander V., Savchenko A., , in: Proceedings of Analysis of Images, Social Networks and Texts – 7th International Conference, AIST 2018, Moscow, Russia, July 5-7, 2018, Revised Selected Papers. Lecture Notes in Computer ScienceVol. 11179. Berlin: Springer, 2018. Ch. 19 P. 191–198.

In this paper we address the group-level emotion classification problem in video analytic systems.We propose to apply the MTCNN face detector to obtain facial regions on each video frame. Next, off-the-shelf image features are extracted from each located face using preliminary trained convolutional neural networks. The features of the whole frame are computed as a ...

Added: December 12, 2018

Overview of the Advancements in Automatic Emotion Recognition: Comparative Performance of Commercial Algorithms

Mariya Malygina, Artemyev M., Belyaev A. et al., / Series 4610E-685-0BE "Social and Behavioral Sciences ". 2019.

In the recent years facial emotion recognition algorithms have evolved and in some cases top commercial algorithms detect emotions like happiness better than humans do. To evaluate the performance of these algorithms, the common practice is to compare them with human-labeled ground truth. This article covers monitoring of the advancements in automatic emotion recognition solutions, ...

Added: December 24, 2019

A new videotest for measuring emotion recognition ability

Lyusin D., Ovsyannikova V. V., / NRU Higher School of Economics. Series PSY "Psychology". 2014. No. WP BRP 16/PSY/2014.

A new measure for emotion recognition abilities, the Videotest of Emotion Recognition, is described. Two aspects in emotion recognition are distinguished, accuracy of recognition of emotion types that constitute the emotional state of the observed person and sensitivity to the intensity of the observed emotions. The Videotest of Emotion Recognition allows obtaining the accuracy and ...

Added: February 13, 2014

Вычислительно эффективные алгоритмы классификации изображений на основе последовательного анализа

Savchenko A., Записки научных семинаров ПОМИ РАН 2021 Т. 499 С. 267–283

In this paper fast image recognition techniques based on statistical sequential analysis are discussed. We examine the possibility to sequentially process the principal components and organize a convolutional neural net- work with early exits. Particular attention is paid to sequentially learn multi-task lightweight neural network model to predict several facial at- tributes (age, gender and ...

Added: January 27, 2021

Исследование потребительских предпочтений на московском рынке смартфонов

Zvezdina N., Сорокин А. С., Вопросы статистики 2017 № 7 С. 41–51

As it is generally known, market of cutting edge technological devices (smartphones, tabs, etc.) is one of the most dynamic, competitive and knowledge intensive market in world. Thanks to that, market leaders always have to chase the latest tendencies and trends, in order to be in sync with consumers’ preferences. In this article authors present ...

Added: September 4, 2017

Neural networks and satellite images-based shrub tundra landscape study: phenomena with fuzzy geometric and categorical boundaries

Derkacheva A., Frost G., Ermokhina K. et al., , in: 2023 International Conference on Machine Intelligence for GeoAnalytics and Remote Sensing (MIGARS). IEEE, 2023. P. 1–4.

Many studies using convolutional neural networks, including in the field of satellite images, are aimed at recognizing clearly defined objects, such as cars or individual trees. Here we present the first results on mapping the growth stage of tundra shrubs, which is a ‘‘fuzzy’’ target for a network: there are no obvious geometric boundaries of ...

Added: March 22, 2023

Влияние эмоционального состояния на распознавание эмоций

Ovsyannikova V. V., Психология. Журнал Высшей школы экономики 2014 Т. 11 № 1 С. 86–101

The paper focuses on the way one’s own emotional state influences the recognition of other people’s emotions. Existing research indicates the effect of congruence between the emotions experienced at the moment and the evaluations of emotional stimuli. Our experimental study tested the hypotheses of the influence of emotional states on two aspects of emotion recognition, ...

Added: October 13, 2014

Связь между эмоциональными личностными чертами наблюдателя и сензитивностью к эмоциям определенной модальности

Lyusin D., Климова Е. А., Медведева В. В., Вестник Ярославского государственного университета им. П.Г. Демидова. Серия Гуманитарные науки 2014 № 3 С. 81–87

Статья посвящена изучению связи между эмоциональными личностными чертами человека и особенностями распознавания им эмоций других людей. Выдвигаются и обосновываются две альтернативные гипотезы, описывающие связь между эмоциональными личностными чертами и сензитивностью к эмоциям определённой модальности: гипотезы конгруэнтности и комплементарности. Вводится понятие сензитивности к эмоциям разной модальности, предлагается оригинальный способ её измерения. Получено подтверждение гипотезы комплементарности. ...

Added: October 10, 2013

Модели и методы интерактивного взаимодействия с вычислительными устройствами нового поколения

Manakhov P., Ковшов Е. Е., Прикладная информатика 2012 № 3(39) С. 71–81

The article examines the issue of developing models of the text input methods. The urgency of this matter is dictated by the reduction of financial costs of designing new input methods and upgrading existing ones. The article suggests a modeling method, which is verified by a series of experiments. Also the article gives recommendations on ...

Added: January 17, 2015

Влияние эмоционального состояния и диспозициональной радости на скорость распознавания эмоций по выражению лица

Ovsyannikova V. V., Психология. Журнал Высшей школы экономики 2016 Т. 13 № 3 С. 588–599

Previous works show that mood congruence effect or trait congruence effect can be achieved (Chepenik et al., 2007; Rusting, 1998). The present study explores the effect of emotional state and dispositional joy on effectiveness of emotion recognition from facial expression. The experimental study was conducted in two groups of subjects. The general sample consisted of ...

Added: October 22, 2016

Mobile Tests as a Tool of Formative Classroom Assessment

Ryabkova V.V., Высшее образование сегодня 2020 No. 8 P. 44–46

The results of the study of the didactic potential of mobile tests used in the framework of formative assessment of educational achievements of students of the English language according to the traditional model are presented. The data of the experiment using the Google Forms service is highlighted, during which grammar, vocabulary, reading and listening skills ...

Added: November 14, 2022

Three-way classification for sequences of observations

A. V. Savchenko, L. V. Savchenko, Information Sciences 2023 Vol. 648 Article 119540

This article introduces the novel technique to reduce the computation time for classifying a sequence of observations (frames), such as a video stream, where each observation is described by high-dimensional embeddings extracted by a deep neural network. By using the methodology of granular computing, an observed sequence is represented at various scales using different frame ...

Added: August 27, 2023

Моделирование потребительских предпочтений на московском рынке смартфонов методом совместного анализа

Zvezdina N., Сорокин А. С., Вопросы статистики 2018 № 12 С. 28–39

The statement that demand creates its own supply rings true for the smartphone market as well as for any other sector of the market economy. With computerization level of society growing, Internet-based technology advancing, with the increase in Internet penetration of households, growing population mobility and the development of small and medium-sized enterprises, it has ...

Added: January 10, 2019

Культурные правила выражения и распознавание эмоций других людей: различия в распознавании гнева представителями армянской и русской культур

Sysoeva T., Айрапетян Е. А., Психологические исследования: электронный научный журнал 2023 Т. 16 № 92 Статья 1

The current study is aimed to investigate the differences in emotion recognition among representatives of Armenian and Russian cultures. A preliminary study demonstrated that Armenians, unlike Russians, tend to control the expression of anger towards in-group members in greater extent. One hypothesis explaining the cultural influence on emotion recognition suggests that expression norms, which require ...

Added: January 29, 2024

Group-Level Emotion Recognition using Transfer Learning from Face Identification

Alexandr Rassadin, Alexey Gruzdev, Andrey Savchenko, , in: Proceedings of the 19th ACM International Conference on Multimodal Interaction. [б.и.], 2017. P. 544–548.

In this paper we describe our algorithmic approach, which was used for submissions in the fifth Emotion Recognition in the Wild (EmotiW 2017) group-level emotion recognition sub-challenge. We extracted feature vectors of detected faces using the Convolutional Neural Network trained for face identification task, rather than traditional pre-training on emotion recognition problems. In the final ...

Added: October 18, 2017

Sequential analysis in Fourier probabilistic neural networks

Savchenko A., Belova N. S., Expert Systems with Applications 2022 Vol. 207 Article 117885

In this paper, the computational complexity of the probabilistic neural network for the classification of high-dimensional data is improved. At first, the class probability densities are estimated by using only a few principal components of an observed point. The Gaussian–Parzen kernel is replaced by the orthogonal series estimates of class-conditional densities for each principal component using the Fourier series to speed ...

Added: June 29, 2022

Organizing Multimedia Data in Video Surveillance Systems Based on Face Verification with Convolutional Neural Networks

Anastasiia D. Sokolova, Angelina S. Kharchevnikova, Savchenko A., , in: Analysis of Images, Social Networks and Texts. 6th International Conference, 2017, Revised Selected PapersVol. 10716. Cham: Springer, 2018. P. 223–230.

In this paper we propose the two-stage approach of organizing information in video surveillance systems. At first, the faces are detected in each frame and a video stream is split into sequences of frames with face region of one person. Secondly, these sequences (tracks) that contain identical faces are grouped using face verification algorithms and ...

Added: May 2, 2018