Russian Sign Language Dactyl Recognition

I. Makarov; N. Veldyaykin; Maxim Chertkov; Aleksei Pokoev

doi:10.1109/TSP.2019.8768868

Publications

?

Russian Sign Language Dactyl Recognition

P. 726–729.

Makarov I., Veldyaykin N., Maxim Chertkov, Aleksei Pokoev

In this paper, we compare several real-time sign language dactyl recognition systems and present a new model based on deep convolutional neural networks. These systems are able to recognize Russian alphabet letters presented as static signs in Russian Sign language used by people from deaf community. In such an approach, we recognize words from Russian natural language presented by consequent hand gestures of each letter. We evaluate our approach on Russian (RSL) sign language, for which we collect our own dataset and evaluate dactyl recognition.

Keywords: Deep Convolutional Neural Networks Russian Sign Language РЖЯ Sign Language Translation Hand Gesture Recognition жестовые языки

In book

2019 42nd International Conference on Telecommunications and Signal Processing (TSP)

NY: IEEE, 2019.

American and Russian Sign Language Dactyl Recognition and Text2Sign Translation

Makarov I., Veldyaykin N., Maxim Chertkov et al., , in: Analysis of Images, Social Networks and Texts. 8th International Conference AIST 2019. Springer, 2019. P. 309–320.

Sign language is the main way to communicate for people from deaf community. However, common people mostly do not know sign language. In this paper, we overview several real-time sign language dactyl recognition systems using deep convolutional neural networks. These systems are able to recognize dactylized words gestured by signs for each letter. We evaluate ...

Added: February 4, 2020

American and Russian Sign Language Dactyl Recognition

Makarov I., Nikolay Veldyaykin, Maxim Chertkov et al., , in: Proceedings of the 12th ACM International Conference on PErvasive Technologies Related to Assistive Environments (PETRA '19). NY: ACM, 2019. P. 204–210.

Sign languages are the main way for people from deaf community to communicate with other people. In this paper, we have compared several real-time sign language dactyl recognition systems using deep convolutional neural networks. Our system is able to recognize words from natural language gestured using signs for each letter. We evaluate our approach on ...

Added: July 10, 2021

Word Order within the Nominal Domain in Russian Sign Language

Anna G. Klezovich, Kirill A. Aksenov, / NRU HSE. Series WP BRP "Linguistics". 2018. No. 72.

This work aims at investigating the word order within the nominal domain in Russian Sign Language (RSL) with respect to Universal 20 and a hierarchy of adjectives. Universal 20 proposed by Greenberg (1963) postulates that there are three possibilities of the word order in the noun phrase, namely Demonstrative > Numeral > Adjective > Noun, ...

Added: December 9, 2018

Deep probabilistic human pose estimation

Petrov I., Shakhuro V., Konushin A., IET Computer Vision 2018 Vol. 12 No. 5 P. 578–585

The authors consider the problem of human pose estimation using probabilistic convolutional neural networks. They explore ways to improve human pose estimation accuracy on standard pose estimation benchmarks MPII human pose and Leeds Sports Pose (LSP) datasets using frameworks for probabilistic deep learning. Such frameworks transform deterministic neural network into a probabilistic one and allow ...

Added: March 14, 2018

Negative Concord in Russian Sign Language

Kuhn J., Lena Pasalskaya, Natural Language and Linguistic Theory 2020

In natural language, negative concord (NC) describes a pattern in which a negative marking appears on multiple morphological items but a single negation is interpreted. For sign languages, Pfau (2016) argues that negative non-manuals can be seen as instances of negative concord; in RSL, for example, headshake can only appear in negative sentences. Nevertheless, manual ...

Added: November 2, 2019

Negative concord in Russian Sign Language (RSL)

Kuhn J., Lena Pasalskaya, , in: Theoretical Issues in Sign Language Research Conference: Conference Handbook. [б.и.], 2019. P. 300–302.

Added: October 31, 2019

Depth Map Interpolation using Perceptual Loss

Makarov I., Vladimir Aliev, Gerasimova Olga et al., , in: Adjunct Proceedings of 2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct). NY: IEEE, 2017. P. 93–94.

In this paper, we discuss a semi-dense depth map interpolation method based on convolutional neural network. We propose a compact neural network architecture with loss function defined as Euclidean distance in the feature space of VGG-16 neural network used for deep visual recognition. The suggested solution shows state-of-art performance on synthetic and real datasets. Together ...

Added: August 5, 2017

Exploring Networks of Lexical Variation in Russian Sign Language

Kimmelman V., Komarova A., Luchkova L. et al., Frontiers in Psychology 2022 Vol. 12 Article 740734

When describing variation at the lexical level in sign languages, researchers often distinguish between phonological and lexical variants, using the following principle: if two signs differ in only one of the major phonological components (handshape, orientation, movement, location), then they are considered phonological variants, otherwise they are considered separate lexemes. We demonstrate that this principle ...

Added: September 7, 2023

Fast Depth Map Super-Resolution Using Deep Neural Network

Alisa Korinevskaya, Makarov I., , in: Proceedings of IEEE International Symposium on Mixed and Augmented Reality (ISMAR'18). NY: IEEE, 2019. P. 117–122.

Depth map super-resolution is a challenging computer vision problem. In this paper, we present two deep convolutional neural networks solving the problem of single depth map super-resolution. Both networks learn residual decomposition and trained with specific perceptual loss improving sharpness and perceptive quality of the upsampled depth map. Several experiments on various depth super-resolution benchmark ...

Added: July 29, 2019

Sequential Analysis with Specified Confidence Level and Adaptive Convolutional Neural Networks in Image Recognition

Savchenko A., , in: Proceedings of International Joint Conference on Neural Networks 2020 (IJCNN 2020). Piscataway: IEEE, 2020. P. 1–8.

In this paper the problem of high computational complexity of deep convolutional nets in image recognition is considered. An existing framework of adaptive neural networks is extended by appending the separate classifier to intermediate layers. The hierarchical representations of the input image are sequentially analyzed. If the first classifier returns rather high confidence score, the ...

Added: October 15, 2020

Извлечение предпочтений пользователя на основе методов автоматического порождения текстовых описаний изображений фотоальбома

Kharchevnikova A., Savchenko A., Компьютерная оптика 2020 Т. 44 № 4 С. 618–626

В работе рассматривается задача извлечения предпочтений пользователя по его фотоальбому. Предложен новый подход на основе автоматического порождения текстовых описаний фотографий и последующей классификации таких описаний. Проведен анализ известных методов создания аннотаций по изображению на основе свёрточных и рекуррентных (Long short-term memory) нейронных сетей. С использованием набора данных Google’s Conceptual Captions обучены новые модели, в которых ...

Added: September 16, 2020

Жестовый язык в онлайн-общении глухих

Bolshakov N., Макаркин М., Меренкова В. et al., В кн.: Альманах «Исследуя сообщество глухих: 1». М.: V–A–С Press, 2024. С. 235–254.

Studies in various countries have addressed the social phenomena of d/Deaf users socialising online: in online spaces, d/Deaf users are able to hide their deafness, establish “weak ties” online (with people they don’t know personally), and acquire access to more information. In Russia, such studies have revealed social networks and messengers to be a very ...

Added: June 6, 2024

On the generalization ability of data-driven models in the problem of total cloud cover retrieval

Krinitskiy M., Alexandrova M., Verezemskaya P. et al., Remote Sensing 2021 Vol. 13 No. 2 Article 326

Total Cloud Cover (TCC) retrieval from ground-based optical imagery is a problem that has been tackled by several generations of researchers. The number of human-designed algorithms for the estimation of TCC grows every year. However, there has been no considerable progress in terms of quality, mostly due to the lack of systematic approach to the ...

Added: September 24, 2021

Личные местоимения в жестовых языках: часть языка или жестикуляция?

Aksenov K., Научно-техническая информация. Серия 2: Информационные процессы и системы 2021

This article is an overview of different approaches to personal pronouns in sign languages (SLs). Namely, I concentrate on similarities and differences between SLs pronouns and pointing gestures in spoken languages. They have similar configurations and their orientation depends on the position of their referents. However, SLs pronouns are more conventionalized, as compared to pointing ...

Added: November 28, 2020

Gradable Predicates in Russian Sign Language

Aksenov K., / NRU HSE. Series WP BRP "Linguistics". 2019. No. 90.

This paper aims at describing the syntactic and semantic properties of gradable predicates in Russian Sign Language (RSL). Property signs in RSL, such as big or beautiful, generally behave similarly to stative predicates. However, their compatibility with the degree modifiers and aspectual markers shows that they significantly differ from other stative verbs. Thus, they can ...

Added: December 13, 2019

Multi-label Image Set Recognition in Visually-Aware Recommender Systems

Demochkin K., Savchenko A., , in: Analysis of Images, Social Networks and Texts. 8th International Conference, AIST 2019, Lecture Notes in Computer Science, Revised Selected PapersVol. 11832. Cham: Springer, 2019. Ch. 26 P. 291–297.

In this paper we focus on the problem of multi-label image recognition for visually-aware recommender systems. We propose a two stage approach in which a deep convolutional neural network is firstly fine-tuned on a part of the training set. Secondly, an attention-based aggregation network is trained to compute the weighted average of visual features in ...

Added: December 22, 2019

Automatic detection of natural phonological classes in Russian Sign Language

Moroz G., Plaskovitskaya A. A., Rudnev P., / NRU HSE. Series WP BRP "Linguistics". 2018. No. 74.

The present paper applies Multiple Correspondence Analysis to test the validity of an existing theoretical model of the phonological system of Russian Sign Language (RSL). We show that comparing the importance of phonological features using ratio plots and MCA is a promising way of revealing non-binary oppositions in phonological systems of human languages irrespective of ...

Added: December 3, 2018

Linearization constraints on sentential negation in Russian Sign Language are prosodic

Rudnev P., Anna Kuznetsova, Sign Language & Linguistics 2021 Vol. 24 No. 2 P. 259–273

This short remark documents exceptions to the main strategy of expressing sentential negation in Russian Sign Language (RSL). The postverbal sentential negation particle in RSL inverts the basic SVO order characteristic of the language turning it into SOV (Pasalskaya 2018a). We show that this reversal requirement under negation is not absolute and does not apply ...

Added: April 14, 2020

Распознавание пола и возраста по видеоизображению лица на основе сверточных нейронных сетей

Kharchevnikova A., Savchenko A., В кн.: Материалы XXIII международной научно-технической конференции «Информационные системы и технологии-2017». [б.и.], 2017. С. 864–869.

Рассматривается задача построения интеллектуальных систем контекстной рекламы с автоматической настройкой на потенциальные предпочтения пользователя. Выполнен аналитический обзор современных публикаций, посвященных распознаванию пола и возраста по видеоизображению лица, в том числе на основе глубоких сверточных нейронных сетей. Проведен сравнительный анализ способов агрегации решений, полученных при распознавании каждого видеокадра. Приведены результаты экспериментального исследования их точности и быстродействия. ...

Added: October 24, 2017

Organizing Multimedia Data in Video Surveillance Systems Based on Face Verification with Convolutional Neural Networks

Anastasiia D. Sokolova, Angelina S. Kharchevnikova, Savchenko A., , in: Analysis of Images, Social Networks and Texts. 6th International Conference, 2017, Revised Selected PapersVol. 10716. Cham: Springer, 2018. P. 223–230.

In this paper we propose the two-stage approach of organizing information in video surveillance systems. At first, the faces are detected in each frame and a video stream is split into sequences of frames with face region of one person. Secondly, these sequences (tracks) that contain identical faces are grouped using face verification algorithms and ...

Added: May 2, 2018

Что значит быть глухим?

Bolshakov N., Колесников В. В., В кн.: Альманах «Исследуя сообщество глухих: 1». М.: V–A–С Press, 2024. С. 14–26.

Современные исследования, посвященные глухим и слабослыша- щим людям, а также глухоте в целом, в западных странах принято рассматривать в рамках такого междисциплинарного направле- ния, как Deaf Studies, появившегося во второй половине XX века. На русский язык перевести название этого направления наиболее корректно можно как «исследования сообщества глухих» в широком смысле: охватываются не только глухие и слабослышащие ...

Added: June 6, 2024

Event Recognition with Automatic Album Detection based on Sequential Grouping of Confidence Scores and Neural Attention

Savchenko A., , in: Proceedings of International Joint Conference on Neural Networks 2020 (IJCNN 2020). Piscataway: IEEE, 2020. P. 1–8.

In this paper a new formulation of event recognition task is examined: it is required to predict event categories given a gallery of images, for which albums (groups of photos corresponding to a single event) are unknown. The novel two-stage approach is proposed. At first, features are extracted in each photo using the pre-trained convolutional ...

Added: October 15, 2020

Event Recognition Based on Classification of Generated Image Captions

Savchenko A., Miasnikov E., , in: Advances in Intelligent Data Analysis XVIII (IDA 2020)Vol. 12080. Cham: Springer, 2020. Ch. 33 P. 418–430.

In this paper, we consider the problem of event recognition on single images. In contrast to conventional fine-tuning of convolutional neural networks (CNN), we proposed to use image captioning, i.e., a generative model that converts images to textual descriptions. The motivation here is the possibility to combine conventional CNNs with a completely different approach in ...

Added: May 17, 2020

Detection and Recognition of Food in Photo Galleries for Analysis of User Preferences

Miasnikov E., Savchenko A., , in: Proceedings of International Conference on Image Analysis and Recognition (ICIAR 2020)Vol. 12131. Cham: Springer, 2020. Ch. 9 P. 83–94.

Food analysis is one of the most important parts of user preference prediction engines for recommendation systems in the travel domain. In this paper, we describe and study the neural network method that allows you to recognize food in a gallery of photos taken with mobile devices. The described method consists of three main stages, ...

Added: October 1, 2020