Clustering and maximum likelihood search for efficient statistical classification with medium-sized databases

A. Savchenko

doi:10.1007/s11590-015-0948-6

Publications

?

Clustering and maximum likelihood search for efficient statistical classification with medium-sized databases

Optimization Letters. 2017. Vol. 11. No. 2. P. 329-341.

Savchenko A.

This paper addresses the problem of insufficient performance of statistical classification with the medium-sized database (thousands of classes). Each object is represented as a sequence of independent segments. Each segment is defined as a random sample of independent features with the distribution of multivariate exponential type. To increase the speed of the optimal Kullback-Leibler minimum information discrimination principle, we apply the clustering of the training set and an approximate nearest neighbor search of the input object in a set of cluster medoids. By using the asymptotic properties of the Kullback-Leibler divergence, we propose the maximal likelihood search procedure. In this method the medoid to check is selected from the cluster with the maximal joint density (likelihood) of the distances to the previously checked medoids. Experimental results in image recognition with artificially generated dataset and Essex facial database prove that the proposed approach is much more effective, than an exhaustive search and the known approximate nearest neighbor methods from FLANN and NonMetricSpace libraries.

Research target: Computer Science

Priority areas: IT and mathematics

Keywords: информационное рассогласование Кульбака-Лейблера pattern recognition approximate nearest neighbor classification приближенные методы ближайшего соседа 28.23.15 Распознавание образов. Обработка изображений Kullback-Leibler divergence

Publication based on the results of:

Методы классификации с направленным перебором альтернатив на основе вероятностной нейросетевой модели кусочно-однородных объектов (2015)

Maximum-likelihood approximate nearest neighbor method in real-time image recognition

Savchenko A., Pattern Recognition 2017 Vol. 61 P. 459-469

An exhaustive search of all classes in pattern recognition methods cannot be implemented in real-time, if the database contains a large number of classes. In this paper we introduce a novel probabilistic approximate nearest-neighbor (NN) method. Despite the most of known fast approximate NN algorithms, our method is not heuristic. The joint probabilistic densities (likelihoods) ...

Added: August 30, 2016

The Maximal Likelihood Enumeration Method for the Problem of Classifying Piecewise Regular Objects

Savchenko A., Automation and Remote Control 2016 Vol. 77 No. 3 P. 443-450

We study the recognition problem for composite objects based on a probabilistic model of a piecewise regular object with thousands of alternative classes. Using the model’s asymptotic properties, we develop a new maximal likelihood enumeration method which is optimal (in the sense of choosing the most likely reference for testing on every step) in the ...

Added: April 11, 2016

About variability of using of methods of the shape identification based on geometrical correlation

Gostev I. M., Advanced Materials Research 2014 Vol. 837 P. 381-386

Methods of identification of the form of objects based on the signature analysis and invariant to affine transformations are considered. It is shown as these methods it is possible to apply to surface quality assurance. Questions of sensitivity of these methods are considered. Dependences of these methods on noise are brought. ...

Added: November 28, 2013

Метод максимально правдоподобного перебора в задаче классификации кусочно-однородных объектов

Savchenko A., Автоматика и телемеханика 2016 № 3 С. 99-108

Исследуется задача распознавания составных объектов на основе вероятностной модели кусочно-однородного объекта при наличии тысяч альтернативных классов. Используя асимптотические свойства модели, разработан новый метод максимально правдоподобного перебора, который является оптимальными (в смысле выбора для проверки на каждом этапе максимально правдоподобного эталона) среди класса “жадных” алгоритмов приближенного поиска ближайшего соседа. Приведены результаты эксперимента в задаче распознавания лиц ...

Added: March 25, 2016

Оптико-электронные приборы и устройства в системах распознавания образов, обработки изображений и символьной информации. Распознавание - 2017. Сборник материалов XIII международной научно-технической конференции

Курск : Юго-Западный университет, 2017

Сборник содержит материалы XIII Международной конференции «Оптико-электронные приборы и устройства в системах распознавания образов, обработки изображений и символьной информации» (Курск, 16-19 мая 2017 г.), целью которой является ознакомление с имеющимися достижениями по созданию оптико-электронных приборов, систем и внедрению информационных технологий в научные исследования, учебный процесс и промышленность, а также координация по эффективному их применению в ...

Added: November 9, 2017

Статистическое распознавание образов на основе посегментного анализа однородности

Savchenko A., Машинное обучение и анализ данных 2015 Т. 1 № 11 С. 1500-1516

Исследуется проблема малых выборок в задаче статистического распознавания образов на основе методов ближайших соседей, точность которых во многом определяется выбранной мерой близости, при этом их реализация в режиме реального времени может оказаться невозможной уже при наличии тысяч классов. Для преодоления указанных проблем предложен новый подход к разработке классификаторов с посегментным анализом однородности и быстрой последовательной ...

Added: September 10, 2015

Statistical testing of segment homogeneity in classification of piecewise-regular objects

Savchenko A., Belova N. S., International Journal of Applied Mathematics and Computer Science 2015 Vol. 25 No. 4 P. 915-925

The paper is focused on the problem of multi-class classification of composite (piecewise-regular) objects (e.g., speech signals, complex images, etc.). We propose a mathematical model of composite object representation as a sequence of independent segments. Each segment is represented as a random sample of independent identically distributed feature vectors. Based on this model and statistical ...

Added: September 10, 2015

Decision Support in Intelligent Maintenance-planning Systems Based on Contextual Multi-armed Bandit Algorithm

Savchenko A., Milov V., Procedia Computer Science 2017 Vol. 103 P. 316-323

In this paper we focus on two essential problems of maintenance decision support systems, namely, 1) detection of potential dangerous situation, and 2) classification of this situation in order to recommend an appropriate repair action. The former task is usually solved with the known statistical process control techniques. The latter problem can be reduced to ...

Added: February 8, 2017

Deep neural networks and maximum likelihood search for approximate nearest neighbor in video-based image recognition

Savchenko A., Optical Memory and Neural Networks (Information Optics) 2017 Vol. 26 No. 2 P. 129-136

We analyzed the way to increase computational efficiency of video-based image recognition methods with matching of high dimensional feature vectors extracted by deep convolutional neural networks. We proposed an algorithm for approximate nearest neighbor search. At the first step, for a given video frame the algorithm verifies a reference image obtained when recognizing the previous ...

Added: June 30, 2017

Sequential Three-Way Decisions in Efficient Classification of Piecewise Stationary Speech Signals

Savchenko A., Lecture Notes in Artificial Intelligence 2017 Vol. 10314 P. 264-277

In this paper it is proposed to improve performance of the automatic speech recognition by using sequential three-way decisions. At first, the largest piecewise quasi-stationary segments are detected in the speech signal. Every segment is classified using the maximum a-posteriori (MAP) method implemented with the Kullback-Leibler minimum information discrimination principle. The three-way decisions are taken ...

Added: June 27, 2017

Maximum A Posteriori Estimation of Distances Between Deep Features in Still-to-Video Face Recognition

Savchenko A., Belova N. S., / Cornell University. Series "Working papers by Cornell University". 2017.

The paper deals with the still-to-video face recognition for the small sample size problem based on computation of distances between high-dimensional deep bottleneck features. We present the novel statistical recognition method, in which the still-to-video recognition task is casted into Maximum A Posteriori estimation. In this method we maximize the joint probabilistic density of the ...

Added: August 29, 2017

Proceedings of the 24th International Conference on Pattern Recognition (ICPR)

IEEE, 2018

Added: December 2, 2018

Об одном способе повышения вычислительной эффективности вероятностной нейронной сети в задаче распознавания образов на основе проекционных оценок

Savchenko A., Информационные системы и технологии 2015 № 4(90) С. 28-38

Рассмотрена проблема недостаточной вычислительной эффективности вероятностной нейронной сети (ВНС) в задачах распознавания образов при наличии в базе данных для каждого класса небольшого числа эталонов. На основе проекционных оценок плотности распределения с ядром Фейера и наивного предположения о независимости признаков классифицируемого объекта синтезирована новая модификация ВНС. Экспериментально показано, что предложенный классификатор оказался несколько точнее и намного ...

Added: October 8, 2015

Система постановки произношения на основе сверточных нейронных сетей и информационной теории восприятия речи

Savchenko L., Информационные технологии 2019 Т. 25 № 5 С. 313-318

We consider a problem of computer assisted language and pronunciation learning based on the deep learning methods and the information theory of speech perception. In order to improve the efficiency of testing of pronunciation quality, we propose to train a convolutional neural network using the best reference utterances from the user. The experimental results proved ...

Added: May 29, 2019

Braverman Readings in Machine Learning. Key Ideas from Inception to Current State

Heidelberg : Springer Publishing Company, 2018

This state-of-the-art survey is dedicated to the memory of Emmanuil Markovich Braverman (1931-1977), a pioneer in developing the machine learning theory. The 12 revised full papers and 4 short papers included in this volume were presented at the conference "Braverman Readings in Machine Learning: Key Ideas from Inception to Current State" held in Boston, MA, USA, in ...

Added: September 11, 2018

О применении Фишеровских ядер в задаче рас-познавания диктора

Ermilov A., Известия Юго-Западного государственного университета 2011 № 2 С. 15-20

In this article we consider application of Support Vector Machines with different types of kernels to the task of speaker identification. We use Fisher features for several types of channels (telephone, GSM, microphone). We analyze dependence of accuracy from length of input sentence. ...

Added: January 18, 2014

HSE-NN Team at the 4th ABAW Competition: Multi-task Emotion Recognition and Learning from Synthetic Images

Savchenko A., / Cornell University. Series Computer Science "arxiv.org". 2022.

In this paper, we present the results of the HSE-NN team in the 4th competition on Affective Behavior Analysis in-the-wild (ABAW). The novel multi-task EfficientNet model is trained for simultaneous recognition of facial expressions and prediction of valence and arousal on static photos. The resulting MT-EmotiEffNet extracts visual features that are fed into simple feed-forward ...

Added: October 21, 2022

About one Model of Machines Remote Control on the Basis of Gaze Tracking

Gostev I. M., Sibirtseva Elene Alekseevna, Selected engineering problems 2013 P. 85-91

The methodology of control metal- or woodworking equipment based on the determination of the eyeball position and its movement direction. The developed hardware and software system for gaze direction tracking can be used as an alternative method of input, which is closer to the natural way of interaction with the environment, as well as the ...

Added: October 1, 2014

Сборник трудов V Международной конференции и молодёжной школы "Информационные технологии и нанотехнологии" (ИТНТ 2019)

[б.и.], 2019

Конференции ИТНТ-2019 проводится с целью предоставления возможности научных дискуссий и обсуждения результатов фундаментальных и прикладных исследований в области информационных технологий и нанотехнологий, привлечение молодежи в сферу передовых научных исследований, обмен опытом научно-образовательной деятельности при подготовке ИТНТ-специалистов.. Тематика Конференции ИТНТ-2019 охватывает широкий круг областей применения информационных технологий в науке и высокотехнологичных отраслях промышленности. Основными направлениями работы Конференции ИТНТ-2018 ...

Added: December 4, 2018

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2010)

San Francisco : IEEE, 2010

Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on ...

Added: October 18, 2017

Applied Data Analysis in Energy Monitoring System

Kychkin A.V., Mikriukov G. P., Проблемы региональной энергетики 2016 Vol. 2 No. 31 P. 84-92

Software and hardware system organization is presented as an example for building energy monitoring of multi-sectional lighting and climate control / conditioning needs. System key feature is applied office energy data analysis that allows to provide each type of hardware localized work mode recognition. It based on general energy consumption profile with following energy consumption ...

Added: November 21, 2017

Influence of Noise on the DTW Metric Value in Object Shape Recognition

Gostev I. M., Sevastianov L., RUDN Journal of Mathematics, Information Sciences and Physics 2018 Vol. 26 No. 4 P. 331-342

The paper sets out one of the methodologies on image processing and recognition of the form of graphic objects. In it, at the first stage preliminary processing of the image with the purpose of extracting of characteristic attributes of the form of objects is made. Contours of objects are used as such attributes. For transformation ...

Added: December 19, 2018

Об одном методе дифференцирования плоской дискретной кривой при обработке изображений

Gostev I. M., Севастьянов Л. А., Вестник Российского университета дружбы народов. Серия: Математика, информатика, физика 2016 № 4 С. 49-55

The problem of receiving points with high curvature (singular points) of contours for identification of the shape of objects on images is solved. Analysis of existing methods of numerical differentiation in the given aspect is held. The new method of differentiation of the flat discretely defined curves, which are dots (pixels) of circuits, based on ...

Added: February 17, 2017

Pattern recognition and increasing of the computational efficiency of a parallel realization of the probabilistic neural network with homogeneity testing

Savchenko A., Optical Memory and Neural Networks (Information Optics) 2013 Vol. 22 No. 3 P. 184-192

The research subject is the computational complexity of the probabilistic neural network (PNN) in the pattern recognition problem for large model databases. We examined the following methods of increasing the efficiency of a neuralnetwork classifier: a parallel multithread realization, reducing the PNN to a criterion with testing of homogeneity of feature histograms of input and ...

Added: September 10, 2013