Classification of a Sequence of Objects with the Fuzzy Decoding Method

Savchenko A., Savchenko L. V.

Lecture Notes in Artificial Intelligence. 2014. Vol. 8536. P. 309–318.

The problem of recognizing a sequence of objects (e.g., video-based image recognition, phoneme recognition) is explored. A generalization of the fuzzy phonetic decoding method is proposed under the assumption that the distribution of the classified object is of exponential type. In the preliminary phase, each model object is associated with a fuzzy set of model classes whose grades of membership are the confusion probabilities estimated with the Kullback-Leibler divergence between model distributions. At first, each object (e.g., frame) in the classified sequence is put in correspondence with a fuzzy set whose grades are the posterior probabilities. Next, this fuzzy set is intersected with the fuzzy set corresponding to the nearest neighbor. Finally, the arithmetic mean of these fuzzy intersections is used to make the decision for the whole sequence. In this paper we propose not to restrict the method to the Kullback-Leibler discrimination and to estimate the grades of membership of models and query objects from an arbitrary distance with an appropriate scale factor. Experimental results are presented for the recognition of isolated Russian vowel phonemes and words with state-of-the-art similarity measures. It is shown that the correct choice of the scale parameter can significantly increase the recognition accuracy.
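The steps named in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the function names, the toy distances, the form exp(-scale * distance) for the grades of membership, and the use of the element-wise minimum as the fuzzy intersection are all hypothetical choices made for the example.

```python
import numpy as np


def membership_grades(distances, scale):
    """Map distances to grades of membership that sum to 1 along the last axis.

    Assumption: grades are proportional to exp(-scale * distance).
    """
    d = np.asarray(distances, dtype=float)
    # Subtract the row minimum before exponentiating for numerical stability.
    weights = np.exp(-scale * (d - d.min(axis=-1, keepdims=True)))
    return weights / weights.sum(axis=-1, keepdims=True)


def classify_sequence(frame_dists, model_dists, scale=1.0):
    """Classify a sequence of T frames against R model classes.

    frame_dists: (T, R) distances from each frame to each model.
    model_dists: (R, R) distances between models.
    Returns the index of the winning class and the averaged fuzzy set.
    """
    frame_dists = np.asarray(frame_dists, dtype=float)
    model_dists = np.asarray(model_dists, dtype=float)

    # Preliminary phase: fuzzy set of classes for every model.
    model_sets = membership_grades(model_dists, scale)
    # Fuzzy set of classes for every frame of the sequence.
    frame_sets = membership_grades(frame_dists, scale)
    # Nearest-neighbor model for every frame.
    nearest = frame_dists.argmin(axis=1)
    # Fuzzy intersection (assumed here: element-wise minimum).
    intersections = np.minimum(frame_sets, model_sets[nearest])
    # Arithmetic mean of the intersections gives the sequence-level fuzzy set.
    sequence_set = intersections.mean(axis=0)
    return int(sequence_set.argmax()), sequence_set


# Toy data (hypothetical): 3 model classes, 3 frames.
model_dists = [[0.0, 1.0, 4.0],
               [1.0, 0.0, 4.0],
               [4.0, 4.0, 0.0]]
frame_dists = [[0.2, 0.9, 3.0],
               [0.8, 0.3, 3.5],
               [0.1, 1.2, 4.0]]

label, grades = classify_sequence(frame_dists, model_dists, scale=2.0)
print(label, grades)
```

In this sketch, `scale` plays the role of the scale factor mentioned in the abstract: a larger value makes the fuzzy sets sharper (closer to a hard nearest-neighbor decision), while a smaller value spreads the grades over more classes.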

Research target: Computer Science

Priority areas: IT and mathematics

Language: English

Keywords: classification; Kullback-Leibler information discrimination; fuzzy sets; speech recognition; sequence of objects; fuzzy decoding method

Pronunciation training system based on convolutional neural networks and the information theory of speech perception

Savchenko L., Информационные технологии 2019 Vol. 25 No. 5 P. 313–318

We consider the problem of computer-assisted language and pronunciation learning based on deep learning methods and the information theory of speech perception. In order to improve the efficiency of pronunciation quality testing, we propose to train a convolutional neural network using the best reference utterances from the user. The experimental results proved ...

Added: May 29, 2019

Phonetic encoding method in the isolated words recognition problem

Savchenko A., Journal of Communications Technology and Electronics 2014 Vol. 59 No. 4 P. 339–345

A phonetic approach to the problem of automatic recognition of isolated words is investigated. The phonetic encoding method whereby each word from a vocabulary is associated with the code sequence of stable phonemes is proposed. The information-theoretical estimate of vocabulary confusability, the calculations of which rely on the phonetic database of a speaker and the communications channel ...

Added: April 8, 2014

A feature selection method based on a probabilistic approach and cross-entropy, illustrated on an image recognition problem

Dubnov Y. A., Искусственный интеллект и принятие решений 2020 No. 2 P. 78–85

The paper considers the problem of feature selection in the classification problem. A method for selecting informative features based on a probabilistic approach and cross-entropy metrics is proposed. Several variants of the information criterion for selecting features for a binary classification problem are considered, as well as its generalization to the case of a multiclass ...

Added: October 31, 2020

Sequential Three-Way Decisions in Efficient Classification of Piecewise Stationary Speech Signals

Savchenko A., Lecture Notes in Artificial Intelligence 2017 Vol. 10314 P. 264–277

This paper proposes to improve the performance of automatic speech recognition by using sequential three-way decisions. At first, the largest piecewise quasi-stationary segments are detected in the speech signal. Every segment is classified using the maximum a posteriori (MAP) method implemented with the Kullback-Leibler minimum information discrimination principle. The three-way decisions are taken ...

Added: June 27, 2017

Fairness of Machine Learning Algorithms in Demography

Emmanuel I. C., Mitrofanova E. / Series 4064475 "ArXiv Preprint". 2022.

The paper is devoted to the study of the model fairness and process fairness of the Russian demographic dataset by making predictions of divorce of the 1st marriage, religiosity, 1st employment and completion of education. Our goal was to make classifiers more equitable by reducing their reliance on sensitive features while increasing or at least ...

Added: May 31, 2022

Bayesian identification of the parameters of a mixture of normal distributions

Dubnov Y. A., Bulychev A., Информационные технологии и вычислительные системы 2017 No. 1 P. 101–111

We consider the problem of parameter estimation for Gaussian mixture models, which are widely used in data analysis and unsupervised machine learning. A new model identification method based on the Bayesian approach and the principle of maximum posterior distribution is proposed. In the article we describe a method for finding the maximum of a multi-extremum density function using sampling by Metropolis-Hastings ...

Added: December 27, 2017

On a way to improve the computational efficiency of the probabilistic neural network in a pattern recognition problem based on projection estimates

Savchenko A., Информационные системы и технологии 2015 No. 4(90) P. 28–38

The problem of insufficient computational efficiency of the probabilistic neural network (PNN) in pattern recognition tasks is considered for the case when the database contains only a small number of reference samples per class. A new modification of the PNN is synthesized on the basis of projection estimates of the probability density with the Fejér kernel and the naive assumption that the features of the classified object are independent. It is shown experimentally that the proposed classifier is somewhat more accurate and much ...

Added: October 8, 2015

Concept Learning from Triadic Data

Zhuk R., Ignatov D. I., Konstantinova N., Procedia Computer Science 2014 Vol. 31 P. 928–938

We propose extensions of the classical JSM-method and the Naïve Bayesian classifier for the case of triadic relational data. We performed a series of experiments on various types of data (both real and synthetic) to estimate quality of classification techniques and compare them with other classification algorithms that generate hypotheses, e.g. ID3 and Random ...

Added: June 9, 2014

Mathematical and software support for bases of expert knowledge acquired during incident resolution in information systems.

Karasev A. A., Starykh V., Вестник МГТУ МИРЭА 2014 Vol. 4 No. 5 P. 113–121

This article is devoted to the problem of collecting and classifying the expert knowledge acquired during the operation of information systems, with the aim of organizing expert knowledge bases. The use of ontological theory is proposed as a promising approach; it provides a basis for building a mathematical model of knowledge representation. Organization of objects hierarchy is ...

Added: February 10, 2015

Pronunciation quality assessment based on the fuzzy phonetic encoding method

Savchenko L., Телекоммуникации 2017 No. 5 P. 42–48

The paper is devoted to the automatic assessment of phoneme pronunciation quality for computer-assisted language learning systems. A novel pronunciation training algorithm is proposed. In this algorithm, at first, a student has to achieve a stable pronunciation of all sounds by using the phonetic database from an ideal speaker. The second, novel, stage of our ...

Added: October 24, 2018

Advances in Information Retrieval

Kuznetsov S., Serdyukov P., Segalovich I. et al., L.: Springer, 2013.

Higher School of Economics (HSE) and supported by the Information Retrieval Specialist Group at the British Computer Society (BCS–IRSG). The conference was held during March 24–27, 2013, in Moscow, Russia – the easternmost location in the history of the ECIR series. ECIR 2013 received a total of 287 submissions in three categories: 191 full papers, ...

Added: April 15, 2013

On entropy-based feature selection criteria in data analysis problems

Dubnov Y. A., Информационные технологии и вычислительные системы 2018 No. 2 P. 60–69

The paper considers the problem of reducing the dimension of the feature space for describing objects in data analysis problems using the example of binary classification. The article provides a detailed overview of existing approaches to solving this problem and proposes several modifications in which dimensionality reduction is considered as the problem of extracting the most relevant ...

Added: July 4, 2018

Operating algorithm of a software implementation of the Wiener filter

Кузнецов Д. С., Естественные и технические науки 2009 No. 4 P. 365–369

This article considers the Wiener filter as a method for improving the performance of speech recognition systems. Possible modifications of the Wiener filter that increase the degree of noise suppression are described. The operating algorithm of a software implementation of the classical Wiener filter and its modifications is considered. ...

Added: February 21, 2013

Automated real-time classification of psychological functional state based on discrete wavelet transform of EEG data

Galatenko V. V., Livshitz E., Podol’skii V. et al., International Journal of Applied Mathematics 2012 Vol. 25 No. 6 P. 871–882

A method for the automated real-time classification of psychological functional state is proposed. The classification is based on discrete wavelet transform of electroencephalographic data. The method consists of two preliminary stages — global feature selection and individual tuning, and the main stage — real-time classification. All stages are fully automated. The software implementation of this ...

Added: October 30, 2015

Cross-entropy reduction of a data matrix with constraints on the information capacity of the projector matrices and their norms

Popkov Y., Popkov A., Dubnov Y. A., Математическое моделирование 2020 Vol. 32 No. 9 P. 35–52

We develop a new method of dimensionality reduction based on direct and inverse projection of the data matrix and on calculating the projectors that minimize the cross-entropy functional. The concept of the information capacity of a matrix, which is used as a constraint in the optimal reduction problem, is introduced. We compare the proposed method with known ones based ...

Added: October 31, 2020

Application of Fisher kernels to the speaker identification problem

Gostev I. M., Ermilov A., Известия Юго-Западного государственного университета 2011 No. 2 P. 15–22

In this article we consider the application of Support Vector Machines with different types of kernels to the task of speaker identification. We use Fisher features for several types of channels (telephone, GSM, microphone). We analyze the dependence of accuracy on the length of the input sentence. ...

Added: January 31, 2014

Fuzzy Phonetic Decoding Method in a Phoneme Recognition Problem

Savchenko A., Savchenko L. V., Lecture Notes in Artificial Intelligence 2013 Vol. 7911 P. 176–183

The definition of a phoneme as a fuzzy set of minimal speech units from the model database is proposed. On the basis of this definition and the Kullback-Leibler minimum information discrimination principle, a novel phoneme recognition algorithm has been developed as an enhancement of the phonetic decoding method. The experimental results in the problems of ...

Added: June 16, 2013

An ensemble machine learning method based on classifier recommendation

Kashnitsky Y., Ignatov D. I., Интеллектуальные системы. Теория и приложения 2015 Vol. 19 No. 4 P. 37–55

The paper gives a brief introduction to multiple classifier systems and describes a particular algorithm which improves classification accuracy by recommending an algorithm for an object. This recommendation is done under a hypothesis that a classifier is likely to predict the label of the object correctly if it has correctly classified its ...

Added: December 7, 2015

Information resources for education: cataloguing, classification, ontology.

Башмаков А. И., Белоозеров В. Н., Starykh V., Информационные системы и технологии 2013 No. 6(80) P. 88–102

The article describes the construction of a formal ontology of the system of information resources for education. The ontology is meant to represent this domain in automated systems that create, register, organize, store, search, and use these resources in educational institutions of various levels. The system of information resources is set ...

Added: January 16, 2014

Entropy “2”-Soft Classification of Objects

Popkov Y., Dubnov Y. A., Volkovich Z. et al., Entropy 2017 Vol. 19(4) No. 178 P. 1–14

A new method for classifying objects of various nature, named “2”-soft classification, is proposed. It assigns objects to one of two types with optimal entropy probability for the available collection of learning data, taking additive errors in the data into account. A decision rule of randomized parameters and probability density function (PDF) is formed, ...

Added: May 26, 2017

Information Theoretic Analysis of Efficiency of the Phonetic Encoding–Decoding Method in Automatic Speech Recognition

Savchenko A., Savchenko V.V., Journal of Communications Technology and Electronics 2016 Vol. 61 No. 4 P. 430–435

A phonetic decoding method for words in automatic speech recognition is considered. The properties of Kullback–Leibler divergence are used to synthesize the estimation of the distribution of divergence between minimum speech units (e.g., single phonemes) inside a single class. It is demonstrated that the minimum variance of the intraphonemic divergence is reached when the phonetic ...

Added: April 11, 2016

A research project as a tool for teaching text analysis methods: predicting the class of a social network post

Suvorova A., Смирнова К. Р., Будин Е. А. et al., Компьютерные инструменты в образовании 2018 No. 3 P. 49–64

The article describes a student research project on predicting the class of a post on a social network based on its textual content. The features of the project are discussed as an integral part of the trajectory of teaching data analysis methods, including text analysis methods and tools that are often not included in machine ...

Added: January 28, 2019

Fuzzy and rough formal concept analysis: a survey

Poelmans J., Ignatov D. I., Kuznetsov S. et al., International Journal of General Systems 2014 Vol. 43 No. 2 P. 105–134

Formal Concept Analysis (FCA) is a mathematical technique that has been extensively applied to Boolean data in knowledge discovery, information retrieval, web mining, and other applications. In recent years, research on extending FCA theory to cope with imprecise and incomplete information has made significant progress. In this paper, we give a systematic overview of the ...

Added: June 9, 2014

Classification of brain activity using synolitic networks

Vlasenko D., Zaikin A., Zakharov D., Известия высших учебных заведений. Прикладная нелинейная динамика 2023 Vol. 31 No. 5 P. 661–669

Because the brain is an extremely complex hypernet of interacting macroscopic subnetworks, full-scale analysis of brain activity is a daunting task. Nevertheless, this task can be greatly simplified by analysing the correspondence between various patterns of macroscopic brain activity, for example, through functional magnetic resonance imaging (fMRI) scans, and the performance of particular cognitive tasks or pathological states. The purpose of ...

Added: October 4, 2023