Methods of deterministic and randomized entropy projections for dimensionality reduction of a data matrix

Ю. С. Попков; А. Ю. Попков; Ю. А. Дубнов

doi:10.14357/19922264200407

Информатика и ее применения. 2020. Vol. 14. No. 4. P. 47-54.

Popkov Y., Popkov A., Dubnov Y. A.

The paper develops deterministic and randomized projection methods for dimensionality reduction problems. In the deterministic case, the authors develop a parallel reduction procedure that minimizes the Kullback-Leibler cross-entropy subject to a constraint on information capacity, using the gradient projection method. In the randomized case, the authors solve the problem of reducing the feature space. The idea of applying projection procedures to reduce the data matrix is implemented in the proposed method of randomized entropy projection, which relies on the principle of preserving the average distance between points in the high- and low-dimensional spaces. The problem reduces to finding a probability distribution that maximizes the Fermi entropy subject to a constraint on the average distance between points.
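The distance-preservation principle mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' method: the entropy-optimal distribution of the projector is replaced here by a fixed Gaussian distribution, which is an assumption made purely for illustration. The function names, the matrix sizes, and the scaling factor are hypothetical as well. The sketch projects a data matrix into a lower-dimensional space with a random projector and compares the average pairwise distance before and after.

```python
import math
import random


def mean_pairwise_distance(points):
    """Average Euclidean distance over all unordered pairs of rows."""
    n = len(points)
    total = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            total += math.dist(points[i], points[j])
    return total / (n * (n - 1) / 2)


def random_projector(n_features, n_reduced, rng):
    """Hypothetical projector with i.i.d. Gaussian entries.

    Assumption: a fixed N(0, 1/n_reduced) distribution stands in for the
    entropy-optimal distribution sought in the paper.
    """
    scale = 1.0 / math.sqrt(n_reduced)
    return [[rng.gauss(0.0, 1.0) * scale for _ in range(n_reduced)]
            for _ in range(n_features)]


def project(points, projector):
    """Multiply each row of `points` by the projector matrix."""
    n_reduced = len(projector[0])
    return [[sum(x * projector[k][j] for k, x in enumerate(row))
             for j in range(n_reduced)]
            for row in points]


rng = random.Random(0)
# Synthetic data matrix: 30 points with 200 features each.
data = [[rng.random() for _ in range(200)] for _ in range(30)]
reduced = project(data, random_projector(200, 50, rng))

d_high = mean_pairwise_distance(data)
d_low = mean_pairwise_distance(reduced)
print(f"mean distance, 200 features: {d_high:.3f}")
print(f"mean distance,  50 features: {d_low:.3f}")
```

With this scaling the two averages should be close in expectation. In the paper the distribution of the projector entries is itself the unknown, and the average-distance condition acts as the constraint of the entropy maximization problem.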

Research target: Mathematics

Priority areas: IT and mathematics

Keywords: cross-entropy, dimensionality reduction

Cross-entropy reduction of a data matrix with constraints on the information capacity of the projector matrices and on their norms

Popkov Y., Popkov A., Dubnov Y. A., Математическое моделирование 2020 Vol. 32 No. 9 P. 35-52

We develop a new method of dimensionality reduction based on direct and inverse projection of the data matrix and on computing projectors that minimize a cross-entropy functional. We introduce the concept of the information capacity of a matrix, which is used as a constraint in the optimal reduction problem. We compare the proposed method with known ones based ...

Added: October 31, 2020

On entropy criteria for feature selection in data analysis problems

Dubnov Y. A., Информационные технологии и вычислительные системы 2018 No. 2 P. 60-69

The paper considers the problem of reducing the dimension of the feature space used to describe objects in data analysis problems, using binary classification as an example. The article provides a detailed overview of existing approaches to this problem and proposes several modifications in which dimensionality reduction is treated as the problem of extracting the most relevant ...

Added: July 4, 2018

A feature selection method based on a probabilistic approach and cross-entropy, illustrated on an image recognition problem

Dubnov Y. A., Искусственный интеллект и принятие решений 2020 No. 2 P. 78-85

The paper considers the problem of feature selection in the classification problem. A method for selecting informative features based on a probabilistic approach and cross-entropy metrics is proposed. Several variants of the information criterion for selecting features for a binary classification problem are considered, as well as its generalization to the case of a multiclass ...

Added: October 31, 2020

Manifold Learning in Regression Tasks

Bernstein A., Kuleshov A. P., Yanovich Y., Lecture Notes in Computer Science 2015 Vol. 9047 P. 414-423

The paper presents a new geometrically motivated method for non-linear regression based on Manifold learning technique. The regression problem is to construct a predictive function which estimates an unknown smooth mapping f from q-dimensional inputs to m-dimensional outputs based on a training data set consisting of given ‘input-output’ pairs. The unknown mapping f determines q-dimensional ...

Added: August 30, 2015

CAPM-Like Model and the Special Form of the Utility Function

Дранев Юрий Яковлевич, Корпоративные финансы 2012 No. 1 P. 33-36

The variance and semivariance have been traditional measures of asset return volatility since Markowitz proposed the market portfolio theory. Well-known models for expected asset returns were developed under the assumption of mean-variance or mean-semivariance investor behavior. But numerous papers have argued against these models because of their unrealistic assumptions and controversial empirical evidence. More complicated models with ...

Added: November 16, 2012

Complete complexity dichotomy for 7-edge forbidden subgraphs in the edge coloring problem

Malyshev D., Journal of Applied and Industrial Mathematics (translation of the journals "Сибирский журнал индустриальной математики" and "Дискретный анализ и исследование операций") 2020 Vol. 14 No. 4 P. 706-721

The edge coloring problem for a graph is to minimize the number of colors that are sufficient to color all edges of the graph so that all adjacent edges receive distinct colors. The computational complexity of the problem is known for all graph classes defined by forbidden subgraphs with at most 6 edges. We improve ...

Added: January 30, 2021

Proceedings 10th International Conference on Terminology and Artificial Intelligence TIA 2013

P. : Université Paris 13 - Paris Sorbonne Cité, 2013

In this workshop we will bring together participants who have solutions for one or more of the following problems: How can mutual understanding be optimized with the help of technology in hospitals where both patients and professionals have varying language skills, cultural backgrounds and cognitive capacities? Can domain ontologies, natural language processing tools, multilingual knowledge-based ...

Added: December 18, 2014

Development of Maslov’s Approach to the Construction of Nonoscillating WKB-Type Solutions

Danilov V., Rakhel M., Russian Journal of Mathematical Physics 2021 Vol. 28 No. 2 P. 179-187

In this paper, we show how to construct an asymptotic representation of the fundamental solution to the Cauchy problem for degenerate linear parabolic equations. ...

Added: June 6, 2021

The complexity of the 3-colorability problem in the absence of a pair of small forbidden induced subgraphs

Malyshev D., Discrete Mathematics 2015 Vol. 338 No. 11 P. 1860-1865

We completely determine the complexity status of the 3-colorability problem for hereditary graph classes defined by two forbidden induced subgraphs with at most five vertices. ...

Added: April 7, 2014

Effect of the permeability of the Casparian bands to water and solutes on root pressure: mathematical modelling

Logvenkov S. A., Штейн А. А., Российский журнал биомеханики 2013 Vol. 17 No. 4 P. 47-57

Mathematical modelling is used to study the effect of the permeability of the Casparian bands to water and solutes on the formation of root pressure. It is shown that the pressure in the xylem vessels which stops the flow across a root cut (root pressure) decreases with increase in the permeability of the ...

Added: January 30, 2014

Normal approximation and smoothness for sums of means of lattice-valued random variables

Decrouez G. G., Hall P., Bernoulli: a journal of mathematical statistics and probability 2013 Vol. 19 No. 4 P. 1268-1293

Motivated by a problem arising when analysing data from quarantine searches, we explore properties of distributions of sums of independent means of independent lattice-valued random variables. The aim is to determine the extent to which approximations to those sums require continuity corrections. We show that, in cases where there are only two different means, the ...

Added: September 29, 2014

Numerical simulation of alloy solidification under intense conjugate heat transfer

Marshirov V. V., Marshirova L. E., Сибирский журнал индустриальной математики 2013 Vol. XVI No. 4 P. 111-120

The paper considers the problem of determining the cooling rate of a metal during solidification as it crosses the liquidus temperature under intense heat removal from the surface. Solving this problem requires determining the process conditions and the boundary and initial conditions under which it is possible to get new ...

Added: November 17, 2013

Agent-based modelling of interactions between air pollutants and greenery using a case study of Yerevan, Armenia

Akopov A. S., Beklaryan L. A., Saghatelyan A. K., Environmental Modelling and Software 2019 Vol. 116 P. 7-25

Urban greenery such as trees can effectively reduce air pollution in a natural and eco-friendly way. However, how to spatially locate and arrange greenery in an optimal way remains a challenging task. We developed an agent-based model of air pollution dynamics to support the optimal allocation and configuration of tree clusters in a city. The Pareto ...

Added: February 24, 2019

Classes of planar graphs with a polynomially solvable independent set problem

Malyshev D., Alekseev V., Дискретный анализ и исследование операций 2008 Vol. 15 No. 1 P. 3-10

We prove that the independent set problem is polynomially solvable for an infinite family of subsets of the class of planar graphs. ...

Added: August 31, 2012

Fast reconfiguration of high frequency brain networks in response to surprising changes in auditory input

Nicol R., Chapman S., Vertes P. et al., Journal of Neurophysiology (USA) 2012 Vol. 107 No. 5 P. 1421-1430

How do human brain networks react to dynamic changes in the sensory environment? We measured rapid changes in brain network organization in response to brief, discrete, salient auditory stimuli. We estimated network topology and distance parameters in the immediate central response period, <1 s following auditory presentation of standard tones interspersed with occasional deviant tones ...

Added: October 23, 2014

Hardness of Approximation for H-free Edge Modification Problems

Bliznets Ivan, Cygan M., Komosa P. et al., ACM Transactions on Computation Theory 2018 Vol. 10 No. 2 P. 1-32

The H-free Edge Deletion problem asks, for a given graph G and integer k, whether it is possible to delete at most k edges from G to make it H-free, that is, not containing H as an induced subgraph. The H-free Edge Completion problem is defined similarly, but we add edges instead of deleting them. These two problem families have recently been studied intensively from the point of ...

Added: October 30, 2018

On some slowly terminating term rewriting systems

Beklemishev L. D., Оноприенко А. А., Математический сборник 2015 Vol. 206 No. 9 P. 3-20

We formulate some term rewriting systems in which the number of computation steps is finite for each output, but this number cannot be bounded by a provably total computable function in Peano arithmetic PA. Thus, the termination of such systems is unprovable in PA. These systems are derived from an independent combinatorial result known as the Worm ...

Added: March 13, 2016

Fifth International Conference «System Analysis and Information Technologies» SAIT-2013 (September 19–25, 2013, Krasnoyarsk, Russia): Conference Proceedings. In 2 vols.

Krasnoyarsk : ИВМ СО РАН, 2013

Proceedings of the Fifth International Conference «System Analysis and Information Technologies» SAIT-2013 (September 19–25, 2013, Krasnoyarsk, Russia): ...

Added: November 18, 2013

Improving the teaching of mathematical disciplines on the basis of the invariants required for teaching the «Econometrics» course to bachelor's students in economics

Kotelnikova M. V., Aistov A., Вестник Нижегородского университета им. Н.И. Лобачевского. Серия: Социальные науки 2019 Vol. 55 No. 3 P. 183-189

The article describes a method for improving the content of disciplines of the mathematical cycle by dividing them into invariant (general) and variable parts. The invariants were identified for such disciplines as «Linear algebra», «Mathematical analysis», and «Probability theory and mathematical statistics», taught to bachelor's students of economics at several universities. Based on ...

Added: January 28, 2020

Particle Simulation for Predicting Effective Properties of Short Fiber Reinforced Composites

Skoptsov K. A., Sheshenin S., Galatenko V. V. et al., International Journal of Applied Mechanics 2016 Vol. 8 No. 2 P. 1650016-01-1650016-18

We present a method for evaluating elastic properties of a composite material produced by molding a resin filled with short elastic fibers. A flow of the filled resin is simulated numerically using a mesh-free method. After that, assuming that spatial distribution and orientation of fibers are not significantly changed during polymerization, effective elastic moduli of ...

Added: May 22, 2016

Time series models for border inspection data

Decrouez G. G., Robinson A., Risk Analysis: An International Journal 2013 Vol. 33 No. 12 P. 2142-2153

We propose a new modeling approach for inspection data that provides a more useful interpretation of the patterns of detections of invasive pests, using cargo inspection as a motivating example. Methods that are currently in use generally classify shipments according to their likelihood of carrying biosecurity risk material, given available historical and contextual data. Ideally, ...

Added: September 29, 2014

Algorithms and methods for solving scheduling problems and other extremum problems on large-scale graphs

Chernyshev S. V., Cherepanov E. A., Pankratiev E. V. et al., Journal of Mathematical Sciences 2005 Vol. 128 No. 6 P. 3487-3495

Added: January 27, 2014

Complex forecasting scheme for surface meteorological values

Bagrov A. N., Gordin V. A., Bykov P. L., Russian Meteorology and Hydrology 2014 No. 5 P. 283-291

Evaluations of the forecasts of surface air temperature and precipitation for the period July 2010 - June 2013 are presented. Forecasts of surface air temperature at 5 days and of precipitation at 3 days are considered. Our complex statistical scheme uses the results of the best foreign global schemes and of the regional scheme COSMO-RU7. The joint ...

Added: December 7, 2013

On one-dimensional projections of polytopes of discrete optimization problems

Vyalyi M., Дискретная математика 1991 Vol. 3 No. 3 P. 35-45

Added: October 17, 2014