Состоятельность оценки области определения алгоритмом спектральных вложений Грассмана-Штифеля

Ю. А. Янович

Publications

?

Состоятельность оценки области определения алгоритмом спектральных вложений Грассмана-Штифеля

С. 191–197.

Keywords: снижение размерности Manifold Learning Dimension reduction

In book

Сборник статей конференции "Информационные технологии и системы" (ИТиС'16)

М.: ИППИ РАН, 2016.

Locally isometric and conformal parameterization of image manifold

Bernstein A., Kuleshov A. P., Yanovich Y., , in: Proceedings of SPIE 9875, Eighth International Conference on Machine Vision (ICMV 2015). Barcelona: SPIE, 2015. Ch. 7 P. 1–7.

Images can be represented as vectors in a high-dimensional Image space with components specifying light intensities at image pixels. To avoid the ‘curse of dimensionality’, the original high-dimensional image data are transformed into their lower-dimensional features preserving certain subject-driven data properties. These properties can include ‘information-preserving’ when using the constructed low-dimensional features instead of original ...

Added: February 6, 2016

Non-Gaussian Component Analysis: New Ideas, New Proofs, New Applications

Panov V., / Series Discussion paper SFB 649 "Economic risk". 2010. No. 2010-026.

In this article, we present new ideas concerning Non-Gaussian Component Analysis (NGCA). We use the structural assumption that a high-dimensional random vector $\vX$ can be represented as a sum of two components - a low-dimensional signal $\vS$ and a noise component $\vN$. We show that this assumption enables us for a special representation for the ...

Added: September 3, 2015

Алгоритм Laplacian Eigenmaps для точек вне обучающей выборки

Вельдяйкин Николай, Yanovich Y., В кн.: Сборник статей конференции «Информационные технологии и системы» (ИТиС'17). ИППИ РАН, 2017. С. 74–80.

Методы снижения размерности позволяют заменить многомерные описания данных на их низкоразмерные аналоги почти без потери информации, что способно упростить построение моделей по ним в рамках машинного обучения. Как правило, программные реализации алгоритмов снижения размерности строят лишь низкоразмерные описания для точек из обучающих выборок. Однако для последующего решения задач классификации и регрессии важно уметь строить вложение ...

Added: July 10, 2021

Равномерное оценивание касательного к многообразию пространства

Yanovich Y., В кн.: Сборник статей конференции "Информационные технологии и системы" (ИТиС'13). М.: ИППИ РАН, 2013. С. 371–375.

Методы восстановления многообразий использу- ются для решения многомерных задач машинного обучения. В последние годы был разработан ряд под- ходов, таких как изометрическое отображение (Isomap), локально-линейное вложение (LLE), для решения данной задачи. Однако, эти методы рас- сматривали снижение размерности поточечно, не учитывая локальных свойств многообразия. Алго- ритмы выравнивания локальных тангенциальных пространств (LTSA) и спектральных вложений Грассмана-Штифеля ...

Added: September 24, 2015

Reconstruction of manifold embeddings into Euclidean spaces via intrinsic distances

Nikita Puchkin, Vladimir Spokoiny, Eugene Stepanov et al., ESAIM - Control, Optimisation and Calculus of Variations 2024 Vol. 30 Article 3

We consider the problem of reconstructing an embedding of a compact connected Riemannian manifold in a Euclidean space up to an almost isometry, given the information on intrinsic distances between points from its “sufficiently large” subset. This is one of the classical manifold learning problems. It happens that the most popular methods to deal with ...

Added: February 2, 2024

Alignment Of Vector Fields On Manifolds Via Contraction Mappings

Kachan O. N., Yanovich Y., Abramov E., Ученые записки Казанского университета. Серия: Физико-математические науки 2018 Vol. 160 No. 2 P. 300–308

According to the manifold hypothesis, high-dimensional data can be viewed and meaningfully represented as a lower-dimensional manifold embedded in a higher dimensional feature space. Manifold learning is a part of machine learning where an intrinsic data representation is uncovered based on the manifold hypothesis. Many manifold learning algorithms were developed. The one called Grassmann & Stiefel eigenmaps (GSE) ...

Added: October 29, 2020

Information preserving and locally isometric&conformal embedding via Tangent Manifold Learning

Bernstein A., Kuleshov A. P., Yanovich Y., , in: Data Science and Advanced Analytics (DSAA), 2015. 36678 2015. IEEE International Conference on. P.: IEEE, 2015. P. 1–10.

In many Data Analysis tasks, one deals with data that are presented in high-dimensional spaces. In practice original high-dimensional data are transformed into lower-dimensional representations (features) preserving certain subject-driven data properties such as distances or geodesic distances, angles, etc. Preserving as much as possible available information contained in the original high-dimensional data is also an ...

Added: January 14, 2016

Estimation Of Smooth Vector Fields On Manifolds By Optimization On Stiefel Group

Abramov E., Yanovich Y., Ученые записки Казанского университета. Серия: Физико-математические науки 2018 Vol. 160 No. 2 P. 220–228

Real data are usually characterized by high dimensionality. However, real data obtained from real sources, due to the presence of various dependencies between data points and limitations on their possible values, form, as a rule, form a small part of the high-dimensional space of observations. The most common model is based on the hypothesis that ...

Added: October 29, 2020

Генерация последовательностей случайных точек с заданной плотностью на многообразиях

Yanovich Y., Киселиус М., В кн.: Сборник статей конференции "Информационные технологии и системы" (ИТиС'15). М.: ИППИ РАН, 2015. С. 1036–1040.

В машинном обучении при построении регресси- онных зависимостей или решении задач класси- фикации многомерные описания объектов часто являются избыточными и функционально зави- симыми. Такие описания зачастую лежат око- ло многообразий существенно меньшей размерно- сти, чем размерность их первичной записи. Дан- ное предположение называется гипотезой много- образия (Manifold Hypothesis). Использование такой информации может помочь в решении ...

Added: September 24, 2015

Asymptotic Properties of Local Sampling on Manifold

Yanovich Yury Aleksandrovich, Journal of Mathematics and Statistics 2016 Vol. 12 No. 3 P. 157–175

In many applications, the real high-dimensional data occupy only a very small part in the high dimensional ‘observation space’ whose intrinsic dimension is small. The most popular model of such data is Manifold model which assumes that the data lie on or near an unknown manifold Data Manifold, (DM) of lower dimensionality embedded in an ...

Added: November 24, 2016

Estimation of the signal subspace without estimation of the inverse covariance matrix

Panov V., / Series Discussion paper SFB 649 "Economic risk". 2010. No. 2010-050.

Let a high-dimensional random vector $\vX$ be represented as a sum of two components - a signal $\vS$ that belongs to some low-dimensional linear subspace $\S$, and a noise component $\vN$. This paper presents a new approach for estimating the subspace $\S$ based on the ideas of the Non-Gaussian Component Analysis. Our approach avoids the ...

Added: September 3, 2015

Manifold Learning Based On Kernel Density Estimation

Kuleshov A. P., Bernstein A. V., Yanovich Y., Ученые записки Казанского университета. Серия: Физико-математические науки 2018 Vol. 160 No. 2 P. 327–338

The problem of unknown high-dimensional density estimation has been considered. It has been suggested that the support of its measure is a low-dimensional data manifold. This problem arises in many data mining tasks. The paper proposes a new geometrically motivated solution to the problem in the framework of manifold learning, including estimation of an unknown ...

Added: October 28, 2020

Structure-adaptive Manifold Estimation

Puchkin N., Spokoiny V., Journal of Machine Learning Research 2022 Vol. 23 No. 40 P. 1–62

We consider a problem of manifold estimation from noisy observations. Many manifold learning procedures locally approximate a manifold by a weighted average over a small neighborhood. However, in the presence of large noise, the assigned weights become so corrupted that the averaged estimate shows very poor performance. We suggest a structure-adaptive procedure, which simultaneously reconstructs ...

Added: February 3, 2022

Manifold Learning Based On Kernel Density Estimation

Added: October 28, 2020

User-controllable Multi-texture Synthesis with Generative Adversarial Networks

Alanov A., Kochurov M., Volkhonskiy D. et al., , in: Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISAPP 2020)Vol. 4. SciTePress, 2020. P. 214–221.

We propose a novel multi-texture synthesis model based on generative adversarial networks (GANs) with a user-controllable mechanism. The user control ability allows to explicitly specify the texture which should be generated by the model. This property follows from using an encoder part which learns a latent representation for each texture from the dataset. To ensure ...

Added: November 8, 2020

Сокращение размерности данных в задачах имитационного моделирования

Агалаков Ю. Г., Bernstein A., Информационные технологии и вычислительные системы 2012 № 3 С. 3–17

Рассматриваются задачи интеллектуального анализа данных, которые необходимо решать в технологии предсказательного моделирования. Для уменьшения сложности решения этих задач в технологии предсказательного моделирования используются решения задач снижения размерности, которые должны удовлетворять ряду дополнительных условий. В статье обсуждаются эти дополнительные требования и сформулированы соответствующие новые нетрадиционные постановки задач снижения размерности. ...

Added: January 24, 2013

Manifold Learning in Data Mining Tasks

Kuleshov A. P., Bernstein A., , in: Machine Learning and Data Mining in Pattern RecognitionVol. 8556. Springer, 2014. P. 119–133.

Many Data Mining tasks deal with data which are presented in high dimensional spaces, and the ‘curse of dimensionality’ phenomena is often an obstacle to the use of many methods for solving these tasks. To avoid these phenomena, various Representation learning algorithms are used as a first key step in solutions of these tasks to ...

Added: September 30, 2014

Minimax adaptive dimension reduction for regression

Paris Q., Journal of Multivariate Analysis 2014 Vol. 128 P. 182–202

In this paper, we address the problem of regression estimation in the context of a -dimensional predictor when is large. We propose a general model in which the regression function is a composite function. Our model consists in a nonlinear extension of the usual sufficient dimension reduction setting. The strategy followed for estimating the regression function is based on the estimation of ...

Added: December 20, 2014

Manifold Learning: Generalization Ability and Tangent Proximity

Bernstein A., Kuleshov A. P., International Journal of Software and Informatics 2013 No. 7(3) P. 359–390

One of the ultimate goals of Manifold Learning (ML) is to reconstruct an unknown nonlinear low-dimensional Data Manifold (DM) embedded in a high-dimensional observation space from a given set of data points sampled from the manifold. We derive asymptotic expansion and local lower and upper bounds for the maximum reconstruction error in a small neighborhood ...

Added: November 21, 2014

О применении метода главных компонент в задачах финансового мониторинга

Denisenko A., Крылов Г. О., Корнев И. А., Известия Самарского научного центра Российской академии наук 2015 Т. 17 № 2 С. 1131–1140

В работе представлены результаты анализа методом главных компонент данных по финансовым потокам компаний и аффилированных с ними лиц в отрасли российской экономики, синтезированы рейтинговые оценки активности компаний в этой отрасли экономики в разрезе регионов. ...

Added: December 21, 2018

Asymptotically Optimal Method for Manifold Estimation Problem

Bernstein A., Kuleshov A. P., Yanovich Y., , in: Abstructs of the 29-th European Meeting of Statisticians. Budapest: Bernoulli Society, 2013. P. 325–325.

Let X be an unknown nonlinear smooth q-dimensional Data manifold (D-manifold) embedded in a p-dimensional space (p> q) covered by a single coordinate chart. It is assumed that the manifold's condition number is positive so X has no self-intersections. Let Xn={X1, X2,..., Xn}⊂ X⊂ Rp be a sample randomly selected from the D-manifold Xindependently of each other according ...

Added: September 24, 2015

Manifold Learning in Regression Tasks

Bernstein A., Kuleshov A. P., Yanovich Y., Lecture Notes in Computer Science 2015 Vol. 9047 P. 414–423

The paper presents a new geometrically motivated method for non-linear regression based on Manifold learning technique. The regression problem is to construct a predictive function which estimates an unknown smooth mapping f from q-dimensional inputs to m-dimensional outputs based on a training data set consisting of given ‘input-output’ pairs. The unknown mapping f determines q-dimensional ...

Added: August 30, 2015

Asymptotic Properties of Nonparametric Estimation on Manifold

Yanovich Y., Proceedings of Machine Learning Research 2017 Vol. 60 P. 18–38

In many applications, the real high-dimensional data occupy only a very small part in the high dimensional ‘observation space’ whose intrinsic dimension is small. The most popular model of such data is Manifold model which assumes that the data lie on or near an unknown manifold (Data Manifold, DM) of lower dimensionality embedded in an ...

Added: June 15, 2017