Asymptotically Optimal Method for Manifold Estimation Problem

A. Bernstein; A. P. Kuleshov; Y. Yanovich

?

Asymptotically Optimal Method for Manifold Estimation Problem

P. 325–325.

Bernstein A., Kuleshov A. P., Yanovich Y.

Let X be an unknown nonlinear smooth q-dimensional Data manifold (D-manifold) embedded in a p-dimensional space (p> q) covered by a single coordinate chart. It is assumed that the manifold's condition number is positive so X has no self-intersections. Let Xn={X1, X2,..., Xn}⊂ X⊂ Rp be a sample randomly selected from the D-manifold Xindependently of each other according to an unknown probability measure on X with strictly positive density.

Language: English

Text on another site

Keywords: dimension reduction Manifold Learning

In book

Abstructs of the 29-th European Meeting of Statisticians

Budapest: Bernoulli Society, 2013.

Reconstruction of manifold embeddings into Euclidean spaces via intrinsic distances

Nikita Puchkin, Vladimir Spokoiny, Eugene Stepanov et al., ESAIM - Control, Optimisation and Calculus of Variations 2024 Vol. 30 Article 3

We consider the problem of reconstructing an embedding of a compact connected Riemannian manifold in a Euclidean space up to an almost isometry, given the information on intrinsic distances between points from its “sufficiently large” subset. This is one of the classical manifold learning problems. It happens that the most popular methods to deal with ...

Added: February 2, 2024

Asymptotic Properties of Nonparametric Estimation on Manifold

Yanovich Y., Proceedings of Machine Learning Research 2017 Vol. 60 P. 18–38

In many applications, the real high-dimensional data occupy only a very small part in the high dimensional ‘observation space’ whose intrinsic dimension is small. The most popular model of such data is Manifold model which assumes that the data lie on or near an unknown manifold (Data Manifold, DM) of lower dimensionality embedded in an ...

Added: June 15, 2017

Semiparametric estimation of the signal subspace

Belomestny D., Panov V., Spokoiny V., Journal of machine learning and data analysis 2012 Vol. 1 No. 3 P. 140–147

Let a high-dimensional random vector $\vX$ be represented as a sum of two components - a signal $\vS$ that belongs to some low-dimensional linear subspace $\S$, and a noise component $\vN$. This paper presents a new approach for estimating the subspace $\S$ based on the ideas of the Non-Gaussian Component Analysis. Our approach avoids the ...

Added: September 23, 2013

Locally isometric and conformal parameterization of image manifold

Bernstein A., Kuleshov A. P., Yanovich Y., , in: Proceedings of SPIE 9875, Eighth International Conference on Machine Vision (ICMV 2015). Barcelona: SPIE, 2015. Ch. 7 P. 1–7.

Images can be represented as vectors in a high-dimensional Image space with components specifying light intensities at image pixels. To avoid the ‘curse of dimensionality’, the original high-dimensional image data are transformed into their lower-dimensional features preserving certain subject-driven data properties. These properties can include ‘information-preserving’ when using the constructed low-dimensional features instead of original ...

Added: February 6, 2016

Alignment Of Vector Fields On Manifolds Via Contraction Mappings

Kachan O. N., Yanovich Y., Abramov E., Ученые записки Казанского университета. Серия: Физико-математические науки 2018 Vol. 160 No. 2 P. 300–308

According to the manifold hypothesis, high-dimensional data can be viewed and meaningfully represented as a lower-dimensional manifold embedded in a higher dimensional feature space. Manifold learning is a part of machine learning where an intrinsic data representation is uncovered based on the manifold hypothesis. Many manifold learning algorithms were developed. The one called Grassmann & Stiefel eigenmaps (GSE) ...

Added: October 29, 2020

Cross-Entropy Reduction of Data Matrix with Restriction on Information Capacity of the Projectors and Their Norms

Popkov Y., Popkov A. Y., Dubnov Y. A., Mathematical Models and Computer Simulations 2021 Vol. 13 No. 3 P. 382–394

© 2021, Pleiades Publishing, Ltd.Abstract: We develop/propose the method reducing the dimension of a data matrix, based on its direct and inverse projection, and the calculation of projectors that minimize the cross-entropy functional, remove. We introduce the concept of information capacity of a matrix, which is used as a constraint in the optimal reduction problem, ...

Added: October 28, 2022

Manifold Learning in Data Mining Tasks

Kuleshov A. P., Bernstein A., , in: Machine Learning and Data Mining in Pattern RecognitionVol. 8556. Springer, 2014. P. 119–133.

Many Data Mining tasks deal with data which are presented in high dimensional spaces, and the ‘curse of dimensionality’ phenomena is often an obstacle to the use of many methods for solving these tasks. To avoid these phenomena, various Representation learning algorithms are used as a first key step in solutions of these tasks to ...

Added: September 30, 2014

Manifold Learning: Generalization Ability and Tangent Proximity

Bernstein A., Kuleshov A. P., International Journal of Software and Informatics 2013 No. 7(3) P. 359–390

One of the ultimate goals of Manifold Learning (ML) is to reconstruct an unknown nonlinear low-dimensional Data Manifold (DM) embedded in a high-dimensional observation space from a given set of data points sampled from the manifold. We derive asymptotic expansion and local lower and upper bounds for the maximum reconstruction error in a small neighborhood ...

Added: November 21, 2014

Manifold Learning Based On Kernel Density Estimation

Kuleshov A. P., Bernstein A. V., Yanovich Y., Ученые записки Казанского университета. Серия: Физико-математические науки 2018 Vol. 160 No. 2 P. 327–338

The problem of unknown high-dimensional density estimation has been considered. It has been suggested that the support of its measure is a low-dimensional data manifold. This problem arises in many data mining tasks. The paper proposes a new geometrically motivated solution to the problem in the framework of manifold learning, including estimation of an unknown ...

Added: October 28, 2020

Estimation of the signal subspace without estimation of the inverse covariance matrix

Panov V., / Series Discussion paper SFB 649 "Economic risk". 2010. No. 2010-050.

Added: September 3, 2015

Inserted perturbations generating asymptotical integrability

Karasev M., Novikova E., Mathematical notes 2014 Vol. 96 No. 6 P. 965–970

We discuss the general opportunity to create (asymptotically) a comletely integrable system from the original perturbed system by inserting additional perturbing terms. After such an artificial insertion, there appears an opportunity to make the secondary averaging and secondary reduction of the original system. Thus, by this way, the $3D$-system becomes $1$-dimensional. We demonstrate this approach ...

Added: November 25, 2014

Manifold Learning Based On Kernel Density Estimation

Added: October 28, 2020

Non-Gaussian Component Analysis: New Ideas, New Proofs, New Applications

Panov V., / Series Discussion paper SFB 649 "Economic risk". 2010. No. 2010-026.

In this article, we present new ideas concerning Non-Gaussian Component Analysis (NGCA). We use the structural assumption that a high-dimensional random vector $\vX$ can be represented as a sum of two components - a low-dimensional signal $\vS$ and a noise component $\vN$. We show that this assumption enables us for a special representation for the ...

Added: September 3, 2015

Structure-adaptive Manifold Estimation

Puchkin N., Spokoiny V., Journal of Machine Learning Research 2022 Vol. 23 No. 40 P. 1–62

We consider a problem of manifold estimation from noisy observations. Many manifold learning procedures locally approximate a manifold by a weighted average over a small neighborhood. However, in the presence of large noise, the assigned weights become so corrupted that the averaged estimate shows very poor performance. We suggest a structure-adaptive procedure, which simultaneously reconstructs ...

Added: February 3, 2022

User-controllable Multi-texture Synthesis with Generative Adversarial Networks

Alanov A., Kochurov M., Volkhonskiy D. et al., , in: Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISAPP 2020)Vol. 4. SciTePress, 2020. P. 214–221.

We propose a novel multi-texture synthesis model based on generative adversarial networks (GANs) with a user-controllable mechanism. The user control ability allows to explicitly specify the texture which should be generated by the model. This property follows from using an encoder part which learns a latent representation for each texture from the dataset. To ensure ...

Added: November 8, 2020

Состоятельность оценки области определения алгоритмом спектральных вложений Грассмана-Штифеля

Yanovich Y., В кн.: Сборник статей конференции "Информационные технологии и системы" (ИТиС'16). М.: ИППИ РАН, 2016. С. 191–197.

В машинном обучении при построении регрессионных зависимостей или решении задач классификации многомерные описания объектов часто являются избыточными и функционально зависимыми. Такие описания зачастую лежат около многообразий существенно меньшей размерности, чем размерность их первичной записи. Данное предположение называется гипотезой многообразия (Manifold Hypothesis). Использование такой информации может помочь в решении исходной задачи. Так возникает задача оценивания многообразий. ...

Added: November 24, 2016

Estimation Of Smooth Vector Fields On Manifolds By Optimization On Stiefel Group

Abramov E., Yanovich Y., Ученые записки Казанского университета. Серия: Физико-математические науки 2018 Vol. 160 No. 2 P. 220–228

Real data are usually characterized by high dimensionality. However, real data obtained from real sources, due to the presence of various dependencies between data points and limitations on their possible values, form, as a rule, form a small part of the high-dimensional space of observations. The most common model is based on the hypothesis that ...

Added: October 29, 2020

Information preserving and locally isometric&conformal embedding via Tangent Manifold Learning

Bernstein A., Kuleshov A. P., Yanovich Y., , in: Data Science and Advanced Analytics (DSAA), 2015. 36678 2015. IEEE International Conference on. P.: IEEE, 2015. P. 1–10.

In many Data Analysis tasks, one deals with data that are presented in high-dimensional spaces. In practice original high-dimensional data are transformed into lower-dimensional representations (features) preserving certain subject-driven data properties such as distances or geodesic distances, angles, etc. Preserving as much as possible available information contained in the original high-dimensional data is also an ...

Added: January 14, 2016

Manifold Learning in Regression Tasks

Bernstein A., Kuleshov A. P., Yanovich Y., Lecture Notes in Computer Science 2015 Vol. 9047 P. 414–423

The paper presents a new geometrically motivated method for non-linear regression based on Manifold learning technique. The regression problem is to construct a predictive function which estimates an unknown smooth mapping f from q-dimensional inputs to m-dimensional outputs based on a training data set consisting of given ‘input-output’ pairs. The unknown mapping f determines q-dimensional ...

Added: August 30, 2015

Asymptotic Properties of Local Sampling on Manifold

Yanovich Yury Aleksandrovich, Journal of Mathematics and Statistics 2016 Vol. 12 No. 3 P. 157–175

In many applications, the real high-dimensional data occupy only a very small part in the high dimensional ‘observation space’ whose intrinsic dimension is small. The most popular model of such data is Manifold model which assumes that the data lie on or near an unknown manifold Data Manifold, (DM) of lower dimensionality embedded in an ...

Added: November 24, 2016