• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • Manifold Learning in Data Mining Tasks
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
May 25, 2026
HSE Scientists Train Neural Network to 'Hear' Faults in Electric Motors
Researchers at the AI and Digital Science Institute of the HSE Faculty of Computer Science have developed a new method—the Signature-Guided Data Augmentation (SGDA) framework—that achieves 99% accuracy in motor fault detection and 86% accuracy in fault classification. The application of this approach can reduce industrial equipment repair costs, minimise downtime, and improve production safety. The study results have been published in Engineering Applications of Artificial Intelligence.
May 25, 2026
'The Humanities Serve as a Conscience'
Maria Mizernaia studies Soviet literature and the history of book publishing. In this interview for the HSE Young Scientists project, she discusses plans to publish a novel about besieged Leningrad, AI-provoked reflections on what it means to be human, and how novels can help satisfy our dopamine hunger.
May 25, 2026
Is It Possible to Predict a Citys Life Based on the Shape of Its Neighbourhoods?
Is it possible to predict, based on the configuration of streets and buildings, where a café will open or where traffic congestion will occur? Participants in the Spatial Analysis and Modelling of Urban Processes research and study group use open data and machine learning to identify universal patterns. Alexander Sheludkov and Eduard Somov discuss the purpose of comparing cities, the need for new forms of urban statistics, and how open data is transforming approaches to urban studies.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Manifold Learning in Data Mining Tasks

P. 119–133.
Kuleshov A. P., Bernstein A.

Many Data Mining tasks deal with data which are presented in high dimensional spaces, and the ‘curse of dimensionality’ phenomena is often an obstacle to the use of many methods for solving these tasks. To avoid these phenomena, various Representation learning algorithms are used as a first key step in solutions of these tasks to transform the original high-dimensional data into their lower-dimensional representations so that as much information about the original data required for the considered Data Mining task is preserved as possible. The above Representation learning problems are formulated as various Dimensionality Reduction problems (Sample Embedding, Data Manifold embedding, Manifold Learning and newly proposed Tangent Bundle Manifold Learning) which are motivated by various Data Mining tasks. A new geometrically motivated algorithm that solves the Tangent Bundle Manifold Learning and gives new solutions for all the considered Dimensionality Reduction problems is presented.

Language: English
Text on another site
Keywords: data miningdimensionality reductionStatistical LearningRepresentation learningManifold LearningTangent LearningTangent Bundle Manifold Learning

In book

Machine Learning and Data Mining in Pattern Recognition
Vol. 8556. , Springer, 2014.
Similar publications
Alignment of Vector Fields on Manifolds via Contraction Mappings
Kachan O., Yanovich Y., Abramov E., Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki 2018 Vol. 160 No. 2 P. 300–308
According to the manifold hypothesis, high-dimensional data can be viewed and meaning- fully represented as a lower-dimensional manifold embedded in a higher dimensional feature space. Manifold learning is a part of machine learning where an intrinsic data representation is uncovered based on the manifold hypothesis. Many manifold learning algorithms were developed. The one called Grassmann&Stiefel eigenmaps ...
Added: January 21, 2026
Pseudo-Boolean Polynomial Method for InterpreTab. Dimensionality Reduction: A Paradigm Shift from Abstract to Meaningful Feature Extraction
Chikake T. M., Goldengorin B. I., Pardalos P. M., Computer Optics 2025 Vol. 49 No. 6 P. 1191–1201
We present a general-purpose, training-free framework for dimensionality reduction and clustering based on per–sample pseudo–Boolean polynomials (PBP). The method constructs compact, interpreTab. features without model fitting and is evaluated under a standardized protocol that compares PBP to PCA, t-SNE, and UMAP using identical inputs and metrics: clustering alignment (V-measure, Adjusted Rand Index), cluster geometry (Silhouette coefficient, ...
Added: January 2, 2026
Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track. European Conference, ECML PKDD 2024, Vilnius, Lithuania, September 9–13, 2024, Proceedings, Part X. LNCS, volume 14950
Cham: Springer, 2024.
This multi-volume set, LNAI 14941 to LNAI 14950, constitutes the refereed proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2024, held in Vilnius, Lithuania, in September 2024. ...
Added: November 22, 2024
2023 IEEE International Conference on Data Mining Workshops (ICDMW) 1–4 December 2023, Shanghai, China
Shanghai: IEEE Computer Society, 2023.
The IEEE International Conference on Data Mining (ICDM) has established itself as the world’s premier research conference in data mining. It provides an international forum for presentation of original research results, as well as exchange and dissemination of innovative and practical development experiences. The conference covers all aspects of data mining, including algorithms, software, systems, ...
Added: March 20, 2024
Поиск закономерностей и важности признаков в данных виктимизационного опроса
D'yakonov A., Головина А. М., Прикладная математика и информатика 2023 Т. 61 № 74 С. 91–108
A methodology for finding patterns by solving machine learning problems with a teacher is described and applied to the analysis of national victimization survey data. Important features for machine learning models, interesting patterns and inconsistencies in the data are found. Experiments on estimating feature importance using different methods are described. ...
Added: March 18, 2024
Reconstruction of manifold embeddings into Euclidean spaces via intrinsic distances
Nikita Puchkin, Vladimir Spokoiny, Eugene Stepanov et al., ESAIM - Control, Optimisation and Calculus of Variations 2024 Vol. 30 Article 3
We consider the problem of reconstructing an embedding of a compact connected Riemannian manifold in a Euclidean space up to an almost isometry, given the information on intrinsic distances between points from its “sufficiently large” subset. This is one of the classical manifold learning problems. It happens that the most popular methods to deal with ...
Added: February 2, 2024
10th International Conference, PReMI 2023, Kolkata, India, December 12–15, 2023, Proceedings. Pattern Recognition and Machine Intelligence. LNCS, volume 14301
Cham: Springer, 2023.
Added: November 29, 2023
Сентимент-анализ как метод исследования информационной повестки и общественного мнения (на примере СМИ и социальных сетей КНР)
Анташева М. С., Lobanova P., Isaeva J. K. et al., Социология: методология, методы, математическое моделирование 2023 № 57 С. 7–41
The information agenda broadcast by Chinese media resources is a   source of up-to-date data on public opinion on key issues of social welfare. Due to the technical peculiarities of the organization of Chinese websites and the need to attract additional resources for automatic processing  (parsing)  of texts in Chinese, this topic is not widely represented in domestic and foreign studies. The ...
Added: November 9, 2023
Data Analysis and Optimization. In Honor of Boris Mirkin's 80th Birthday
Springer, 2023.
This book presents the state-of-the-art in the emerging field of data science and includes models for layered security with applications in the protection of sites—such as large gathering places—through high-stake decision-making tasks. Such tasks include cancer diagnostics, self-driving cars, and others where wrong decisions can possibly have catastrophic consequences. Additionally, this book provides readers with ...
Added: August 31, 2023
Knowledge Discovery, Knowledge Engineering and Knowledge Management: 13th International Joint Conference, IC3K 2021, Virtual Event, October 25–27, 2021, Revised Selected Papers
Springer, 2023.
This book constitutes the extended and revised versions of a set of selected papers from the 13th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2021, on October 25–27, 2021. The conference was held virtually due to the COVID-19 crisis. The 9 full papers included in this book were carefully reviewed and ...
Added: July 8, 2023
Исследование и определение признаков скрытых атак на предприятии для алгоритмов машинного обучения
Золотухина М. А., Zykov S. V., Вестник Российского нового университета 2023 № 1 С. 20–28
Зачастую именно человеческий фактор ведет к распространению угроз на предприятиях. Если техническое устройство представляет собой четко работающий и слаженный механизм с возможностью при помощи диагностического оборудования проводить замеры параметров неисправностей и устранять их, то для исследования скрытых атак необходим новый компонент системы. Предприятия и промышленность в целом нуждаются в интеллектуальной системе защиты и обнаружения скрытых ...
Added: April 11, 2023
Information Systems and Design. Third International Conference, ICID 2022, Tashkent, Uzbekistan, September 12–13, 2022, Revised Selected Papers
Springer, 2023.
This book constitutes the proceedings of Third International Conference on Information Systems and Design, ICID 2022, which took place in Tashkent, Uzbekistan, in September 2022.  The 12 papers presented in this volume were carefully reviewed and selected from 35 submissions. They were organized in topical sections as follows: methodological support of analysis and management tools: theoretical-focused ...
Added: March 31, 2023
Object-Attribute Biclustering for Elimination of Missing Genotypes in Ischemic Stroke Genome-Wide Data
Ignatov D. I., Khvorykh G., Khrunin A. et al., , in: Recent Trends in Analysis of Images, Social Networks and Texts. 9th International Conference, AIST 2020, Skolkovo, Moscow, Russia, October 15–16, 2020 Revised Supplementary ProceedingsVol. 12602.: Springer, 2021. P. 185–204.
© 2021, Springer Nature Switzerland AG.Missing genotypes can affect the efficacy of machine learning approaches to identify the risk genetic variants of common diseases and traits. The problem occurs when genotypic data are collected from different experiments with different DNA microarrays, each being characterised by its pattern of uncalled (missing) genotypes. This can prevent the ...
Added: November 1, 2022
Triclustering in Big Data Setting
Egurnov D., Точилкин Д. С., Ignatov D. I., , in: Complex Data Analytics with Formal Concept Analysis.: Springer, 2022. P. 239–258.
In this paper, we describe versions of triclustering algorithms adapted for efficient calculations in distributed environments with MapReduce model or parallelisation mechanism provided by modern programming languages. OAC-family of triclustering algorithms shows good parallelisation capabilities due to the independent processing of triples of a triadic formal context. We provide time and space complexity of the ...
Added: November 1, 2022
Machine learning methods for demographic data analysis
Muratova A., Ignatov D. I., Mitrofanova E., , in: Recent Trends in Analysis of Images, Social Networks and Texts. 9th International Conference, AIST 2020, Skolkovo, Moscow, Russia, October 15–16, 2020 Revised Supplementary ProceedingsVol. 12602.: Springer, 2021. P. 297–299.
This is the extended abstract of a case study on demographic sequences analysis by machine learning and data mining methods. ...
Added: November 1, 2022
Recent Trends in Analysis of Images, Social Networks and Texts. 10th International Conference, AIST 2021, Tbilisi, Georgia, December 16–18, 2021, Revised Selected Papers
Springer, 2022.
This book constitutes revised selected papers of the 10th International Conference on Analysis of Images, Social Networks and Texts, AIST 2021, held in Tbilisi, Georgia, in December 2021. Due to the COVID-19 pandemic the conference was held in hybrid mode.  The 17 full papers were carefully reviewed and selected from 118 submissions, out of which 92 ...
Added: October 31, 2022
Использование метода интеллектуального анализа данных для прогнозирования академически рискованных студентов в зависимости от их темперамента (на примере факультета ИМиКН в НИУ ВШЭ-Нижний Новгород)
Shadrina E. V., Вестник Нижегородского университета им. Н.И. Лобачевского. Серия: Социальные науки 2022 № 3(67) С. 229–236
The article discusses the influence of temperament on the academic performance of the first-year students at HSENizhny Novgorod on the example of the Faculty of Informatics, Mathematics and Computer Science. Analysis was held with the help of statistics methods and methods of data mining. The baseline data for the study is information about students, collected ...
Added: October 18, 2022
Artificial Intelligence in Education. Posters and Late Breaking Results, Workshops and Tutorials, Industry and Innovation Tracks, Practitioners’ and Doctoral Consortium -23rd International Conference, AIED 2022, Durham, UK, July 27–31, 2022, Proceedings, Part II
Springer, 2022.
This two-volume set LNAI 13355 and 13356 constitutes the refereed proceedings of the 23rd International Conference on Artificial Intelligence in Education, AIED 2022, held in Durham, UK, in July 2022. The 40 full papers and  40 short papers presented together with 2 keynotes, 6 industry papers, 12 DC papers, 6 Workshop papers, 10 Practitioner papers, 97 ...
Added: July 28, 2022
Credit scoring methods: latest trends and points to consider
Anton Markov, Zinaida Seleznyova, Victor Lapshin, Journal of Finance and Data Science 2022 Vol. 8 P. 180–201
Credit risk is the most significant risk by impact for any bank and financial institution. Accurate credit risk assessment affects an organisation’s balance sheet and income statement, since credit risk strategy determines pricing, and might even influence seemingly unrelated domains, e.g. marketing, and decision-making. This article aims at providing a systemic review of the most recent (2016–2021) articles, identifying ...
Added: July 28, 2022
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit