Determining Centroids to Improve the Accuracy of Ordinal-Invariant Pattern Clustering

Управление большими системами: сборник трудов (Large-Scale Systems Control: Collected Papers). 2019. No. 78. P. 6–22.
Myachin A. L.

This work continues research on methods for analyzing patterns in parallel coordinates whose results do not depend on the order of the input data. The basic operations on objects of ordinal-invariant pattern clusters are described. It is proved that the centroid of an ordinal-invariant pattern cluster belongs to the original cluster, which makes it possible to estimate intracluster object-centroid distances in the multidimensional feature space. Examples of revealing the structural similarity of objects in parallel coordinates are given, and the main differences between pattern analysis and cluster analysis are noted. A procedure for determining the centroid of an ordinal-invariant pattern cluster is described. An algorithm is proposed that combines groups of objects based on their structural similarity, on the one hand, and on minimizing intracluster distances, on the other. This improves the accuracy of the final results and partially solves the problem of finding similar objects when the original data contain errors. The proposed algorithm uses intracluster object-centroid distances and satisfies the following conditions: endogenous determination of the number and composition of the groups of objects under study; relatively low computational complexity; independence of the original partition from the initial order of the input data. The algorithm is demonstrated on classical data sets; the test results are presented and show improved clustering accuracy.
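The centroid property stated in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's exact encoding, so the sketch assumes that an object's ordinal-invariant pattern is the set of pairwise order relations between its feature values; the function names `ordinal_code`, `pattern_clusters`, and `centroid` and the sample data are hypothetical. Under this assumption, averaging objects that share the same order relations preserves those relations, so the centroid falls into the same cluster and object-centroid distances are meaningful within it.

```python
from collections import defaultdict
from math import dist


def ordinal_code(obj):
    """Encode an object by the order relation of every feature pair (i, j), i < j.

    Each entry is -1 (obj[i] < obj[j]), 0 (equal) or 1 (obj[i] > obj[j]).
    This encoding is an assumption made for illustration.
    """
    n = len(obj)
    return tuple(
        (obj[i] > obj[j]) - (obj[i] < obj[j])
        for i in range(n)
        for j in range(i + 1, n)
    )


def pattern_clusters(objects):
    """Group objects that share the same ordinal code."""
    clusters = defaultdict(list)
    for obj in objects:
        clusters[ordinal_code(obj)].append(obj)
    return dict(clusters)


def centroid(members):
    """Component-wise mean of the cluster members."""
    n = len(members)
    return tuple(sum(column) / n for column in zip(*members))


# Hypothetical sample data: three features per object.
objects = [
    (1.0, 3.0, 2.0),
    (2.0, 6.0, 4.0),
    (0.5, 0.9, 0.7),
    (5.0, 1.0, 3.0),
    (4.0, 2.0, 3.0),
]

clusters = pattern_clusters(objects)
for code, members in clusters.items():
    c = centroid(members)
    # The centroid has the same ordinal code as the members of its cluster.
    assert ordinal_code(c) == code
    # Intracluster object-centroid distances (Euclidean).
    distances = [dist(m, c) for m in members]
    print(code, c, [round(d, 3) for d in distances])
```

The grouping depends only on each object's own code, so the partition does not change when the input objects are reordered. The merging step that the abstract describes is not sketched here because its criterion is not specified in this record.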

Priority areas: IT and mathematics
Language: Russian
Keywords: cluster analysis, pattern analysis, pattern
Publication based on the results of:
A study of models of decision-making and analysis of complex structured data (2019)