Clustering of Biomedical Data Using the Greedy Clustering Algorithm Based on Interval Pattern Concepts

A. V. Galatenko; S. Nersisyan; Pankratieva V.

P. 65–74.

Interval pattern concepts are a particular case of pattern structures. They can be used to cluster rows of a numerical formal context (data matrix): two rows are close to each other if their entries at the corresponding positions fall within a given interval. The problem of mining interval pattern concepts has much in common with a known problem in computational geometry: given a finite set of points in Euclidean space, position a box of a given size in such a way that it encloses as many points as possible. This problem and its variations have been thoroughly studied in the case of a plane; however, the authors are not aware of algorithms that produce an exact solution in reasonable time in a space of arbitrary dimension. There exists an approximate greedy algorithm for solving this problem. Its running time is linear in the number of points and polynomial in the dimension. We apply a clustering approach based on that algorithm to the gene expression table from the dataset “The Cancer Cell Line Encyclopedia”. The resulting partition agrees well with a priori known biological factors.
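
The closeness criterion and the box-placement problem described in the abstract can be illustrated with a short Python sketch. This is not the authors' greedy algorithm (the abstract does not specify it, and this baseline is quadratic in the number of rows rather than linear). It is a naive illustration under stated assumptions: the names `rows_close` and `naive_box_clustering` are hypothetical, and candidate boxes are restricted to those centred on an existing row.

```python
from typing import List, Sequence


def rows_close(a: Sequence[float], b: Sequence[float], width: float) -> bool:
    """Return True if every pair of corresponding entries fits in an interval of length `width`."""
    return all(abs(x - y) <= width for x, y in zip(a, b))


def naive_box_clustering(rows: List[Sequence[float]], width: float) -> List[List[int]]:
    """Naive greedy baseline (an assumption, not the published method).

    Repeatedly pick the axis-aligned box with side length `width`, centred on
    one of the remaining rows, that covers the most remaining rows; emit the
    covered row indices as a cluster and remove them.
    """
    remaining = list(range(len(rows)))
    clusters: List[List[int]] = []
    half = width / 2.0
    while remaining:
        best: List[int] = []
        for c in remaining:
            # Rows inside the box centred on row c.
            covered = [
                i for i in remaining
                if all(abs(x - y) <= half for x, y in zip(rows[i], rows[c]))
            ]
            if len(covered) > len(best):
                best = covered
        clusters.append(best)
        taken = set(best)
        remaining = [i for i in remaining if i not in taken]
    return clusters


if __name__ == "__main__":
    data = [[0.0, 0.0], [0.1, 0.2], [0.2, 0.1], [5.0, 5.0], [5.1, 4.9]]
    print(naive_box_clustering(data, width=1.0))
```

Because every row in a cluster lies in one box of side `width`, any two rows of the same cluster satisfy `rows_close` for that width, which matches the interval criterion stated above.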

Language: English

Keywords: clustering, greedy algorithm, interval pattern concepts

In book

Proceedings of the International Workshop "What can FCA do for Artificial Intelligence?" (FCA4AI at IJCAI/ECAI 2019)

[s.n.], 2019.

Flexible Stock Market Algorithm

Rubchinskiy A., Chubarova D., Technology and Investment 2025 Vol. 16 No. 4 P. 211–240

The article considers one of the most famous examples of socio-economic systems characterized by significant uncertainty: the S&P-500 stock market, where shares of the 500 largest US companies are traded. A flexible algorithm for daily trading has been developed. It is based on known fixed data about the cost of shares in previous days as well as on ...

Added: December 19, 2025

Tunnel Clustering Method

F. T. Aleskerov, A. L. Myachin, V. I. Yakuba, Doklady Mathematics 2024 Vol. 110 No. 3 P. 474–479

We propose a novel method for rapid pattern analysis of high-dimensional numerical data, termed tunnel clustering. The main advantages of the method are its relatively low computational complexity, endogenous determination of cluster composition and number, and a high degree of interpretability of final results. We present descriptions of three different variations: one with fixed hyperparameters, ...

Added: March 3, 2025

Using Z-numbers to Describe a Dataset

Гусейнов О., Degtyarev K. Y., IRETC MTÜ PAHTEI - Proceedings of Azerbaijan High Technical Educational Institutions 2025 Vol. 48 No. 1 P. 360–370

The concept of Z-number was proposed by Prof. Lotfi Zadeh to describe partial reliability of information, and it is a kind of fusion of fuzziness and probabilistic uncertainty. Z-number can be presented as a pair of fuzzy numbers Z(A,B) used to describe a value of a random variable X. The first component (A) is a ...

Added: February 20, 2025

Gradient descent clustering with regularization to recover communities in transformed attributed networks

Shalileh S., Social Network Analysis and Mining 2025 Vol. 15212 P. 137–148

Community detection in attributed networks aims to recover clusters in which the within-community nodes are as interconnected and as homogeneous as possible, while the between-communities nodes are as disconnected and as heterogeneous as possible. The current research proposes a straightforward data-driven model with an integrated regularization term to recover communities. For further improvement of the ...

Added: November 30, 2024

An empirical scrutinization of four crisp clustering methods with four distance metrics and one straightforward interpretation rule

T. A. Alvandyan, S. Shalileh, Doklady Mathematics 2024 Vol. 110 No. S1 P. S236–S250

Clustering has always been in great demand by scientific and industrial communities. However, due to the lack of ground truth, interpreting its obtained results can be debatable. The current research provides an empirical benchmark on the efficiency of three popular and one recently proposed crisp clustering methods. To this end, we extensively analyzed these (four) ...

Added: November 30, 2024

Modeling Teachers' Pay under Heterogeneous Socio-Economic Conditions of Regions

Богданова Т. К., Жукова Л. В., in: 11th International Conference "Multivariate Statistical Analysis, Econometrics and Modeling of Real Processes" named after S. A. Aivazian.: Moscow: CEMI RAS, 2024. P. 41–44.

The paper is devoted to the analysis and forecasting of the average salary of teachers. For 84 regions on the basis of their socio-demographic characteristics according to Rosstat data using Ward's method we obtained a two-cluster solution, which allowed us to identify quite strong differences in the level of wages, GRP per capita, level of ...

Added: October 4, 2024

Clustering with empty clusters

Penikas H. I., Феста Ю. Ю., Известия Дальневосточного федерального университета. Экономика и управление 2024 Vol. 2 P. 75–94

Cluster analysis is widely used in various scientific and practical fields related to data analysis. It is an important tool for solving problems in areas such as machine learning, image processing, text recognition, etc. The absence of observations does not always mean the absence of information, so it is assumed that gaps in the data, that is, “empty” clusters, also carry information about the object of study, just as real observations do. In this study it is assumed, ...

Added: August 10, 2024

Detecting linguistic variation with geographic sampling

Koile E., Moroz G., Journal of Linguistic Geography 2024 Vol. 12 No. 1 P. 24–31

Geolectal variation is often present in settings where one language is spoken across a vast geographic area. This can be found in phonological, morphosyntactic, and lexical features. For practical reasons, it is not always possible to conduct fieldwork in every single location of interest in order to obtain the full pattern of variation, and a ...

Added: May 6, 2024

Spot the Bot: Distinguishing Human-Written and Bot-Generated Texts Using Clustering and Information Theory Techniques

Gromov V., Dang Q. N., in: 10th International Conference, PReMI 2023, Kolkata, India, December 12–15, 2023, Proceedings. Pattern Recognition and Machine Intelligence. LNCS, volume 14301.: Cham: Springer, 2023. Ch. 3 P. 20–27.

Added: November 29, 2023

Temperature-driven transition into vortex clusters in low-kappa intertype superconductors

Backs A., Al-Falou A., Vagov A. et al., Physical Review B: Condensed Matter and Materials Physics 2023 Vol. 107 No. 17 Article 174527

In the vicinity of the type-I/type-II crossover in conventional superconductors, vortices exhibit a nonmonotonic interaction, which leads to exotic vortex matter states. We perform molecular dynamics simulations on a model superconductor in the intertype regime. In a field cooled approach, we examine the transition of a homogeneous vortex lattice (VL) into a structure consisting of ...

Added: November 2, 2023

2023 Fifth International Conference Neurotechnologies and Neurointerfaces (CNN) 18-20 Sept. 2023

Alshanskaia E., Martynova O., IEEE, 2023.

Cognitive and emotional load in the course of increasing the complexity of tasks leads to the activation of various parts of the autonomic nervous system (ANS) and can be accompanied by an increase in the efficiency of problem solving. An increase in cognitive load under the condition of high motivation is a stress factor and ...

Added: September 24, 2023

A New Software Platform for Modelling Traffic Flows Involving Unmanned Vehicles

Beklaryan A., Вестник ЦЭМИ 2023 Vol. 6 No. 1 Article 5

The article presents a new software platform for modelling traffic flows involving unmanned vehicles, using a number of advanced technological solutions, in particular, the FLAME GPU supercomputer agent modelling framework, intelligent software modules based on fuzzy and hierarchical clustering, genetic optimization algorithms, a subsystem for visualizing the state of agents-vehicles based on OpenGL, etc. As ...

Added: June 4, 2023

Tracing Vortex Clustering in a Superconductor by the Magnetic Flux Distribution

A. Vagov, E. G. Nikonov, The Journal of Physical Chemistry Letters 2023 Vol. 14 No. 15 P. 3743–3748

By investigating spatial configurations of the intermediate mixed state in an intertype superconductor, it is shown that vortex clustering can be characterized by the sample averaged distribution of the penetrating magnetic field. The clustering is manifested in the two peak structure of the distribution. The second peak indicates a spot a material occupies in the ...

Added: June 2, 2023

An empirical comparison of connectivity-based distances on a graph and their computational scalability

Miasnikof P., Shestopaloff A., Pitsoulis L. et al., Journal of Complex Networks 2022 Vol. 10 No. 1 Article cnac003

In this study, we compare distance measures with respect to their ability to capture vertex community structure and the scalability of their computation. Our goal is to find a distance measure which can be used in an aggregate pairwise minimization clustering scheme. The minimization should lead to subsets of vertices with high induced subgraph density. ...

Added: November 21, 2022

Noise Clustering as a Method for Assessing Permanent Vascular Access Function in Hemodialysis Patients

Кравцов П. Ф., Николаев Е. Н., Мазайшвили К. В. et al., Вестник СурГУ. Медицина 2022 Vol. 51 No. 1 P. 25–30

Abstract. The study aims to develop an algorithm for assessing spectrographic features of arteriovenous fistula dysfunction for hemodialysis. Materials and methods. Forty-four patients with a native radiocephalic fistula formed in the distal third of the forearm participated in the research. Using an electronic stethoscope, the noise of the arteriovenous fistula was recorded in all patients. 653 spectrograms were analyzed with the ...

Added: November 14, 2022

Distinguishing Chaotic and Regular Time Series to Identify the State of an Arteriovenous Fistula

Gromov V., Мазайшвили К. В., Заикин П. В. et al., Вестник кибернетики 2022 Vol. 45 No. 1 P. 72–82

The prevalence of chronic kidney disease is growing every year and is already comparable to such socially significant diseases as hypertension and diabetes mellitus, as well as obesity and metabolic syndrome [1,2]. The standard solution for hemodialysis patients is to create a permanent vascular access in the form of an arteriovenous fistula. However, its use ...

Added: November 14, 2022

Directions of Support for Small Enterprises in the Industrial Sector of the Economy: A Regional Aspect

Arkhipova M., Cherviakova A. A., Друкеровский вестник 2022 No. 2 P. 86–109

Innovation activity of small industrial enterprises varies substantially across Russian regions, which makes it relevant to search for distinctive features of innovative entrepreneurship in Russian regions in order to develop targeted support for small innovative enterprises and to create new jobs in the digital economy. Multidimensional clustering of Russian regions by the indicators of small industrial ...

Added: October 24, 2022

Usage of Clustering of Paley Graphs in Polar Coordinates for the Development of New Network on Chip Topologies

Alijon F. Fatullaev, Edward R. Rzaev, Aleksandr Yu. Romanov, in: 2022 International Russian Automation Conference (RusAutoCon).: IEEE, 2022. P. 419–423.

The article presents a study of clustering of Paley graphs with the arrangement of prime numbers in polar coordinates and a comparison of the resulting groups in terms of their static parameters; the application of fault-tolerant self-organizing routing method for new topologies is also considered. This article is a continuation of a series of articles ...

Added: October 2, 2022