An Extension of K-Means for Least-Squares Community Detection in Feature-Rich Networks

?

An Extension of K-Means for Least-Squares Community Detection in Feature-Rich Networks

P. 285–296.

In press

We propose an extension of the celebrated K-means algorithm for community detection in feature-rich networks. Our least-squares criterion leads to a straightforward extension of the conventional batch K-means clustering method as an alternating optimization strategy for the criterion. By replacing the innate squared Euclidean distance with cosine distance we effectively tackle the so-called curse of dimensionality. We compare our proposed methods using synthetic and real-world data with state-of-the-art algorithms from the literature. The cosine distance-based version appears to be the overall winner, especially at larger datasets.

Language: English

DOI

In book

COMPLEX NETWORKS 2021: Complex Networks & Their Applications X.

Springer, 2021.

Community Partitioning over Feature-Rich Networks Using an Extended K-Means Method

Shalileh S., Mirkin B., Entropy 2022 Vol. 24 No. 5 Article 626

This paper proposes a meaningful and effective extension of the celebrated K-means algorithm to detect communities in feature-rich networks, due to our assumption of non-summability mode. We least-squares approximate given matrices of inter-node links and feature values, leading to a straightforward extension of the conventional K-means clustering method as an alternating minimization strategy for the ...

Added: August 1, 2022

A Data Recovery Method for Community Detection in Feature-Rich Networks

Mirkin B., , in: 2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).: Association for Computing Machinery (ACM), 2020. Ch. 05 P. 99–105.

The problem of community detection in a network with features at its nodes takes into account both the graph structure and node features. The goal is to find relatively dense groups of interconnected entities sharing some features in common. We apply the so-called data recovery approach to the problem by combining the least-squares recovery criteria ...

Added: October 31, 2020

Least-squares community extraction in feature-rich networks using similarity data

Shalileh S., Mirkin B., Plos One 2021 Vol. 16 No. 7 Article 0254377

We explore a doubly-greedy approach to the issue of community detection in feature-rich networks. According to this approach, both the network and feature data are straightfor- wardly recovered from the underlying unknown non-overlapping communities, supplied with a center in the feature space and intensity weight(s) over the network each. Our least- squares additive criterion allows ...

Added: July 22, 2021

Detecting Communities in Feature-Rich Networks with a K-Means Method

Shalileh S., Mirkin B., , in: Intelligent Data Engineering and Automated Learning – IDEAL 2021.: Springer, 2021. P. 539–547.

The main result of this paper is an extension of the K-means algorithm to the issue of community detection in feature-rich networks. This is based on a data-recovery criterion additively combining conventional least-squares criteria for approximation of the network link data and the feature data at network nodes. The dimension of the space at which ...

Added: April 19, 2022

Community detection in feature-rich networks to meet K-means

Shalileh S., Mirkin B., , in: ASONAM '21: Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.: Association for Computing Machinery (ACM), 2021. P. 138–142.

We derive two extensions of the celebrated K-means algorithm as a tool for community detection in feature-rich networks. We define a data-recovery criterion additively combining conventional least-squares criteria for approximation of the network link data and the feature data at network nodes by a partition along with its within-cluster "centers". The dimension of the space ...

Added: April 19, 2022

Detection of an unspecified number of communities in feature-rich networks

Shalileh S., Mirkin B., , in: Proceedings of MARAMI 2020 - Modèles & Analyse des Réseaux : Approches Mathématiques & Informatiques - The 11th Conference on Network Modeling and Analysis(Vol-2750)Vol. Vol-2750: Modèles & Analyse des Réseaux : Approches Mathématiques & Informatiques - Network Modeling and Analysis 2020.: CEUR-WS.org, 2020. P. 1–12.

The problem of community detection in a network with features at its nodes takes into account both the graph structure and node features. The goal is to find relatively dense groups of interconnected entities sharing some features in common. Existing approaches require the number of communities pre-specified. We apply the so-called data recovery approach to ...

Added: January 13, 2021

Построение комплексного индикатора для оценки состояния российского коммерческого банка на основе структурированных и неструктурированных данных

Bogdanova T., Zhukova L., В кн.: Системное моделирование социально-экономических процессов: труды 43-ей международной научной школы-семинара.: Воронеж: Истоки, 2020. Гл. 9 С. 481–488.

The paper describes an approach to constructing a comprehensive indicator for assessing the state of the bank, other than satisfactory, including both homogeneous structured data on the financial condition of the bank, and not structured, from "open" data sources. To construct the components of a universal indicator, it is proposed to use the methods of ...

Added: February 16, 2021

Изучение энергетической устойчивости регионов Российской Федерации с применением методов анализа паттернов

Myachin A. L., Prokofiev V. N., Степанов А. А., Управление большими системами: сборник трудов 2021 № 92 С. 43–63

The study is devoted to the application of ordinal-fixed and ordinal-invariant pattern clustering to study the structure of the energy sector in regions of Russian Federation over a five-year period. Methods for pattern analysis in the work are due to the independence of the final results from the difference in the absolute values of indicators ...

Added: September 1, 2021

Модели финансовых систем зарубежных стран

Abramov A., Акшенцева К. С., / РАНХиГС. Серия Нет "working papers series". 2011.

The purpose of the study -- The positioning of the Russian financial system in the world, identifying key trends and strategies for its development in the future. The paper-based methods of cluster analysis examines a group of countries and their financial systems, studied Russian model of capitalism and the financial system, its place among the ...

Added: March 25, 2014

Africa and the Ukraine Crisis: Exploring Attitudes

Safranchuk I., Nesmashnyi A., Chernov, D.N., Russia in Global Affairs 2023 Vol. 21 No. 3 P. 159–180

The scale and global consequences of the Ukraine crisis do not allow even countries that are not directly involved in the standoff to ignore it. Most members of the international system have to respond to the current events and formulate their position on the conflict. When analyzing these positions, the epistemic community tries to explain ...

Added: September 17, 2023

Возможности официальной статистики для оценки потенциала российских предприятий по производству конкурентоспособных товаров

Popovskaya E. V., Вопросы статистики 2007 № 8 С. 20–29

Проведен анализ системы показателей, формируемых государственной статистикой, системы организации сбора и обработки информации, что позволило оценить возможности статистики, выявить ряд ограничений, и разработать предложения по формированию системы информации, необходимой для расчета потенциала российских предприятий по производству конкурентоспособных товаров статистики. Оценку потенциала российских предприятий по производству товаров, конкурентоспособных на внутреннем рынке, предлагается проводить методом кластерного анализа ...

Added: December 14, 2012

Industrial production zones as a tool of development of the regional economy (on the example of the republic of Tatarstan and the Sverdlovsk region)

Gabdrakhmanov N., Ергунова О. Т., Astra Salvensis 2017 No. 2 P. 447–455

At the present time, during the period of integration and globalization, free economic zones or, as they are called in Russia, special economic zones, become a regular fixture in the world economic practice, and are an integral part of domestic and international economic relations. This issue has been studied by foreign and domestic economists for ...

Added: October 8, 2018

Социография: инновационная аналитическая стратегия

Petrenko E. S., Galitskaya E., Galitsky E., Телескоп: журнал социологических и маркетинговых исследований 2010 № 2(80) С. 27–30

В статье описывается аналитическая стратегия решения задачи диагностики "гражданского климата" в различных российских регионах. Исследовательская стратегия, заключающаяся в последовательном применении процедур факторного, кластерного анализов и построения классификационного дерева, дает возможность синергического анализа эмпирических данных, полученных в разных опросах. Особое внимание уделяется устойчивости обобщенных характеристик осей пространства описания гражданских отношений, построенных по данным опросов, различающихся периодом ...

Added: October 25, 2012

Economic Consequences of the Development of Digital Technologies in Russia

Arkhipova M., Сербова Ю. О., , in: Proceedings of the Third Workshop on Computer Modelling in Decision Making (CMDM 2018)Issue 85: Advances in Computer Science Research.: Atlantis Press, 2019. Ch. 10 P. 56–60.

The article examines the literature, which determinates the factors affecting Gross Regional Product as well as broadens the analysis to different regions of the Russian Federation. The regressions modeling and cluster analysis is used for the issue. Two linear regression models are constructed based on the indicators of the Federal Statistics Survey databases for the ...

Added: October 30, 2019

Studying the indicators of regional sports development in Russian Federation

Prokofiev V. N., Myachina K., Myachin A. L., Control Sciences 2021 Vol. 3 P. 50–57

The indicators of regional sports development in the Russian Federation are analyzed to find regions with a similar sports development strategy (according to the chosen methodology and measures of closeness) and to identify dynamic groups in a four-year period. Some clustering and pattern analysis methods are described, and their use in the study is validated. ...

Added: October 24, 2021

Исследование показателей стратегии развития спорта в регионах РФ

Prokofiev V. N., Myachin A. L., Myachina K., Проблемы управления 2021 Т. 3 С. 50–57

Added: June 29, 2021

Pupillometry and autonomic nervous system responses to cognitive load and false feedback: an unsupervised machine learning approach

Evgeniia I. Alshanskaia, Portnova G., Liaukovich K. et al., Frontiers in Neuroscience 2024 Vol. 18 Article 1445697

Objectives: Pupil dilation is controlled both by sympathetic and parasympathetic nervous system branches. We hypothesized that the dynamic of pupil size changes under cognitive load with additional false feedback can predict individual behavior along with heart rate variability (HRV) patterns and eye movements reflecting specific adaptability to cognitive stress. To test this, we employed an ...

Added: September 2, 2024

Summable and nonsummable data‐driven models for community detection in feature‐rich networks

Shalileh S., Mirkin B., Social Network Analysis and Mining 2021 Vol. 11 No. 1 P. 1–23

A feature-rich network is a network whose nodes are characterized by categorical or quantitative features. We propose a data-driven model for finding a partition of the nodes to approximate both the network link data and the feature data. The model involves summary quantitative characteristics of both network links and features. We distinguish between two modes ...

Added: July 29, 2021

Методология повышения эффективности некоторых видов прямого маркетинга для розничного бизнеса

Zhukova L., Polyakov K. L., Polyakova M. V., Современная наука и инновации 2013 № 3 С. 75–81

Authors suggests some advices in the field of client base segmentation construction for retail profit-making organizations concerning their possible reaction on marketing campaigns. Advices are based on the results of research in one of the largest Russian retail network in the segment of mobile devices. ...

Added: February 2, 2015

Моделирование государственной состоятельности постсоциалистических стран

Stukal D., Khavenson T., Политическая экспертиза: ПОЛИТЭКС 2012 Т. 8 № 1 С. 238–264

The paper is focused on methodology of modeling state capacity developed within Research Project 47.0 Quantitative and Qualitative Analysis (including QCA) of Factors of Development and Decline of Statemanship of Socialist and Post-Socialist Countries in Europe and Asia at the Second Half of the XX Century and the Beginning of the XXI Century, carried out ...

Added: May 4, 2012

Оценка уровня адаптации специалистов к условиям трансформирующегося общества

Nizamova A. E., Вестник Российского университета дружбы народов. Серия: Социология 2012 № 1 С. 113–123

The author focuses her attention on the analysis of the general and the particular in the adaptation of specialists on the basis of the data collected in Russia by the NRI HSE in the course of monitoring the population’s economic situation and health (RLMS-HSE), comprising a vast body of classified information on the changes in ...

Added: January 20, 2013

Trajectories of Russian manufacturing firms’ growth after the global financial crisis of 2008–2009: the role of restructuring efforts and regional institutional environment

Boris Kuznetsov, Golikova V., Korotkov M. et al., Post-Communist Economies 2017 Vol. 29 No. 2 P. 139–157

The aim of this article is to conduct an empirical investigation and reveal which types of modernisation strategies and characteristics of regional institutional environment are likely to be associated with patterns of the performance of Russian manufacturing firms in 2007–2012. In addition to estimating the impact of ex-ante behaviour on the rate of sales growth, ...

Added: April 14, 2017