Detection of an unspecified number of communities in feature-rich networks

S. Shalileh; B. Mirkin

?

Detection of an unspecified number of communities in feature-rich networks

P. 1–12.

The problem of community detection in a network with features at its nodes takes into account both the graph structure and node features. The goal is to find relatively dense groups of interconnected entities sharing some features in common. Existing approaches require the number of communities pre-specified. We apply the so-called data recovery approach to allow a relaxation of the criterion for finding communities one-by-one. We show that our proposed method is effective on real-world data, as well as on synthetic data involving either only quantitative features or only categorical attributes or both. In the cases at which attributes are categorical, state-of-the-art algorithms are available. Our algorithm appears competitive against them. © 2020 CEUR-WS. All rights reserved.

Language: English

Full text

Text on another site

Keywords: network analysis clustering Community detection Feature-rich Networks

In book

Proceedings of MARAMI 2020 - Modèles & Analyse des Réseaux : Approches Mathématiques & Informatiques - The 11th Conference on Network Modeling and Analysis(Vol-2750)

Vol. Vol-2750: Modèles & Analyse des Réseaux : Approches Mathématiques & Informatiques - Network Modeling and Analysis 2020. , CEUR-WS.org, 2020.

Анализ культурных референций в творчестве А. Вознесенского: цифровое исследование имен персоналий

Tyuryakova-Matveeva D., Цифровые гуманитарные исследования 2026 № 1 С. 4–26

The article explores cultural references in the works of Andrei Voznesensky by analyzing the personalities he mentions. A total of 1,678 works were processed, including poetry, prose, and early unpublished poems. NER methods based on Natasha, spaCy, and LLM Grok tools made it possible to study the frequency of mentions of famous people and their ...

Added: May 31, 2026

Bibliometric Analysis by Network Models

Aleskerov F. T., Khutorskaya O., Stepochkina A. et al., Springer, 2026.

The book contains new models of bibliometric analysis based on centrality measures in network analysis, pattern analysis and stability analysis. A distinctive feature of these centrality measures is that they account for the parameters of vertices and group influence of vertices to a vertex. This reveals specific groups of publications, authors, terms, journals and affiliations ...

Added: May 15, 2026

Обэриуты в кругу Михаила Кузмина (сетевой анализ)

Pakhomova A., Вестник Московского университета. Серия 9: Филология 2026 № 1 С. 162–177

Beginning in the mid-1920s Aleksandr Vvedensky, Daniil Kharms, and Konstantin Vaginov became acquainted with the circle of the poet, writer, and playwright Mikhail Kuzmin, and by the close of this decade, they became regular visitors to his residence. The interactions between Kuzmin and the Oberiuts has been sufficiently developed; however, numerous studies have shifted the focus to the pragmatics of ...

Added: April 1, 2026

Возможности применения семантических сетей для анализа качественных данных

Barkhatova L., В кн.: Человек в информационном обществе: сборник материалов третьей международной научно-практической конференции, посвящённой 80-летию Победы в Великой Отечественной войне, 23–26 апреля 2025 года, г. Самара.: Самара: Самарский национальный исследовательский университет имени академика С.П. Королева, 2025. С. 94–98.

The analytical possibilities of using semantic network analysis in qualitative research are considered. A scheme for constructing a semantic map and its integration with the results of the qualitative stage is proposed. It is shown that the implementation of semantic networks for analyzing qualitative data enables validation of conclusions. ...

Added: January 25, 2026

Flexible Stock Market Algorithm

Rubchinskiy A., Chubarova D., Technology and Investment 2025 Vol. 16 No. 4 P. 211–240

The article considers one of the most famous examples of socio-economic systems characterized by significant uncertainty—the S&P-500 stock market, where shares of 500 largest US companies are traded. The flexible algorithm for daily trading has been developed. It is based on known fixed data about cost of shares in previous days as well as on ...

Added: December 19, 2025

Национальные и институциональные связи депутатов Народного собрания Республики Дагестан VII созыва

Андриянов Д. А., Брагина А. Б., Kechik U. et al., Полития: Анализ. Хроника. Прогноз 2025 Т. 118 № 3 С. 80–107

It has long been generally accepted in Political Science that distributed elite networks, rather than institutions, are crucial to the functioning of the Russian political system. One of the important factors in the formation of such networks is ethnicity. Of particular interest is the ethnic dynamics in those regions, where there is no dominant ethnic ...

Added: September 25, 2025

Метрики центральности сетевого анализа как инструмент определения уровня абстракции понятий внутри понятийной структуры обучающихся

Andronova E., Kapuza A., Психологические исследования: электронный научный журнал 2025 Т. 18 № 102 Статья 3

This study examines concepts as systemic elements within a conceptual structure, drawing on the theoretical framework of L.S. Vygotsky’s cultural-historical psychology. A key characteristic of concepts is their level of abstraction, which reflects their degree of generalization within a system. However, existing classification methods lack a unified approach, complicating empirical research. The current study aimed ...

Added: September 19, 2025

Опыт применения сетевых моделей для анализа для пространственного анализа практик джерримендеринга на выборах в Конгресс США в 2000–2020 гг.

Глумов Ф. В., Maltsev A., Terra Politica 2025 № 1 С. 167–186

The present study is devoted to the phenomenon of gerrymandering on the example of US Congressional elections. The boundaries of US Congressional electoral districts were used for the analysis. The division of states into districts was based on census data from 2000 to 2020. Data on small administrative units (counties) similarly relied on census data. ...

Added: August 6, 2025

Mapping the Russian Media Field Through Audience Networks and Agenda Choice

Loseva A., Moroz A., Shmidt E. et al., International Journal of Communication 2024 Vol. 18 P. 1–26

In light of the “gardening” of the public sphere in autocracies, the question of how power is distributed in the media field calls for empirical investigation. We use computational methods of network analysis, topic modeling, and semantic analysis to test if the Russian media landscape is organized around the three “publics” as suggested by earlier ...

Added: July 7, 2025

Connectedness of entrepreneurial ecosystems: evidence from the mobility of knowledge-intensive entrepreneurs

Spinazzola M., Scuotto V., Pironti M. et al., Small Business Economics 2025 Vol. 65 P. 1517–1534

Entrepreneurial ecosystems (EEs) are regarded as ideal breeding ground for knowledge-intensive entrepreneurs (KIEs). Yet, as EEs are mostly considered isolated from each other and their connectedness is neglected, there is a lack of research on their capacity to attract KIEs rather than to locally nurturing them. Inadequate data has been a major obstacle to this ...

Added: June 30, 2025

New centrality indices in network analysis

Fuad Aleskerov, Tkachev D., , in: In Honor of the 70th Birthday of Panos Pardalos. Theory, Algorithms and Experiments in Applied Optimization. SOIA, volume 226Vol. 226.: Springer, 2025. Ch. 2 P. 22–37.

New centrality indices Bundle and Pivotal, were introduced to identify key vertices taking into account parameters of vertices and an influence of vertices to a vertex. However, the Bundle and Pivotal indices are constructed in that way that they do not take into account directly the weights of arcs – important information about interaction among vertices ...

Added: April 29, 2025

Measuring Personal Networks with Core Discussion Network Methodology: A Case of Russian Students

Mikhaylova O., Dokuka S., Quality and Quantity 2025 Vol. 59 P. 4077–4095

This paper explores the use of the Core Discussion Network (CDN) methodology for analyzing personal networks. We discuss the method’s origins and the specifics of its implementation, followed by a brief overview of findings from international studies. Using data collected in 2021 from 270 first-year undergraduate and graduate students at a top-tier Russian university, we ...

Added: April 19, 2025

Networks Under Deep Uncertainty

Fuad Aleskerov, Tkachev D., , in: Dynamics of Disasters: From Natural Phenomena to Human ActivityVol. 217.: Springer, 2024. Ch. 1 P. 1–13.

The situation of deep uncertainty is defined by the absence of any statistical evaluations of the situation development. For instance, such situations may include events that occur for the first time. We use scenario analysis to model the potential outcomes of events affecting networks under deep uncertainty. Centrality indices are used to identify vulnerable vertices ...

Added: March 5, 2025

Tunnel Clustering Method

F. T. Aleskerov, A. L. Myachin, V. I. Yakuba, Doklady Mathematics 2024 Vol. 110 No. 3 P. 474–479

We propose a novel method for rapid pattern analysis of high-dimensional numerical data, termed tunnel clustering. The main advantages of the method are its relatively low computational complexity, endogenous determination of cluster composition and number, and a high degree of interpretability of final results. We present descriptions of three different variations: one with fixed hyperparameters, ...

Added: March 3, 2025

Использование Z-чисел для описания набора данных

Гусейнов О., Degtyarev K. Y., IRETC MTÜ PAHTEI - Proceedings of Azerbaijan High Technical Educational Institutions 2025 Т. 48 № 1 С. 360–370

The concept of Z-number was proposed by Prof. Lotfi Zadeh to describe partial reliability of information, and it is a kind of fusion of fuzziness and probabilistic uncertainty. Z-number can be presented as a pair of fuzzy numbers Z(A,B) used to describe a value of a random variable X. The first component (A) is a ...

Added: February 20, 2025

A Comparative Analysis of Centrality Measures in Complex Networks

Meshcheryakova N., Швыдун С. В., Automation and Remote Control 2024 No. 85 P. 685–695

Identification of central elements in networks is an ill-defined problem. Hence, a large number of centrality measures have been proposed in the literature. We present a survey of existing axioms, which characterize certain properties of centralities. We also perform a perturbation analysis of centrality measures in real and artificial networks. ...

Added: December 2, 2024

Gradient descent clustering with regularization to recover communities in transformed attributed networks

Shalileh S., Social Network Analysis and Mining 2025 Vol. 15212 P. 137–148

Community detection in attributed networks aims to recover clusters in which the within-community nodes are as interconnected and as homogeneous as possible, while the between-communities nodes are as disconnected and as heterogeneous as possible. The current research proposes a straightforward data-driven model with an integrated regularization term to recover communities. For further improvement of the ...

Added: November 30, 2024

An empirical scrutinization of four crisp clustering methods with four distance metrics and one straightforward interpretation rule

T. A. Alvandyan, S. Shalileh, Doklady Mathematics 2024 Vol. 110 No. S1 P. S236–S250

Clustering has always been in great demand by scientific and industrial communities. However, due to the lack of ground truth, interpreting its obtained results can be debatable. The current research provides an empirical benchmark on the efficiency of three popular and one recently proposed crisp clustering methods. To this end, we extensively analyzed these (four) ...

Added: November 30, 2024

Network psychometric and Item Response Theory (IRT) approach to validating the Russian adult version of the Structure of Temperament Questionnaire (STQ-77Ru)

Gallyamova A., Grigoryev D., Natural Systems of Mind 2025 Vol. 5 No. 1 P. 27–41

This study examines the psychometric properties of the Russian adult version of the Structure of Temperament Questionnaire (STQ-77Ru) within a diverse community sample. Employing both network psychometrics and Item Response Theory (IRT), we analyzed data from 3,442 Russian participants, aged between 18 and 81 years (M = 38, SD = 11). Network psychometrics were used ...

Added: November 1, 2024