Summable and nonsummable data‐driven models for community detection in feature‐rich networks

S. Shalileh; B. Mirkin

doi:10.1007/s13278-021-00774-8

Publications

?

Summable and nonsummable data‐driven models for community detection in feature‐rich networks

Social Network Analysis and Mining. 2021. Vol. 11. No. 1. P. 1–23.

Shalileh S., Mirkin B.

A feature-rich network is a network whose nodes are characterized by categorical or quantitative features. We propose a data-driven model for finding a partition of the nodes to approximate both the network link data and the feature data. The model involves summary quantitative characteristics of both network links and features. We distinguish between two modes of using the network link data. One mode postulates that the link values are comparable and summable across the network (summability); the other assumption models the case in which different nodes represent different measurement systems so that the link data are neither comparable, nor summable, across different nodes (nonsummability). We derive a Pythagorean decomposition of the combined data scatter involving our data recovery least-squares criterion. We address an equivalent problem of maximizing its complementary part, the contribution of a found partition to the combined data scatter. We follow a doubly greedy strategy in maximizing that. First, communities are found one-by-one, and second, entities are added one-by-one in the process of identifying a community. Our algorithms determine the number of clusters automatically. The nonsummability version proves to have a niche of its own; also, it is faster than the other version. In our experiments, they appear to be competitive over generated synthetic data sets and six real-world data sets from the literature.

Research target: Computer Science

Priority areas: IT and mathematics

Keywords: social network analysis community detection algorithms Community detection clustering algorithms Feature-rich Networks

Publication based on the results of:

Data analysis and choice of solutions in the studies of socio-economic and political systems (2021)

Least-squares community extraction in feature-rich networks using similarity data

Shalileh S., Mirkin B., Plos One 2021 Vol. 16 No. 7 Article 0254377

We explore a doubly-greedy approach to the issue of community detection in feature-rich networks. According to this approach, both the network and feature data are straightfor- wardly recovered from the underlying unknown non-overlapping communities, supplied with a center in the feature space and intensity weight(s) over the network each. Our least- squares additive criterion allows ...

Added: July 22, 2021

Combined method to detect communities in graphs of interacting objects

Chepovskiy A., Lobanova S., Business Informatics 2017 Vol. 42 No. 4 P. 64–73

A method for detecting intersecting and nested communities in graphs of interacting objects of different nature is proposed and implemented. For this, two classical algorithms are taken: a hierarchical agglomerate and based on the search for k -cliques. The presented combined method is based on their consistent application. In addition, parametric options are developed that ...

Added: August 2, 2019

Алгоритмы выделения групп общения

Лещёв Д. А., Сучков Д. В., Хайкова С. П. et al., Вопросы кибербезопасности 2019 Т. 32 № 4 С. 61–71

The purpose of the study: development of methods for analyzing the graph of interacting objects based on the detection of implicit communities in order to solve the problems of searching for the proximity of profiles and the exchange, distribution of information between objects. Method: importing data from social networks with the subsequent construction of a weighted ...

Added: August 2, 2019

Network partitioning algorithms as cooperative games

Avrachenkov K. E., Kondratev Aleksei Y, Mazalov V. V. et al., Computational Social Networks 2018 Vol. 5 No. 11 P. 1–28

The paper is devoted to game-theoretic methods for community detection in networks. The traditional methods for detecting community structure are based on selecting dense subgraphs inside the network. Here we propose to use the methods of cooperative game theory that highlight not only the link density but also the mechanisms of cluster formation. Specifically, we ...

Added: October 30, 2018

Модели импорта данных из Твиттера

Попов В. А., Chepovskiy A., Вестник Новосибирского государственного университета. Серия: Информационные технологии 2021 Т. 19 № 2 С. 76–91

In this paper, the authors describe an algorithm for importing data from the social network Twitter and building weighted social graphs. To import data, the given posts are taken as a basis, users who have had any of the recorded interactions with them are downloaded. Further, the algorithm focuses on the given configuration and uses ...

Added: July 25, 2021

A One-by-One Method for Community Detection in Attributed Networks

Shalileh S., Mirkin B., , in: Intelligent Data Engineering and Automated Learning – IDEAL 2020/ 21st International Conference, Guimaraes, Portugal, November 4–6, 2020, Proceedings, Part IIVol. 12490: Lecture Notes in Computer Science. Cham: Springer, 2020. P. 413–422.

The problem of community detection in a network with features at its nodes takes into account both the graph structure and node features. The goal is to find relatively dense groups of interconnected entities sharing some features in common. We apply the so-called data recovery approach to the problem by combining the least-squares recovery criteria ...

Added: November 14, 2020

Communications in Computer and Information Science, Vol. 542, Springer, 2015

Semenov A., Natekin A., Nikolenko S. I. et al., Springer, 2015.

In online social networks, high level features of user behavior such as character traits can be predicted with data from user profiles and their connections. Recent publications use data from online social networks to detect people with depression propensity and diagnosis. In this study, we investigate the capabilities of previously published methods and metrics applied to the Russian online social ...

Added: December 21, 2015

Recent Trends in Analysis of Images, Social Networks and Texts. 9th International Conference, AIST 2020, Skolkovo, Moscow, Russia, October 15–16, 2020 Revised Supplementary Proceedings

Springer, 2021.

This book constitutes revised selected papers from the 9th International Conference on Analysis of Images, Social Networks and Texts, AIST 2020, held during October 15-16, 2020. The conference was planned to take place in Moscow, Russia, but changed to an online format due to the COVID-19 pandemic. The 27 full papers and 4 short papers presented ...

Added: October 7, 2020

Supplementary Proceedings of the 3rd International Conference on Analysis of Images, Social Networks and Texts (AIST 2014)

Ekaterinburg: CEUR Workshop Proceedings, 2014.

AIST'2014 is an international data science conference on Analysis of Images, Social Networks, and Texts. Traditionally, the conference is held annually in Yekaterinburg, Russia. The conference is intended for computer scientists and practitioners whose research interests involve Internet mathematics and other related fields of data science. LIST OF TOPICS (NON EXHAUSTIVE) Applications of Data Mining and Machine ...

Added: August 28, 2014

Communications in Computer and Information Science

Cham: Springer, 2014.

The CCIS series is devoted to the publication of proceedings of computer science conferences. Its aim is to efficiently disseminate original research results in informatics in printed and electronic form. While the focus is on publication of peer-reviewed full papers presenting mature work, inclusion of reviewed short papers reporting on work in progress is welcome, ...

Added: October 15, 2014

Survey on graph embeddings and their applications to machine learning problems on graphs

Makarov I., Kiselev D., Nikitinsky N. et al., PeerJ Computer Science 2021 Vol. 7 P. 1–62

Dealing with relational data always required significant computational resources, domain expertise and task-dependent feature engineering in order to incorporate structural information into predictive model. Nowadays, a family of automated graph feature engineering techniques have been proposed in different streams of literature. So-called graph embeddings provide a powerful tool to construct vectorized feature spaces for graphs ...

Added: October 27, 2020

Galaxy Clusters Reconstruction

Zarodnyuk A., Trofimova E., Solovyov A. et al., Journal of Physics: Conference Series 2021 No. 1740 Article 012017

In the present work, we introduce a machine learning-based approach for galaxy clustering. It requires to determine clusters to provide further galaxies groups' masses estimation. The knowledge of mass distribution is crucial in dark matter research and study of the large-scale structure of the Universe. State-of-the-art telescopes allow various spectroscopy range data accumulation that highlights ...

Added: January 25, 2021

Характеристики текстов сообществ социальных сетей

Аванесян Н. Л., Соловьев Ф. Н., Chepovskiy A., Вестник Новосибирского государственного университета. Серия: Информационные технологии 2021 Т. 19 № 1 С. 5–14

In this paper the authors describe the methodology for the statistical analysis of texts in social networks based on comparison of automatically generated frequency dictionaries by methods of correlation analysis. Psycholinguistic characteristics and coefficients of pairwise rank correlation are considered for comparing the frequency characteristics of texts in natural language ...

Added: April 14, 2021

CEUR Workshop Proceedings. Proceedings of the International Workshop on Social Network Analysis using Formal Concept Analysis (SNAFCA 2015)

Malaga: CEUR Workshop Proceedings, 2015.

Social network analysis (SNA) is a multidisciplinary research area that has attracted many researchers from different disciplines such as Physics, Mathematics, Sociology, Biology and Computer Science, and has been studied according to different approaches and techniques. A social network is a dynamic structure (generally represented as a graph) of a set of entities/actors (nodes) together ...

Added: October 19, 2015

Proceedings of the 6th International Conference on Knowledge Discovery and Data Mining, Workshop on Social Network Mining and Analysis

ACM, 2012.

The sixth SNA-KDD workshop (www.snakdd.com) is proposed as the sixth in a successful series of workshops on social network mining and analysis co-held with KDD, soliciting experimental and theoretical work on social network mining and analysis in both online and offline social network systems. ...

Added: March 4, 2015

Machine Learning and Data Mining in Pattern Recognition

Springer, 2014.

This book constitutes the refereed proceedings of the 10th International Conference on Machine Learning and Data Mining in Pattern Recognition, MLDM 2014, held in St. Petersburg, Russia in July 2014. The 40 full papers presented were carefully reviewed and selected from 128 submissions. The topics range from theoretical topics for classification, clustering, association rule and ...

Added: September 30, 2014

Analysis of Images, Social Networks and Texts Third International Conference, AIST 2014, Yekaterinburg, Russia, April 10-12, 2014, Revised Selected Papers

Berlin: Springer, 2014.

This book constitutes the proceedings of the Third International Conference on Analysis of Images, Social Networks and Texts, AIST 2014, held in Yekaterinburg, Russia, in April 2014. The 11 full and 10 short papers were carefully reviewed and selected from 74 submissions. They are presented together with 3 short industrial papers, 4 invited papers and ...

Added: November 13, 2014

Выделение сообществ в графе взаимодействующих объектов

Коломейченко М. И., Polyakov I. V., Chepovskiy A. et al., Фундаментальная и прикладная математика 2016 Т. 21 № 3 С. 131–139

This article describes the problem of analysis of social network graphs and other interacting objects. It also presents community detection algorithms in social networks, their classification and analysis. In addition, it considers applicability of algorithms for real tasks in social network graph analysis. ...

Added: February 23, 2017

Growing Homophilic Networks Are Natural Navigable Small Worlds

Мальков Ю. А., Ponomarenko A., Plos One 2016 Vol. 11 No. 6 P. 1–14

Navigability, an ability to find a logarithmically short path between elements using only local information, is one of the most fascinating properties of real-life networks. However, the exact mechanism responsible for the formation of navigation properties remained unknown. We show that navigability can be achieved by using only two ingredients present in the majority of ...

Added: September 9, 2016

Formal Concept Analysis of Social Networks

Springer, 2017.

The book studies the existing and potential connections between Social Network Analysis (SNA) and Formal Concept Analysis (FCA) by showing how standard SNA techniques, usually based on graph theory, can be supplemented by FCA methods, which rely on lattice theory. The book presents contributions to the following areas: acquisition of terminological knowledge from social networks, knowledge ...

Added: December 17, 2017

Detection of Communities in a Graph of Interactive Objects

Chepovskiy A., Chepovskiy A., Polyakov I. V. et al., Journal of Mathematical Sciences 2019 Vol. 237 No. 3 P. 426–431

This article describes the problem of analysis of social network graphs and other interacting objects. It also presents community detection algorithms in social networks and their classification and analysis. In addition, it considers applicability of algorithms for real tasks in social network graph analysis. ...

Added: June 5, 2019

Взаимосвязь сетевых характеристик и субъектности сетевых сообществ в социальной сети Твиттер

Chepovskiy A., Воронин А. Н., Ковалева Ю. В., Вопросы кибербезопасности 2020 Т. 37 № 3 С. 40–57

The purpose of the study: analysis of the graph of interacting objects of social networks based on the selection of implicit communities, assessment of the subjectivity of the selected communities and comparison of the network characteristics of communities and various indicators of their subjectivity. Method: communities detection on the constructed weighted graph of a social network, ...

Added: August 26, 2020

A Method for Community Detection in Networks with Mixed Scale Features at its Nodes

Mirkin B., , in: Complex Networks & Their Applications IX. Volume 1: Proceedings of the Ninth International Conference on Complex Networks and Their Applications COMPLEX NETWORKS 2020. Springer, 2021. P. 3–14.

Added: October 31, 2020