How many clusters? An Entropic Approach to Hierarchical Cluster Analysis

S. Koltsov; V. Ignatenko; S. Pashakhin

doi:10.1007/978-3-030-52243-8_40

Publications

?

How many clusters? An Entropic Approach to Hierarchical Cluster Analysis

P. 560–569.

Koltsov S., Ignatenko V., Pashakhin S.

Clustering large and heterogeneous data of user-profiles from social media is problematic as the problem of finding the optimal number of clusters becomes more critical than for clustering smaller and homo- geneous data. We propose a new approach based on the deformed R ́enyi entropy for determining the optimal number of clusters in hierarchical clustering of user-profile data. Our results show that this approach allows us to estimate R ́enyi entropy for each level of a hierarchical model and find the entropy minimum (information maximum). Our approach also shows that solutions with the lowest and the highest number of clusters correspond to the entropy maxima (minima of information).

Language: English

DOI

Text on another site

Keywords: number of clusters online social networks hierarchical clustering User profiles Rényi entropy

In book

Intelligent Computing: SAI 2020: Volume 3

Vol. 1230. Book 3. , Cham: Springer, 2020.

Fast Tuning of Topic Models: An Application of Rényi Entropy and Renormalization Theory

Sergei Koltcov, Ignatenko V., Pashakhin S., Proceedings 2020 Vol. 46 No. 1 P. 1–8

In practice, the critical step in building machine learning models of big data (BD) is costly in terms of time and the computing resources procedure of parameter tuning with a grid search. Due to the size, BD are comparable to mesoscopic physical systems. Hence, methods of statistical physics could be applied to BD. The paper ...

Added: March 12, 2020

Friends network expansion and reduction: investigating the role of structural and psychological factors

Sinyavskaya Y., Porshnev A., , in: Networks in the Global World V: Proceedings of NetGloW 2020. Lecture Notes in Networks and SystemsVol. 181. Springer, 2021. P. 196–208.

Rich data from social network sites (SNS) attracts the attention of psychologists and sociologists interested in interpersonal dynamics, friendship networks, and social capital. The presented study explores the effect of network structural features and psychological characteristics of SNS users on changes in their friendship networks. The data from the representative and diverse sample of 375 ...

Added: September 25, 2020

Networks in the Global World V: Proceedings of NetGloW 2020. Lecture Notes in Networks and Systems

Springer, 2021.

Added: September 22, 2020

Network Structure of an AIDS-denialist Online Community: Identifying Core Members and the Risk Group

Rykov Y., Meylakhs P., Sinyavskaya Y., / Высшая школа экономики. Series SOC "Sociology". 2016. No. 71.

Background: With the rapid growth of online social network sites (SNS), the issue of health-related online communities and its social and behavioral implications have become increasingly important for public health and healthcare. Unfortunately, online communities often become vehicles for promotion of pernicious misinformation, for example, alleged harm of vaccination or that HIV-virus is a myth ...

Added: October 11, 2016

Функциональность электронных платформ общественного участия: причем здесь социальные сети?

Revyakin S., Вопросы государственного и муниципального управления 2019 № 3 С. 88–106

Information technologies are actively used by government agencies to interact with citizens in public administration. At the same time exploitation of advanced communication technologies through Facebook, Instagram and WhatsApp requires developed communication capabilities of electronic platforms used for public participation. The article explores the features of e-participation platforms employed by public administration in Russia. It ...

Added: March 25, 2020

Do personality characteristics explain the associations between self-esteem and online social networking behaviour?

Shchebetenko S., Computers in Human Behavior 2019 Vol. 91 P. 17–23

The relationships between online social networking (OSN) behaviour and users’ self-esteem are as important as well as ambiguous: Both positive and negative self-esteem can encourage users to engage in OSNs. This work examined whether personality traits and attitudes toward traits can explain this controversy. Data from 830 users of a local OSN were analysed. I ...

Added: September 24, 2018

Особенности использования социальных сетей в связи с прокрастинацией и саморегуляцией

Kornienko D. S., Руднова Н. А., Психологические исследования: электронный научный журнал 2018 Т. 11 № 59

Nowadays online social networks (Facebook, VKontake, etc.) are the part of modern life, but they are also the cause of achievement (especially academic) decline. Self-regulation and procrastination are the characteristics that can be defined as personality factors in online social network use. Procrastination is conceptualized as postponing some actions crucial to the timely completion of ...

Added: November 1, 2019

From quantitative to semantic analysis: Russian construcitons with dative subjects in diachrony

Bonch-Osmolovskaya A. A., , in: Quantitative approaches to the Russian language. Abingdon: Routledge, 2018. P. 158–174.

The paper presents diachronic study of dative subject constructions with predicatives in Russian. The dataset from corpus of 19-21 century is analysed with clustering method, the classes of predicates which examin similar behaviour are defined. Semantic interpretation is proposed for the observed distribution. ...

Added: July 14, 2017

Инструменты описания неполной коммуникации в блогосфере

Гусейнов Г.Ч., В кн.: Русский язык и новые технологии. М.: Новое литературное обозрение, 2014. Гл. 14 С. 227–238.

Выражение «неполная коммуникация» представляет собой плеоназм: никакое общение не может обеспечить полной тождественности отправленного и принятого сообщения. И все же участники общения стремятся по мере сил избегать естественных разночтений, переспрашивать друг друга, публиковать соответствующие словари в поисках общезначимых единиц обмена, расширять каналы передачи сообщений и т.п. На первый взгляд, виртуальное пространство создает условия для максимальной ...

Added: February 3, 2014

От социального пространства к пространству онлайн социальных сетей: исследовательские подходы и вызовы

Минина В. Н., Василькова В. В., Социальное пространство 2019 № 5(22) С. 1–15

The article discusses the actual sociological problem of interrelation of ontological and epistemological bases in the study of social space. The authors draw attention to the fragmentation of Internet space research and aim to analyze the transformation of scientific approaches to the problem of social space in the process of nderstanding the new ontology of ...

Added: October 7, 2022

Extracting Functional Job Roles From Professional Social Networking Sites Profiles

Nesterenko A., , in: Supplementary Proceedings of the 5th International Conference on Analysis of Images, Social Networks and Texts (AIST-SUP 2016), Yekaterinburg, Russia, April 7-9, 2016.Vol. 1710. Aachen: CEUR Workshop Proceedings, 2016.

Added: November 3, 2020

Большая Пятерка черт личности и активность пользователей в социальной сети «Вконтакте»

Shchebetenko S., Вестник Южно-Уральского государственного университета. Серия: Психология 2013 Т. 6 № 4 С. 73–83

На выборке из 1079 студентов изучалась связь Большой Пятерки черт личности с пользовательской активностью в социальной сети «Вконтакте» Были определены 12 фактических показателей активности пользователей на сайте. Установлено, что экстраверсия, нейротизм и доброжелательность являются важными независимыми предикторами многих поведенческих показателей активности, среди которых - количество друзей, количество записей на стене, количество фотографий, количество отметок «мне ...

Added: September 24, 2018

From equality to diversity: Classifying Russian universities in a performance oriented system

Abankina I., Aleskerov F. T., Belousova V. et al., Technological Forecasting and Social Change 2016 No. 103 P. 228–239

Over the last few decades, performance-based funding models of universities have been introduced and have made universities build and implement different strategies to enable them to compete and be viable in changing circumstances. In turn, national governments are focused on providing universities with more opportunities to run efficient programmes that advance higher education. This paper ...

Added: October 14, 2015

Clustering of modal-valued symbolic data

Kejžar N., Korenjak-Černe S., Batagelj V., Advances in Data Analysis and Classification 2021 Vol. 15 No. 2 P. 513–541

Symbolic data analysis is based on special descriptions of data known as symbolic objects (SOs). Such descriptions preserve more detailed information about units and their clusters than the usual representations with mean values. A special type of SO is a representation with frequency or probability distributions (modal values). This representation enables us to simultaneously consider ...

Added: November 18, 2020

Анализ профилей сообществ социальных сетей

Соколова Т. В., Chepovskiy A., Системы высокой доступности 2018 Т. 14 № 3 С. 82–86

This paper presents the problem of forming user profiles based ondata from social networks. For user profiling both the user and his friends data are used. Community allocation algorithms in social graphs are used to detect groups of communication. Each community has its own profile, which includes the characteristics of the users that belong to it ...

Added: October 28, 2018

Application of Modern Data Analysis Methods to Cluster the Clinical Pathways in Urban Medical Facilities

Prokofyeva E. S., Zaitsev R., Maltseva S. V., , in: 2019 IEEE 21st Conference on Business Informatics (CBI)Vol. 1. M.: IEEE Computer Society, 2019. P. 75–83.

Patient flow modeling in healthcare plays a large role in understanding the operation of the system and its characteristics. Besides, modeling techniques can significantly improve the effectiveness of the medical facilities. The existing level of automation in these facilities enables the accumulation of large amounts of various data. Therefore, the collected data might be considered ...

Added: September 10, 2019

Communications in Computer and Information Science, Vol. 542, Springer, 2015

Semenov A., Natekin A., Nikolenko S. I. et al., Springer, 2015.

In online social networks, high level features of user behavior such as character traits can be predicted with data from user profiles and their connections. Recent publications use data from online social networks to detect people with depression propensity and diagnosis. In this study, we investigate the capabilities of previously published methods and metrics applied to the Russian online social ...

Added: December 21, 2015

Social signature in an online environment: stability and cognitive limits

Koltsova Olessia Y., Mararitsa Larisa V., Terpilovskii Maxim A. et al., Computers in Human Behavior 2021 No. 122 Article 106856

Social tie maintenance has always had cognitive and emotional costs and has always been leading to uneven distribution of communication volume among egos' alters. This distribution, known as a social signature, is assumed to be relatively stable for each individual. Availability of digital traces of human communication allows testing whether this assumption is true and whether it holds in specific ...

Added: May 11, 2021

Русский язык и новые технологии

М.: Новое литературное обозрение, 2014.

Changes in modern Russian due to the expansion of the new technologies; Russian of the Internet (Runet). Social and cultural consequences of the CMC-revolution. ...

Added: February 3, 2014

Capturing the right number of clusters with K-Means using the complementary criterion and affinity propagation

Токмаков М. А., Mirkin B., Journal of Classification 2017

K-Means is a most popular method for clustering. Yet it has some shortcomings such as the need in prior choice of the number of clusters K and a starting location of their centers. This paper pursues an approach of taking advantage of a reformulation of the square-error criterion based on a Pythagorean decomposition of the ...

Added: April 15, 2017

Social Networks as a New Tool that Facilitates the Development of Urban Adolescents

Polivanova K. N., Koroleva D., Russian Education and Society 2018 Vol. 60 No. 6 P. 496–505

The article demonstrates that the extent and mode of adolescent participation in social networks have changed with the rapid development of social media technologies. If, at the time when they were first introduced, social networks complemented direct communication and were studied as a separate additional space, now, because of the development of mobile technologies, direct ...

Added: December 12, 2018

Между алгоритмическим воображаемым и «большими данными»

Колпинец Е. В., Социодиггер 2021 Т. 2 № 7 (12) С. 16–21

В разговоре об алгоритмах, их влиянии на коллективное и индивидуальное восприятие соцсетей, на мой взгляд, важен конфликт очищенных от контекста больших данных, на которые так любят ссылаться маркетологи и аналитики, и пользовательских практик, пронизанных аффектами и догадками. Даже имея большие данные, например, подробную статистику пользователей с учетом возраста, гендера, территориальной принадлежности пользователей, исследователи и маркетологи часто не ...

Added: April 19, 2022

Использование социальных сетей для целей образования и социализации подростка: аналитический обзор эмпирических исследований (международный опыт)

Koroleva D., Психологическая наука и образование 2015 Т. 20 № 1 С. 28–37

We present a review of foreign studies on the use of social networks in teaching practice. We provide the description of social media as a potential new resource, providing the organization of teacher-student interaction, group communication of students, increasing the involvement of students in the learning process. It is shown that social networking services are ...

Added: March 31, 2015

Social Media-based Research of Interpersonal and Group Communication in Russia

Koltsova O., Porshnev A., Sinyavskaya Y., , in: The Palgrave Handbook of Digital Russia Studies. Palgrave Macmillan, 2021. P. 335–352.

Rapidly proliferating social media not only serve as a new channel of human communication but also open up research opportunities to ask a wider set of questions about political, sociological and psychological factors that influence interpersonal and group online communication, development and maintenance of personal networks, and the growth or decline of social capital. In ...

Added: September 25, 2020