Defining Kinds of Violence in Russian Short Stories of 1900–1930: A Case of Topic Modelling With LDA and PCA
P. 281-290.
Gryaznova E., Kirina M.
In book
CEUR Workshop Proceedings, 2021
Byzov A., Социология: методология, методы, математическое моделирование 2019 No. 49 P. 131-160
Throughout most of their history, sociologists have sought to study unstructured organic texts: newspaper materials, diaries, memoirs, letters, documents, and, more recently, messages, publications and other texts on various online platforms. This article discusses how modern techniques of text mining can improve classical sociological approaches to the analysis of this type of data. The article ...
Added: December 9, 2019
Bogomolov E., Golubev Y., Lobanov A. et al., in: ASE '20: Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering. ACM, 2020. P. 1316-1320.
Added: October 26, 2021
Zelenkov Y., in: Knowledge Management in Organizations. 14th International Conference, KMO 2019, Zamora, Spain, July 15–18, 2019, Proceedings. Vol. 1027. Switzerland: Springer, 2019. P. 324-335.
The intellectual structure of an academic discipline can be viewed as a set of interacting topics evolving over time. The dynamics of those topics, i.e. changes in their popularity and impact, are the subject of special attention because they reflect a shift in researchers' actual interests. This paper analyzes topics of knowledge management (KM) on the base ...
Added: June 14, 2019
Sergei Koltcov, Ignatenko V., Pashakhin S., Proceedings 2020 Vol. 46 No. 1 P. 1-8
In practice, the critical step in building machine learning models of big data (BD) is parameter tuning with a grid search, a procedure that is costly in terms of time and computing resources. Owing to their size, BD are comparable to mesoscopic physical systems. Hence, methods of statistical physics could be applied to BD. The paper ...
Added: March 12, 2020
Nikita Kaspruk, Olga Silyutina, Karepin V., in: Digital Transformation & Global Society: Second International Conference, DTGS 2017, St. Petersburg, Russia, June 21-23, 2017, Revised Selected Papers. Springer, 2017. P. 341-346.
In this work in progress, we analyze how perceived hotel value dimensions and the perception of city sights are connected with categories of hotels. Applying a topic modelling algorithm to 21,165 reviews from 201 hotels located in Saint Petersburg, we show that clients of hotels of different categories pay attention to different value dimensions. Analyzing ...
Added: December 2, 2017
Koltsov S., Koltsova O., Nikolenko S. I., in: Proceedings of WebSci '14 ACM Web Science Conference, Bloomington, IN, USA, June 23-26, 2014. NY: ACM, 2014. P. 161-165.
Topic modeling, in particular the Latent Dirichlet Allocation (LDA) model, has recently emerged as an important tool for understanding large datasets, in particular, user-generated datasets in social studies of the Web. In this work, we investigate the instability of LDA inference, propose a new metric of similarity between topics and a criterion of vocabulary reduction. ...
Added: October 17, 2014
Бойченко А. Е., Zhuchkova S., Журнал социологии и социальной антропологии 2020 Vol. 23 No. 2 P. 130-165
The study presents an attempt at a comprehensive exploratory analysis of Russian rap based on a corpus of Russian-language song texts of this genre. The corpus contains more than 11,000 texts by more than 500 artists, varying in date of creation and popularity, collected by automatically extracting data from web ...
Added: August 12, 2020
Svetlana S. Bodrunova, Koltsova O., Sergey Koltcov et al., International Journal of Communication 2017 Vol. 11 P. 3242-3264
Communication in social media is increasingly being found to reproduce or even reinforce ethnic prejudice and hostility toward migrants. In Russia of the 2010s, home to the world's second-largest immigrant population, polls have detected high levels of hostility of the Russian population toward migranty (migrants), a label attached to resettlers from Central Asia and the ...
Added: October 4, 2017
Apishev M., Koltsov S., Koltsova O., Computacion y Sistemas 2016 Vol. 20 No. 3 P. 387-403
Social studies of the Internet have adopted large-scale text mining for unsupervised discovery of topics related to specific subjects. A recently developed approach to topic modeling, additive regularization of topic models (ARTM), provides fast inference and more control over the topics with a wide variety of possible regularizers than developing LDA extensions. We apply ARTM ...
Added: November 17, 2016
Kolmogorova A., in: Proceedings of the International Conference "Artificial Intelligence in Automated Control Systems and data processing". [s.n.], 2023.
The article is devoted to the problem of the specificity of the semantics and structure of the texts of social networks, which represent different emotional states of their authors. The hypothesis of whether there is specificity in the thematic content of texts of different emotional classes is tested on the material of eight subcorpora of ...
Added: December 10, 2023
Shirokanova A., Silyutina O., in: Digital Transformation and Global Society: Third International Conference, DTGS 2018, St. Petersburg, Russia, May 30–June 2, 2018, Revised Selected Papers, Part I. Issue 858. Cham: Springer, 2018. P. 181-194.
Internet regulation in Russia has vigorously expanded in recent years to transform the relatively free communication environment of the 2000s into a heavily regulated one. Our goal was to identify the topic structure of Russian media discourse on Internet regulation and compare it between political and non-political media outlets. We used structural topic modeling on ...
Added: October 10, 2018
Koltsov S., Physica A: Statistical Mechanics and its Applications 2018 Vol. 512 P. 1192-1204
This study proposes to minimize Rényi and Tsallis entropies for finding the optimal number of topics T in topic modeling (TM). A promising tool to obtain knowledge about large text collections, TM is a method whose properties are underresearched; in particular, parameter optimization in such models has been hindered by the use of monotonous quality ...
Added: October 11, 2018
Koltsova O., Koltcov S., Alexeeva S., in: Proceedings of WebSci '14 ACM Web Science Conference, Bloomington, IN, USA, June 23-26, 2014. NY: ACM, 2014. P. 166-170.
In this paper we describe structural and topical properties of "ordinary" blogs versus "popular" blogs. Using the complete directory of the Russian language LiveJournal, we sample both groups and show that the main difference between them is in the volume of posting activity and of commenting feedback and in the skewedness of respective distributions. No ...
Added: October 8, 2014
L.E. Limonov, M.V. Nesena, Regional Research of Russia 2016 Vol. 6 No. 2 P. 144-155
The goal of the research is to identify the main types of large and largest cities of Russia, taking into account the particularities of their structure and economic performance. The objects of the research are Russian cities that are administrative centers of regions and autonomous districts of the Russian Federation, as well as other Russian cities with population ...
Added: June 10, 2016
Koltsov S., Ignatenko V., Boukhers Z. et al., Entropy 2020 Vol. 22 No. 4 P. 1-13
Topic modeling is a popular technique for clustering large collections of text documents. A variety of different types of regularization is implemented in topic modeling. In this paper, we propose a novel approach for analyzing the influence of different regularization types on results of topic modeling. Based on Renyi entropy, this approach is inspired by ...
Added: April 1, 2020
Mavrin A., Filchenkov A., Koltsov S., in: Artificial Intelligence and Natural Language, 7th International Conference, AINL 2018, St. Petersburg, Russia, October 17–19, 2018, Proceedings. Issue 930. Switzerland: Springer, 2018. P. 117-129.
Added: November 30, 2018
Matkin N. A., Коммуникации. Медиа. Дизайн 2024
The article offers an analysis and visualization of Russian city images that emerge in the comments of urban community subscribers and posts from administrative press services. The city image is regarded as a frame structure that develops through political and interpersonal communication in the network. The social component of the city image is identified as ...
Added: November 15, 2023
Shakina E., Molodchik M., Parshakov P., Russian Management Journal 2020 Vol. 18 No. 3 P. 433-456
The study offers a structural literature review of the twenty-year evolution of the fast-growing research topic of intellectual capital (IC) and intangible-driven performance. Despite a rather short independent history, the IC concept has undergone a substantial transformation, bringing to the discussion vast empirical and methodological literature. Several endeavors carrying out literature review studies ...
Added: January 13, 2021
Koltsova O., Pashakhin S., Media, War and Conflict 2020 Vol. 13 No. 3 P. 237-257
Although conflict representation in media has been widely studied, few attempts have been made to perform large-scale comparisons of agendas in the media of conflicting parties, especially for armed country-level confrontations. In this paper, we introduce quantitative evidence of agenda divergence between the media of conflicting parties in the course of the Ukrainian crisis of 2013–2014. ...
Added: December 4, 2017
Koltsov S., Nikolenko S. I., Koltsova O. et al., in: WebSci 2016 - Proceedings of the 2016 ACM Web Science Conference. Elsevier, 2016. P. 342-343.
Topic modeling is a powerful tool for analyzing large collections of user-generated web content, but it still suffers from problems with topic stability, which are especially important for social sciences. We evaluate stability for different topic models and propose a new model, granulated LDA, that samples short sequences of neighboring words at once. We show that gLDA ...
Added: October 24, 2016
Ignatenko V., Koltsov S., Staab S. et al., Physica A: Statistical Mechanics and its Applications 2019
Topic modeling is a popular approach for clustering text documents. A variety of different types of regularization is implemented in topic modeling. In this paper, we propose a novel approach for analyzing the influence of different regularization types on results of topic modeling. Based on Renyi entropy, this approach is inspired by the concepts from ...
Added: October 31, 2019
Nagornyy O. S., Мухетдинова А. Т., in: Mathematical and Computer Modeling [Electronic resource]: Proceedings of the IV International Scientific Conference (Omsk, November 11, 2016). Omsk: Omsk State University Publishing House, 2016. P. 154-156.
Using materials from the healthy-lifestyle section of the blog lifehacker.ru, this work applies topic modeling and syntactic analysis of texts to examine how the discourse of biopedagogy manifests itself on the Internet, which linguistic means are used for this, and which topics are addressed. ...
Added: November 25, 2016
Koltsov S., in: Applied Informatics. Vol. 1: Communications in Computer and Information Science. Springer Publishing Company, 2022.
This work demonstrates the possibility of applying the duality properties of a statistical collection of texts to determine the optimal number of topics/clusters. In a series of numerical experiments on text data, it was demonstrated that Renyi entropy of topic models, expressed in Sq form (based on the escort distribution), as a function of the number ...
Added: October 28, 2022