Topic Modelling with NMF vs. Expert Topic Annotation: the Case Study of Russian Fiction

T. Sherstinova; Митрофанова О. А.; Скребцова Т. Г.; Замирайлова Е.; M. Kirina

doi:10.1007/978-3-030-60887-3_13


P. 134–151.

The paper presents an experiment comparing the results of topic modelling via non-negative matrix factorization (NMF) with those of manual topic annotation performed by an expert. The experiment was conducted on an annotated corpus of Russian short stories from the first three decades of the 20th century, which contains 310 stories with a total of 1,000,000 tokens written by 300 Russian writers. The annotation scheme used for topic annotation includes 89 topics; this list was then reduced to 30 generalized topics, the most frequent of which turned out to be the following: death, relationships, love, social groups, social processes, family, money, human sins, nature, religion, and war. Then the corpus, divided into three consecutive time periods, was subjected to NMF topic modelling, which yielded a model of 24 topics. The results of the two topic annotations were compared and described. The paper discusses the main findings of the study and the difficulties of topic modelling for fiction that should be taken into account. For example, the experimental results showed that topic modelling via NMF should be recommended primarily for revealing topics that refer to the general background of literary texts (e.g., war, love, nature, family) rather than for detecting topics related to critical events or relations between characters (e.g., death or relationships). The comparison of human and automatic topic annotation seems an important step towards improving artificial intelligence techniques related to NLP.
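As a rough illustration of the NMF step described in the abstract, the sketch below factorizes a tiny document-term count matrix into a document-topic matrix and a topic-term matrix using multiplicative updates. It is not the authors' pipeline: the toy English documents, the function names (`build_matrix`, `nmf`, `top_terms`), the choice of two topics, and the dependency-free update rule are all assumptions made for illustration. A real study would more likely rely on a library implementation and weighted term counts over lemmatized Russian text.

```python
import random

def build_matrix(docs):
    """Build a document-term count matrix (list of rows) and its vocabulary."""
    vocab = sorted({tok for doc in docs for tok in doc.split()})
    index = {term: j for j, term in enumerate(vocab)}
    matrix = [[0.0] * len(vocab) for _ in docs]
    for i, doc in enumerate(docs):
        for tok in doc.split():
            matrix[i][index[tok]] += 1.0
    return matrix, vocab

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]

def transpose(m):
    return [list(col) for col in zip(*m)]

def nmf(v, n_topics, n_iter=500, seed=0, eps=1e-9):
    """Approximate V (docs x terms) as W (docs x topics) times H (topics x terms).

    Uses multiplicative updates for the squared-error objective; all entries
    of W and H stay non-negative because every update multiplies by a
    non-negative ratio.
    """
    rng = random.Random(seed)
    n_docs, n_terms = len(v), len(v[0])
    w = [[rng.random() + 0.1 for _ in range(n_topics)] for _ in range(n_docs)]
    h = [[rng.random() + 0.1 for _ in range(n_terms)] for _ in range(n_topics)]
    for _ in range(n_iter):
        # H <- H * (W^T V) / (W^T W H)
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[k][j] * num[k][j] / (den[k][j] + eps) for j in range(n_terms)]
             for k in range(n_topics)]
        # W <- W * (V H^T) / (W H H^T)
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][k] * num[i][k] / (den[i][k] + eps) for k in range(n_topics)]
             for i in range(n_docs)]
    return w, h

def top_terms(h, vocab, topic, n=3):
    """Return the n highest-weighted terms of one topic."""
    ranked = sorted(range(len(vocab)), key=lambda j: h[topic][j], reverse=True)
    return [vocab[j] for j in ranked[:n]]

# Toy corpus (hypothetical): two "war" documents and two "love" documents.
docs = [
    "war soldier army front",
    "war soldier army battle",
    "love heart kiss letter",
    "love heart kiss garden",
]
v, vocab = build_matrix(docs)
w, h = nmf(v, n_topics=2)

# The dominant topic of a document is the column of W with the largest weight.
dominant = [max(range(2), key=lambda k: row[k]) for row in w]
for k in range(2):
    print(f"topic {k}:", top_terms(h, vocab, k))
print("dominant topic per document:", dominant)
```

Comparing such automatically assigned dominant topics with expert labels for the same documents is the kind of check the abstract describes, here reduced to a toy scale.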

Language: English

Keywords: Russian literature, topic modelling

In book

Advances in Computational Intelligence: 19th Mexican International Conference on Artificial Intelligence, MICAI 2020. Part II

Vol. 12469. Springer, 2020.

Digital Humanities and Literary Realism

Skorinkin D., Orekhov B., in: The Oxford Handbook of Global Realisms. Oxford: Oxford University Press, 2025. Ch. 10. P. 177–204.

This chapter investigates literary prose of the realist era in Russia using digital humanities methods. It focuses on how computational analysis can enhance an understanding of descriptions of literary characters, geographical locations, and lexical composition in literary texts. Using a corpus of more than five hundred texts (forty-six million word occurrences), it eschews the focus ...

Added: September 14, 2025

Dynamics of Linguistic and Cultural Processes in Contemporary Russia. Issue 8. Proceedings of the VIII Congress of ROPRYAL (Krasnoyarsk, September 10–14, 2024)

ROPRYAL, 2024.

The book includes the texts of reports and scientific presentations of the participants of the VIII Congress of ROPRYAL (Krasnoyarsk, September 10-14, 2024), devoted to topical aspects of the study of Russian language and literature. Special attention is paid to new trends in the description of the Russian language, to the issues of interaction between ...

Added: January 14, 2025

The Space of Coordination: Accounting for Multiple Expert Knowledges in Environmental Communication

Antonyuk A., Minina V. N., Pivovarov A. et al., Environmental Communication: A Journal of Nature and Culture 2024 No. 26 Nov P. 1–18

Activists, private companies, and nonprofits increasingly address environmental issues along with scientific and governmental bodies, each bringing valuable experience and original perspectives. However, the growing diversity of expert knowledges in environmental communication may complicate policy development and implementation. To help address this issue, we propose an account of environmental communication as a dynamic space involving ...

Added: January 14, 2025

Literature and the Public Sphere: (From the Compilers)

Poselyagin N., Вайзер Т. В., Slavic Literatures (formerly Russian Literature) 2024 Vol. 144-145 P. 1–16

In this introductory article to the thematic cluster Literature and the Public Sphere, we briefly outline how the concept of the “public sphere” proposed by Jürgen Habermas was adapted to the humanities and social sciences. It was adapted in the early 1990s as a result of heated international discussions, debates, and a partial shift away from Habermas’s peculiar ...

Added: June 20, 2024

Vladimir Kantor's Russia, or Destiny in the Struggle with the Present and the Future. Review of the book: Kantor V. K. (2023). Russia as Destiny. Moscow: Центр гуманитарных инициатив. 524 p. ISBN 978-5-98712-932-6

Девликамов Р. Т., Социологическое обозрение 2024 Vol. 23 No. 1 P. 382–389

The proposed article presents reflections on a new book by Professor V. K. Kantor "Russia as Destiny", dedicated to research in the field of philosophy of Russian culture. In the era of the prevailing value relativism and the "duality" of thought, the author sets himself a grand task – to preserve the real meaning of ...

Added: March 31, 2024

Natural School and Realism

Vdovin A., in: The New Cambridge History of Russian Literature. Cambridge: Cambridge University Press, 2024. P. 89–106.

This chapter explains the role of realism in Russian literature ...

Added: February 1, 2024

The New Cambridge History of Russian Literature

Cambridge: Cambridge University Press, 2024.

The new history of Russian and Russophone literature produced by an international group of scholars ...

Added: February 1, 2024

From the Compilers. About This Collection

Chaban A., Kharitonov D., Bazhenova-Sorokina A., In: Creative Writing in Russia: Plots, Approaches, Problems. Moscow: Новое литературное обозрение, 2023. P. 27–33.

The section “From the Compilers” describes the collection and the articles included in it, and outlines the most promising directions for studying creative writing in Russia. ...

Added: December 25, 2023

Creative Writing in Russia: Plots, Approaches, Problems

Moscow: Новое литературное обозрение, 2023.

This collection of articles introduces a new field of study, creative writing studies (in its Russian version, “research literary education”), into the Russian humanities. In their articles, the authors touch upon a wide range of problems and topics that can be considered part of creative writing studies and which help to ...

Added: December 24, 2023

Text and Tradition: An Almanac, 11

St. Petersburg: Издательство "Росток", 2023.

The almanac "Text and Tradition" is published by Pushkin House and Yasnaya Polyana, two of Russia's best-known "literary houses". One of its important aims is to examine contemporary Russian literature in the context of literary tradition, both classical and ancient. In a certain sense, the almanac combines the features of a scholarly journal and a literary ("thick") journal: its respective sections publish academic studies and literary essays. A special place in the publication ...

Added: November 1, 2023

The Black Sea Text of Russian Literature: Geocultures, Co-spatiality, and Geographical Imagination

Zamyatin D., Географическая среда и живые системы 2023 No. 2 P. 47–57

Aim. The purpose of the paper is to develop the concept of geocultural texts on the example of the Black Sea text of Russian literature. Methodology. Various concepts of local texts based on the research of Russian literature are analyzed. Problems are identified from the point of view of historical and cultural geography. The research relies on the use ...

Added: October 2, 2023

Ōizumi Kokuseki: Literature and Life

Ёмота И., In: History and Culture of Japan. Issue 15. Moscow: Издательский дом НИУ ВШЭ, 2023. P. 434–443.

Ōizumi Kokuseki (1893–1957) is a totally forgotten writer now, but a century ago he was a best-selling author. He was half Russian and half Japanese; he first went to school in Moscow and then entered a lyceum in Paris. Later he continued his education in schools in Kyoto and Tokyo. He wrote novels, autobiographies and ...

Added: May 19, 2023

Dataset "The Circulation of Literary Texts in the Gulag"

Lugovskaya D., Uspensky P., Vdovin A. et al., In: Open Data Repository on Russian Literature and Folklore. St. Petersburg: Institute of Russian Literature (Pushkin House) RAS, 2023.

This dataset presents a list of literary texts circulating in the Soviet penitentiary system between 1917 and 1991. The data are extracted from texts of memoirs published between 1928 and 2016. The database includes poetic and prosaic texts, as well as the authors mentioned, taking into account the situation of the texts' recitation, the geography ...

Added: February 17, 2023

Russian Literature in the Contemporary School: Problems, Practices, Prospects (An Analytical Review)

Скулачев А. А., Российские исследования 2022 Vol. 3 No. 1-2 P. 32–45

The article offers an analytical review of the problems, methodological principles, and prospects of contemporary teaching of Russian literature in schools. Particular attention is paid both to methodological reflection in a broad humanities context and to the description of specific practical solutions. The prospects for working with the digital environment and with students who require an individual approach are described separately. Based on the review of promising practices, conclusions are drawn about possible directions ...

Added: February 9, 2023

The Criminal Plot in Russian Literature

Markuntsov S. A., Право. Журнал Высшей школы экономики 2022 No. 4 P. 236–242

The review analyzes the content of the monograph "Criminal plot in the Russian literature" (M.: Prospect, 2021. 640 p.), the author of which is an outstanding scientist in the field of criminal law - A.V. Naumov. In the reviewed work the scientist makes an attempt to explain the phenomenon of "criminal" through the disclosure of ...

Added: December 11, 2022

The Russian Short Story of 1900-1930 and Its Perception by the Reader: A Quantitative Analysis of Literary Text Evaluation

Sherstinova T., Колпащикова Е. О., Сейнова А. Р. et al., Человек: образ и сущность. Гуманитарные аспекты 2023 No. 2(54) P. 164–184

The article is devoted to modern readers’ perception of literary texts written about 100 years ago. A selection of 210 short stories written and published in the first 3 decades of the 20th century by various authors, including both well-known and lesser-known writers, has been read and rated by three independent readers in terms of how much they liked ...

Added: December 10, 2022

Myths and Motifs of the Moscow Text in Russian Linguoculture of the 21st Century

Белова П.Е., Журнал филологических исследований 2022 Vol. 7 No. 3 P. 87–91

The article is devoted to the study of the Moscow text and the peculiarities of its mythotectonics on the material of literary works and everyday texts of the XXIst century about Moscow, containing linguistic and cultural ideas about the Russian capital. The article names the myths and motifs contained in the Moscow text of Russian ...

Added: November 15, 2022

Essays on French Literature: For the 100th Anniversary of L. G. Andreev

Moscow: Литфакт, 2022.

The collection of essays is devoted to the eminent connoisseur of French literature Leonid Andreev ...

Added: October 31, 2022

And Yet, Why Does Russia Make Us Nostalgic?

Ёмота И., Историческая экспертиза 2022 No. 1 P. 11–25

The current international socio-political situation in connection with Russia puts Japanese intellectuals and Japanese society as a whole in a memoir-reflexive, nostalgic mood. What are Russia and the Soviet Union in the eyes of the Japanese? The domestic history of culture in the broadest sense (classical literature, classical music, science, cinema, space, cuisine): all this is subjected to ...

Added: October 25, 2022

Space and Time in Russian Philosophy and Culture: A Collection of Works by Young Scholars

Moscow: МАКС Пресс, 2021.

The collection includes articles prepared by young scientists, students and undergraduates, who participated in the conferences “Space and Time in Russian Literature and Philosophy” in 2018–2020. These conferences were regularly organized within the framework of the Project “Russian Literature and Philosophy: Ways of interaction” (Russian Science Foundation, project No. 17-18-01432-П) by the A.M. Gorky Institute of World Literature of the ...

Added: September 10, 2022

Writer - Criticism - Reader (Mechanisms of Literary Reputation Formation in Russia from the Second Half of the 19th to the First Third of the 20th Century)

St. Petersburg: Издательство "Росток", 2022.

The chapters of the book focus on the institution of literary criticism and its role in shaping an author's reputation. Individual sections of the monograph analyze the following topical issues: the writer's debut, with its triumphs and defeats; literary prizes as a fact of public recognition; sensations, scandals, and hoaxes as tools for achieving popularity; literary provocation and the exploitation of the image of the decadent writer; the role of the information space of the early 20th century ...

Added: January 28, 2022