NRU-HSE at SemEval-2016 Task 4: The open quantification library with two iterative methods

N. Karpov; A. Porshnev; Rudakov Kirill

?

NRU-HSE at SemEval-2016 Task 4: The open quantification library with two iterative methods

P. 171–177.

Karpov N., Porshnev A., Rudakov Kirill

In many areas, such as social science, politics or market research, people need to track sentiment and their changes over time. For sentiment analysis in this field it is more important to correctly estimate proportions of each sentiment expressed in the set of documents (quantification task) than to accurately estimate sentiment of a particular document (classification). Basically, our study was aimed to analyze the effectiveness of two iterative quantification techniques and to compare their effectiveness with baseline methods. All the techniques are evaluated using a set of synthesized data and the SemEval-2016 Task4 dataset. We made the quantification methods from this paper available as a Python open source library. The results of comparison and possible limitations of the quantification techniques are discussed.

Language: English

Full text

Text on another site

Keywords: sentiment analysis quantification comparison iterative methods performance evaluation

In book

The 10th International Workshop on Semantic Evaluation

San Diego: Association for Computational Linguistics, 2016.

Changes in the UK leading media's portrayal of China during the Covid-19 pandemic and the special military operation

Balakina Y. V., Yin Z., Известия Саратовского университета. Новая серия. Серия: Филология. Журналистика 2025 Vol. 25 No. 2 P. 229–236

The aim of the present study is to trace changes in the construction of the image of China in the British media during two crisis periods: the COVID-19 pandemic and the Russian military operation. Each period encompasses a panic (escalation) phase and a recovery (stagnation) phase. Using data from the Factiva database, 70,356 articles published ...

Added: January 20, 2026

Сопоставительный анализ уникальных впечатлений американских туристов о мемориале Линкольну в доковидный и постковидный периоды

Smolyanina E., Morozova I., Харитонова Н. В., Географический вестник 2025 № 4 (75) С. 162–177

The unique tourist experience is one of the main components of tourism activity. However, it is not studied in Russian and Western science. This determined the purpose of the study, that is to identify the characteristics of unique American tourists’ experiences in online reviews about the Lincoln Memorial on the travel site TripAdvisor in the ...

Added: January 7, 2026

Negation as a modality in a quantified setting

Speranski S. O., Journal of Logic and Computation 2021 Vol. 31 No. 5 P. 1330–1355

The idea of treating negation as a modality manifests itself in various logical systems, especially in Došen's propositional logic N, whose negation is weaker than that of Johansson's minimal logic. Among the interesting extensions of N are the propositional logics N* and Hype; the former was proposed in [Cabalar et al. 2006], while the latter has ...

Added: December 26, 2025

Sharpening complexity results in quantified probability logic

Speranski S. O., Logic Journal of the IGPL 2025 Vol. 33 No. 3 Article jzae114

We shall be concerned with two natural expansions of the quantifier-free ‘polynomial’ probability logic of [Fagin et al. 1990]. One of these, denoted by QPL-e, is obtained by adding quantifiers over arbitrary events, and the other, denoted by p-QPL-e, uses quantifiers over propositional formulas — or equivalently, over events expressible by such formulas. The earlier proofs ...

Added: December 26, 2025

Императивный интернет-комментарий как особый жанр конфликтной интернет-коммуникации

Shulginov V., Жанры речи 2025 Т. 20 № 3(47) С. 327–336

The article examines the imperative internet comment as a special genre of conflict internet discourse. The research was based on the study of two communities of the social network “VKontakte”, differing in the structure of social connections: vertical type (official community “VKontakte with authors”) and horizontal type (“Showbiz stars news”). Using automatic methods of data collection and analysis, ...

Added: October 12, 2025

Representation of the Post-Soviet Countries in the Global Online Information Space in 2020–2021: Frequency of Mention, Media Dynamics, Mood Characteristics

Sharikov A., , in: Internet in the Post-Soviet Area: Technological, Economic and Political Aspects.: Cham: Springer, 2023. Ch. 1 P. 7–46.

The chapter contains results of a study of the representation of 19 post-Soviet countries and territories on the global Internet in 2020–2021. It was carried out with the help of the FACTIVA monitoring database (information texts, over 23000 online resources from more than 100 countries in 26 languages). It turned out that in 2020–2021 only ...

Added: February 6, 2025

Представленность России в британских онлайн-источниках в 2022 г.

Sharikov A., Вестник Российского университета дружбы народов. Серия: Литературоведение, журналистика 2024 Т. 29 № 3 С. 534–550

The article examines peculiarities of representation of russia in British online sources in 2022, when russia launched a special military operation in ukraine. the author used a statistical approach to analysis based on the factiva monitoring system, the database of which contains about 4.5 million texts published on 416 British online resources from January 1 ...

Added: February 5, 2025

О соотношении сообщений позитивной и негативной тональности на русскоязычных информационных онлайн-ресурсах

Sharikov A., Потапова В. В., Вестник Академии медиаиндустрии 2023 Т. 34 № 2 С. 48–64

The article presents the results of a study conducted at the Higher School of Economics (HSE) on the corpus of texts of the monitoring system Factiva, published in 2020. The purpose of the study is to identify the quantitative relationship between positive and negative tone publications on Russian-language online resources in comparison with publications of ...

Added: February 5, 2025

Fear and Loathing in Russian Literature: A Case of Emotion Annotation of Short Stories of the 20th Century

Anna Moskvina, Margarita Kirina, , in: 27th International Conference, IMS 2024, St. Petersburg, Russia, June 24–26, 2024, Selected Papers. Internet and Modern Society. Human-Computer Communication. CCIS, volume 2534Vol. 2534.: Springer, 2025. P. 113–129.

The paper presents an investigation of the emotional aspect of the Russian short story of the 20th century. Our study is two-fold: firstly, we delve into emotional representation at the lexical level, building upon previous work on utilizing vector models to quantify emotional content. In this study, we introduce an annotated corpus where words are ...

Added: November 29, 2024

Партийно-политическая динамика в Норвегии как фактор российско-норвежских отношений

Chistikov M., Полис. Политические исследования 2024 № 4 С. 38–55

The relations between Russia and Norway are of a contradictory nature, containing both positive and negative elements. In the scientific literature, the systemic factors affecting Russian-Norwegian relations are well studied, while the internal political reasons for the transformation of bilateral relations are insufficiently explored. In 2013, i.e. before the international political crisis of 2014, a ...

Added: August 2, 2024

Identifying American tourists’ unique experiences from the Lincoln Memorial

Smolyanina E., Morozova I., Kharitonova N., Географический вестник 2024 No. 2(69) P. 150–164

Detailed experiences of travelers are presented in online tourist reviews that affect the way other tourists perceive and plan their trips. Such reviews are sources of information in the form of open writing that allows reliable sharing of experience about tourist attractions. Previous studies have made use of tourist reviews to obtain lists of the ...

Added: July 18, 2024

Two challenges for existentialist approaches to strict negative concord

Rudnev P., TABU: Bulletin voor Taalwetenschap 2024 P. 312–328

I present two challenges for the popular approach to the meaning of negative concord items, or neg-words, as existential quantifiers or indefinites. The first challenge concerns the interaction of that analysis with the approaches to fragment answers as instances of clausal ellipsis. The second challenge stems from the ability of multiple neg-words within one clause ...

Added: April 19, 2024

Perception of AI-generated art: text analysis of online discussions

Bosonogov S., Suvorova A., Journal of Mathematical Sciences 2023 Vol. 529 P. 6–23

In this work we analyze comments on three subreddits related to AI-generated art to understand how people perceive the ability of AI to create art and the topics and moods of discussions in the context of widespread usage of pre-trained models. We used computational text analysis techniques such as LDA topic modeling and sentiment analysis ...

Added: February 4, 2024

Ähnlichkeit in Lyrik und Poetik der Gegenwart

Warsz., Brux., Oxford, Wien, NY, Bern: Peter Lang, 2023.

Similarity makes it possible to recognize sameness in deviation. In terms of the history of ideas, thinking in similarities goes back to ancient times and has recently become more relevant, especially in cultural studies research. The volume focuses on the question in which ways similarity thinking structures contemporary poetry and poetics and manifests itself in ...

Added: January 11, 2024

Исследовательский потенциал корпуса советских песен: эмоциональная тональность и география песенных текстов через призму компьютерных технологий

Kolmogorova A., Зарембо В. С., Ткачева Е. С. et al., В кн.: Лингвистическая семантика в пространственном измерении: Словарь. Дискурс. Корпус.: Екатеринбург: Кабинетный ученый, 2024. Гл. 10 С. 423–445.

The purpose of this study is to describe the characteristics of the text of a popular Soviet song as a linguo-ideological phenomenon. The corpus of Soviet songs collected by the research group is used as material. The focus of this publication is on two characteristics: changes in the emotional tonality of popular songs released on ...

Added: December 10, 2023

О прошлом, но в разное время: компьютерный анализ текстов учебников по истории СССР/России для шести поколений студентов

Kolmogorova A., Колмогорова П. А., Куликова Е. Р., Вестник Томского государственного университета. Филология 2024 № 89 С. 73–103

In this article, we focus on the analysis of the texts of three history textbooks for university students published at different times: in 1946, in 1983 and in 2006. As a material, we use texts devoted in each of the textbooks to seven historical topics since the beginnings of Kiev principality till the Reforms of ...

Added: December 10, 2023

Sentiment Analysis of Literary Texts: A Study of Theme and Readers' Preferences in Russian Short Stories from 1900-1930s

Tatiana Sherstinova, Anna Moskvina, Margarita Kirina et al., , in: Literature, Language and Computing: Russian Contribution from the LiLaC-2023.: Springer, 2025. P. 23–35.

Added: December 9, 2023

Where Is Happily Ever After? A Study of Emotions and Locations in Russian Short Stories of 1900–1930

Moskvina A., Kirina M., , in: Digital Geography: Proceedings of the International Conference on Internet and Modern Society (IMS 2023).: Springer, 2023. P. 123–135.

The paper tackles the problem of the automatic detection of emotions in literary texts using distributional semantics techniques. The experiment was carried out on the material of Russian short stories from the 1900-1930s. We investigated the emotional lexis distribution across different locations in narratives. At first, we calculated the semantic association score between each word ...

Added: December 9, 2023

Несчастливы по-своему: как измерить тональность литературного текста?

Sherstinova T., Moskvina A., Kirina M. et al., В кн.: Труды международной конференции «Корпусная лингвистика — 2023».: СПб.: Издательство Санкт-Петербургского государственного университета, 2024. С. 232–240.

In the experimental study, the results of three different approaches to the evaluation of the tonality of literary texts are compared: dictionary-based, machine learning, and distributional semantics. The material for analysis was a selection of 210 stories by Russian writers from the first three decades of the 20th century. The research showed that the correlation ...

Added: December 9, 2023