?
Mining for Opinions Across Domains: A Cross-Language Study
P. 67–74.
Kravchenko A.
An important task in opinion mining is detecting subjective expressions in texts and distinguishing them from factual information. High lexicon diversity between different domains excludes the possibility of formulating universal rules that would work for any area of knowledge.
In this article we suggest a solution for this problem. We define the features that most opinionated sentences share and propose a cross-language classification of subjective expressions, illustrated by examples in Russian, English and Chinese. We also propose an algorithm based on this classification that generates a set of extraction patterns for any domain from a corpus of untagged texts. The corpus requires no additional preparation except for POS-tagging.
The effectiveness of the proposed approach is evaluated for English and Russian on collections of approximately 300 000 sentences each, gathered from three different domains: user reviews on movies, headphones and photo cameras.
Language:
English
Ankudinov I., Социология: методология, методы, математическое моделирование 2025 № 61 С. 165–203
The changing political mood of Russians is a constant subject of interest for sociological agencies. With the development of the Internet, conventional questionnaire research began to be supplemented by online surveys and, despite some skepticism, by social media mining. This article attempts to adjust an accidental web-sample so as to bring its estimates closer to ...
Added: April 22, 2026
Bocharova A., Денисов И. Е., Зуенко И. Ю., Вестник Санкт-Петербургского университета. Востоковедение и африканистика 2025 Т. 17 № 2 С. 366–377
The article focuses on analyzing the perceptions of recent changes in China’s demographic pol-icy by the contemporary Chinese society. These changes involved the relaxation of restrictions on the number of children in a family, first to two children in 2015 and subsequently to three children in 2021. The relevance of this research stems from the ...
Added: February 19, 2026
Balakina Y. V., Yin Z., Известия Саратовского университета. Новая серия. Серия: Филология. Журналистика 2025 Vol. 25 No. 2 P. 229–236
The aim of the present study is to trace changes in the construction of the image of China in the British media during two crisis periods: the COVID-19 pandemic and the Russian military operation. Each period encompasses a panic (escalation) phase and a recovery (stagnation) phase. Using data from the Factiva database, 70,356 articles published ...
Added: January 20, 2026
Smolyanina E., Morozova I., Харитонова Н. В., Географический вестник 2025 № 4 (75) С. 162–177
The unique tourist experience is one of the main components of tourism activity. However, it is not studied in Russian and Western science. This determined the purpose of the study, that is to identify the characteristics of unique American tourists’ experiences in online reviews about the Lincoln Memorial on the travel site TripAdvisor in the ...
Added: January 7, 2026
Makeeva N., Наволоцкая П. А., Iskyandyarov R. et al., Вопросы экономики 2026 № 6 С. 135–154
This paper analyzes the informational value and predictive power of retail investor sentiment, formed by news flows and social media publications, for the dynamics of the Russian stock market. The aim of the study is to assess the informational value and predictive power of sentiment indicators, calculated using the FinBERT model, for key stock market ...
Added: December 2, 2025
Shulginov V., Жанры речи 2025 Т. 20 № 3(47) С. 327–336
The article examines the imperative internet comment as a special genre of conflict internet discourse.
The research was based on the study of two communities of the social network “VKontakte”, differing in the
structure of social connections: vertical type (official community “VKontakte with authors”) and horizontal
type (“Showbiz stars news”). Using automatic methods of data collection and analysis, ...
Added: October 12, 2025
Sharikov A., , in: Internet in the Post-Soviet Area: Technological, Economic and Political Aspects.: Cham: Springer, 2023. Ch. 1 P. 7–46.
The chapter contains results of a study of the representation of 19 post-Soviet countries and territories on the global Internet in 2020–2021. It was carried out with the help of the FACTIVA monitoring database (information texts, over 23000 online resources from more than 100 countries in 26 languages). It turned out that in 2020–2021 only ...
Added: February 6, 2025
Sharikov A., Вестник Российского университета дружбы народов. Серия: Литературоведение, журналистика 2024 Т. 29 № 3 С. 534–550
The article examines peculiarities of representation of russia in British online sources in 2022, when russia launched a special military operation in ukraine. the author used a statistical approach to analysis based on the factiva monitoring system, the database of which contains about 4.5 million texts published on 416 British online resources from January 1 ...
Added: February 5, 2025
Sharikov A., Потапова В. В., Вестник Академии медиаиндустрии 2023 Т. 34 № 2 С. 48–64
The article presents the results of a study conducted at the Higher School of Economics (HSE) on the corpus of texts of the monitoring system Factiva, published in 2020. The purpose of the study is to identify the quantitative relationship between positive and negative tone publications on Russian-language online resources in comparison with publications of ...
Added: February 5, 2025
Anna Moskvina, Margarita Kirina, , in: 27th International Conference, IMS 2024, St. Petersburg, Russia, June 24–26, 2024, Selected Papers. Internet and Modern Society. Human-Computer Communication. CCIS, volume 2534Vol. 2534.: Springer, 2025. P. 113–129.
The paper presents an investigation of the emotional aspect of the Russian short story of the 20th century. Our study is two-fold: firstly, we delve into emotional representation at the lexical level, building upon previous work on utilizing vector models to quantify emotional content. In this study, we introduce an annotated corpus where words are ...
Added: November 29, 2024
Chistikov M., Полис. Политические исследования 2024 № 4 С. 38–55
The relations between Russia and Norway are of a contradictory nature, containing both positive and negative elements. In the scientific literature, the systemic factors affecting Russian-Norwegian relations are well studied, while the internal political reasons for the transformation of bilateral relations are insufficiently explored. In 2013, i.e. before the international political crisis of 2014, a ...
Added: August 2, 2024
Smolyanina E., Morozova I., Kharitonova N., Географический вестник 2024 No. 2(69) P. 150–164
Detailed experiences of travelers are presented in online tourist reviews that affect the way other tourists perceive and plan their trips. Such reviews are sources of information in the form of open writing that allows reliable sharing of experience about tourist attractions. Previous studies have made use of tourist reviews to obtain lists of the ...
Added: July 18, 2024
Bosonogov S., Suvorova A., Journal of Mathematical Sciences 2024 Vol. 285 P. 1–13
In this work we analyze comments on three subreddits related to AI-generated art to understand how people perceive the ability of AI to create art and the topics and moods of discussions in the context of widespread usage of pre-trained models. We used computational text analysis techniques such as LDA topic modeling and sentiment analysis ...
Added: February 4, 2024
Kolmogorova A., Зарембо В. С., Ткачева Е. С. et al., В кн.: Лингвистическая семантика в пространственном измерении: Словарь. Дискурс. Корпус.: Екатеринбург: Кабинетный ученый, 2024. Гл. 10 С. 423–445.
The purpose of this study is to describe the characteristics of the text of a popular Soviet song as a linguo-ideological phenomenon. The corpus of Soviet songs collected by the research group is used as material. The focus of this publication is on two characteristics: changes in the emotional tonality of popular songs released on ...
Added: December 10, 2023
Kolmogorova A., Колмогорова П. А., Куликова Е. Р., Вестник Томского государственного университета. Филология 2024 № 89 С. 73–103
In this article, we focus on the analysis of the texts of three history textbooks for university students published at different times: in 1946, in 1983 and in 2006. As a material, we use texts devoted in each of the textbooks to seven historical topics since the beginnings of Kiev principality till the Reforms of ...
Added: December 10, 2023
Tatiana Sherstinova, Anna Moskvina, Margarita Kirina et al., , in: Literature, Language and Computing: Russian Contribution from the LiLaC-2023.: Springer, 2025. P. 23–35.
Added: December 9, 2023
Where Is Happily Ever After? A Study of Emotions and Locations in Russian Short Stories of 1900–1930
Moskvina A., Kirina M., , in: Digital Geography: Proceedings of the International Conference on Internet and Modern Society (IMS 2023).: Springer, 2023. P. 123–135.
The paper tackles the problem of the automatic detection of emotions in literary texts using distributional semantics techniques. The experiment was carried out on the material of Russian short stories from the 1900-1930s. We investigated the emotional lexis distribution across different locations in narratives. At first, we calculated the semantic association score between each word ...
Added: December 9, 2023
Sherstinova T., Moskvina A., Kirina M. et al., В кн.: Труды международной конференции «Корпусная лингвистика — 2023».: СПб.: Издательство Санкт-Петербургского государственного университета, 2024. С. 232–240.
In the experimental study, the results of three different approaches to the evaluation of the tonality of literary texts are compared: dictionary-based, machine learning, and distributional semantics. The material for analysis was a selection of 210 stories by Russian writers from the first three decades of the 20th century. The research showed that the correlation ...
Added: December 9, 2023