Mining for Opinions Across Domains: A Cross-Language Study

A. Kravchenko

?

Mining for Opinions Across Domains: A Cross-Language Study

P. 67–74.

Kravchenko A.

An important task in opinion mining is detecting subjective expressions in texts and distinguishing them from factual information. High lexicon diversity between different domains excludes the possibility of formulating universal rules that would work for any area of knowledge. In this article we suggest a solution for this problem. We define the features that most opinionated sentences share and propose a cross-language classification of subjective expressions, illustrated by examples in Russian, English and Chinese. We also propose an algorithm based on this classification that generates a set of extraction patterns for any domain from a corpus of untagged texts. The corpus requires no additional preparation except for POS-tagging. The effectiveness of the proposed approach is evaluated for English and Russian on collections of approximately 300 000 sentences each, gathered from three different domains: user reviews on movies, headphones and photo cameras.

Language: English

Keywords: sentiment analysis opinion mining cross-language information retrieval

In book

Proceedings of the First International Workshop on Sentiment Discovery from Affective Data (SDAD 2012), Leuven, Belgium, 2012

Leuven: [б.и.], 2012.

Social media-based opinion retrieval for product analysis using multi-task deep neural networks

Gozuacik N., Sakar C. O., Ozcan S., Expert Systems with Applications 2021 Vol. 183 No. 30 November 2021 P. 1–13

Social media platforms are considered one of the most effective intermediaries for companies to interact with consumers. Social media-based decision support systems for the marketing domain are highly developed, but product development and innovation-oriented studies remain limited. This study offers a novel approach which utilises opinion retrieval theme along with sentiment analysis to support the ...

Added: December 12, 2021

Opinion Mining for Modeling User Experience of Online Education: Sentiment Analysis and Keywords Extraction of Student Reviews

Moskvina A., Kirina M., Anastasia Gavrilyuk, , in: 2022 32nd Conference of Open Innovations Association (FRUCT). IEEE, 2022. P. 187–195.

The paper discusses the possibilities of applying modern natural language processing technologies of opinion mining to investigate and improve the user experience of online-courses students. We analyzed 27 000 student reviews of projects within the Python programming language course. First, we applied keyword extraction algorithms as a way of semantic compression to receive a generalized ...

Added: December 9, 2022

Язык и искусственный интеллект: Сборник статей по итогам конференции «Лингвистический форум 2020: Язык и искусственный интеллект»

Издательский дом ЯСК, 2023.

The collection presents articles by participants of the "Linguistic Forum 2020: Language and Artificial Intelligence" (conference, November 2020, RAS), reflecting general and specific problems of scientific research in the field of linguistics and computer technologies. The authors of the published articles offer solutions to special issues against the background of larger-scale objects of scientific heuristics ...

Added: October 31, 2023

Партийно-политическая динамика в Норвегии как фактор российско-норвежских отношений

Чистиков М. Н., Полис. Политические исследования 2024 № 4 С. 38–55

The relations between Russia and Norway are of a contradictory nature, containing both positive and negative elements. In the scientific literature, the systemic factors affecting Russian-Norwegian relations are well studied, while the internal political reasons for the transformation of bilateral relations are insufficiently explored. In 2013, i.e. before the international political crisis of 2014, a ...

Added: August 2, 2024

Identifying American tourists’ unique experiences from the Lincoln Memorial

Smolyanina E., Morozova I., Kharitonova N., Географический вестник 2024 No. 2(69) P. 150–164

Detailed experiences of travelers are presented in online tourist reviews that affect the way other tourists perceive and plan their trips. Such reviews are sources of information in the form of open writing that allows reliable sharing of experience about tourist attractions. Previous studies have made use of tourist reviews to obtain lists of the ...

Added: July 18, 2024

Think about what you’ve learned: аспектный анализ тональности для моделирования пользовательского опыта в сфере онлайн-образования

Kirina M., Человек: образ и сущность. Гуманитарные аспекты 2023 № 2(58) С. 176–204

The article focuses on the application of opinion mining techniques to evaluate user experience on the Hyperskill educational platform, using Python, Java, and Kotlin programming projects as the basis of analysis. The study utilizes sentiment analysis and keyword extraction methods to gauge users' attitudes towards the platform, learning process, and topics covered. To achieve this, ...

Added: December 9, 2023

Количественный сравнительный анализ сентимента прямой речи и нарратива на материалах Корпуса русского рассказа

Сейнова А. Р., Социо- и психолингвистические исследования 2023

The article is devoted to measuring the emotional component of Russian small prose of the early XX century. On the basis of a random sample of 70 stories written in 1900-1930, with the help of the Dostoevsky mood analysis library, the manifestation of sentiment in the literary text was analyzed. The stories were divided into ...

Added: December 10, 2023

Attitude of Russians to the topic of material well-being: analysis of comments in social media

Fabrykant M., Magun V., Милкова М. А., / Series SocArXiv "SocArXiv". 2023.

The current study is a content analysis of news comments posted on the Russian social network VKontakte to investigate the expression of opinions on material well being. Based on NLP methods, we analyze the main groups of expression, which are considered in the context of formulating evaluations and appealing to social norms. We analyze the content of discourse at the micro level; analyze how attitudes towards the topic ...

Added: December 4, 2023

К вопросу об использовании эмоциональной окрашенности команды при голосовом управлении роботом

Karpova I. P., Ровбо М. А., В кн.: Шестнадцатая национальная конференция по искусственному интеллекту с международным участием КИИ-2018 (24-27 сентября 2018 г., г. Москва, Россия). Труды конференции. В 2-х томах.Т. 1. М.: РКП, 2018. С. 116–123.

The paper considers the problem of controlling a robot using a voice interface with speech recognition and analysis of the resulting set of words. The proposed method of command recognition is based on a dictionary of commands and special modifier words that are used for sentiment analysis of the command phrase and determining the priority ...

Added: September 28, 2018

Несчастливы по-своему: как измерить тональность литературного текста?

Sherstinova T., Moskvina A., Kirina M. et al., В кн.: Корпусная лингвистика - 2023. [б.и.], 2023.

In the experimental study, the results of three different approaches to the evaluation of the tonality of literary texts are compared: dictionary-based, machine learning, and distributional semantics. The material for analysis was a selection of 210 stories by Russian writers from the first three decades of the 20th century. The research showed that the correlation ...

Added: December 9, 2023

Volatility Prediction using Financial Disclosures Sentiments with Word Embedding-based IR Models

Rekabsaz N., Lupu M., Baklanov A. et al., , in: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). [б.и.], 2017. P. 1712–1721.

Volatility prediction—an essential concept in financial markets—has recently been addressed using sentiment analysis methods. We investigate the sentiment of annual disclosures of companies in stock markets to forecast volatility. We specifically explore the use of recent Information Retrieval (IR) term weighting models that are effectively extended by related terms using word embeddings. In parallel to ...

Added: July 17, 2019

Emotions and Monoamines: New Approach to the Emotional Text Classification in Sentiment Analysis

Kolmogorova A., Advances in Intelligent Systems and Computing 2021 No. 3 P. 375–384

The paper presents the classification of Internet-texts in Russian according to their monoamine status. We consider the levels of serotonin, noradrenaline and dopamine supposedly present in the blood of the text producer. The procedure of the non-discrete annotation of emotional tonality was used for the corpora data to identify the verbal markers for low-serotonergic, high-serotonergic, ...

Added: October 30, 2022

The Applications of Sentiment Analysis for Russian Language Texts: Current Challenges and Future Perspectives

Smetanin S., IEEE Access 2020 Vol. 8 P. 110693–110719

Sentiment analysis has become a powerful tool in processing and analysing expressed opinions on a large scale. While the application of sentiment analysis on English-language content has been widely examined, the applications on the Russian language remains not as well-studied. In this survey, we comprehensively reviewed the applications of sentiment analysis of Russian-language content and ...

Added: June 24, 2020

Where Is Happily Ever After? A Study of Emotions and Locations in Russian Short Stories of 1900–1930

Moskvina A., Kirina M., , in: Digital Geography: Proceedings of the International Conference on Internet and Modern Society (IMS 2023). Springer, 2023. P. 123–135.

The paper tackles the problem of the automatic detection of emotions in literary texts using distributional semantics techniques. The experiment was carried out on the material of Russian short stories from the 1900-1930s. We investigated the emotional lexis distribution across different locations in narratives. At first, we calculated the semantic association score between each word ...

Added: December 9, 2023

STRENGTH OF WORDS: DONALD TRUMP'S TWEETS, SANCTIONS AND RUSSIA'S RUBLE

Afanasyev D., Fedorova E., Ledyaeva S., Journal of Economic Behavior and Organization 2021 Vol. 184 P. 353–377

We empirically test if the US President Donald Trump's rhetoric towards Russia (represented by his Russia-related tweets) affects ruble's exchange rate. Using three-stepped empirical framework, we find that escalation of negative sentiment of Trump's Russia-related tweets leads to ruble's depreciation (4–10%) in short-term periods (around 3 days). Though these episodes tend to coincide with imposition or announcement ...

Added: November 29, 2021

Value Propositions of Restaurant Delivery Systems: A Text Mining-Based Review.

Fainshtein E., , in: XIV International Scientific Conference “INTERAGROMASH 2021"Vol. 1: Precision Agriculture and Agricultural Machinery Industry. Springer, 2021. P. 475–483.

Due to the e-commerce rapid development during the COVID pandemic, the demand for logistics and its importance is increasing. A satisfied customer can drive e-commerce business forward. As logistical needs become more complex and logistics market becomes more competitive, service companies must strive to continually improve their value proposition to maintain their competitive edge. This ...

Added: November 1, 2021

The Impact of Disclosure Sentiment on the Share Prices of Russian Companies

Maksim Kopyrin, Naidenova I. N., Journal of Corporate Finance Research 2021 Vol. 15 No. 2 P. 5–15

Information about companies published in a news feed is invariably tinted by emotional tonality. As such, resulting perceptions may influence the opinion of market players, and consequently affect the dynamics of a company’s share price. This study aims to evaluate various hypotheses about the impact of the tone of news items regarding dividends, capital expenditures, and development on ...

Added: June 16, 2021

Pulse of the Nation: Observable Subjective Well-Being in Russia Inferred from Social Network Odnoklassniki

Sergey Smetanin, Mathematics 2022 Vol. 10 No. 16 Article 2947

Policymakers and researchers worldwide are interested in measuring the subjective well-being (SWB) of populations. In recent years, new approaches to measuring SWB have begun to appear, using digital traces as the main source of information, and show potential to overcome the shortcomings of traditional survey-based methods. In this paper, we propose the formal model for ...

Added: August 15, 2022

PolSentiLex: Sentiment Detection in Socio-political Discussions on Russian Social Media

Koltsova O., Alexeeva S., Pashakhin S. et al., , in: Artificial Intelligence and Natural Language. AINL 2020. Communications in Computer and Information ScienceBook 1292: Communications in Computer and Information Science. Springer, 2020. P. 1–16.

We present a freely available Russian language sentiment lexicon PolSentiLex designed to detect sentiment in user-generated content related to social and political issues. The lexicon was generated from a database of posts and comments of the top 2,000 LiveJournal bloggers posted during one year (~1.5 million posts and 20 million comments). Following a topic modeling ...

Added: September 22, 2020

Text mining approach for logistic companies’ efficiency assessment

Kuznetsova Y. A., Expert Systems with Applications: X 2023

The growing activity of companies on the Internet and popularity of social networks promote text message generation, containing people’s feedback on products and services. It seems obvious that such sources of unstructured big data being analysed can give additional competitive advantages to a company. Although there are many attempts to cope with this unstructured, messy ...

Added: March 30, 2023

Extracting Domain-Specific Opinion Words for Sentiment Analysis

Shamshurin I., , in: Advances in Computational Intelligence. 11th Mexican International Conference on Artificial Intelligence, MICAI 2012, San Luis Potosi, Mexico, October 27 - November 4, 2012, Proceedings* II: Lecture Notes in Artificial Intelligence. Heidelberg: Springer, 2012. P. 58–68.

In this paper, we consider opinion word extraction, one of the key problems in sentiment analysis. Sentiment analysis (or opinion mining) is an important research area within computational linguistics. Opinion words, which form an opinion lexicon, describe the attitude of the author towards certain opinion targets, i.e., entities and their attributes on which opinions have ...

Added: December 18, 2012

ТОНАЛЬНОСТЬ ОСВЕЩЕНИЯ ПОЗИЦИИ РОССИИ В АНГЛОЯЗЫЧНЫХ СМИ В ПЕРИОД САНКЦИЙ

Khrustova L., Федоров Ф. Ю., Fedorova E., Контуры глобальных трансформаций: политика, экономика, право 2020 Т. 13 № 4 С. 292–310

Обострение политической обстановки, которая свойственна текущей стадии развития международных отношений, сопровождается масштабной информационной войной. Проблема освещения положения России в международной прессе с негативной точки зрения обсуждается с начала 2000-х годов. Российско-украинский конфликт, который начался в конце 2013 - начале 2014 годов, заставил иностранные средства массовой информации вновь обратить внимание на Россию и спровоцировал увеличение количества ...

Added: October 29, 2020

АНТИМИГРАНТСКАЯ РИТОРИКА В БРИТАНСКИХ СМИ ДО И ПОСЛЕ РЕФЕРЕНДУМА

Balakina Y. V., Галочкин А. Е., Вестник Пермского университета. Серия: Политология 2020 № 4 С. 115–126

The present comparative study focuses on the British anti-migrant media discourse of two key periods of migration policy - before and after Brexit. The methodological basis of the work constituted the theory of social actors of van Leuven (2008), the conceptual opposition “us” and “them” by T. van Dijk (1989), and the agenda-setting theory of M. McCombs and ...

Added: December 23, 2020