?
TEXTS OF DIFFERENT EMOTIONAL CLASSES AND THEIR TOPIC MODELING
The article is devoted to studying verbalization specifics of various emotional states in the texts in Russian with the purpose to confirm or refute the hypothesis that texts of different emotional classes reflect the denotative situation not identically, which is reflected in thematic specifics and lexical content. The research material consisted of eight corpus texts in Russian, which were extracted from the public pages of the social network VKontakte. The texts were selected according to emotional hashtags that corresponded to eight basic emotions, according to the H. Lövheim’s model: anger, surprise, shame, enjoyment, disgust, distress, excitement, fear. The correspondence of emotion and hashtag was established in a preliminary psycholinguistic experiment. While analyzing the text collection, we used the method of computer thematic modeling to identify statistically non–random groups of words (topics). We applied the BERTopic neural network model to the collected data. As a result of the analysis, it was found that texts of 8 emotional classes contain an uneven number of topics, despite the fact that their number does not correlate directly with the amount
of data: with a relatively small amount of data, there may be many topics, but in a voluminous corpus – few. For example, there are only two topics in the subcorpus of "inspired" texts (184,074 tokens), and nine in the subcorpus of "disgusting" texts (45,868 tokens). In addition, the sets of words (tokens) that make up each non-random group (topic) differ in each subcorpora, reflecting the specifics of the denotative situation, which is formed under the influence of the emotional state of the speaker. The idea of diverse thematic "granularity" of texts of different emotional classes is theoretically justified.