Evaluating the Pragmatic Competence of Large Language Models in Detecting Mitigated and Unmitigated Types of Disagreement

V. Shulginov; Hasan Berkcan Şimşek; Sergei Kudriashov; Randautsova R.; Shevela S.

doi:10.28995/2075-7182-2025-23-345-360

Publications

?

Evaluating the Pragmatic Competence of Large Language Models in Detecting Mitigated and Unmitigated Types of Disagreement

P. 345–360.

Shulginov V., Hasan Berkcan Şimşek, Sergei Kudriashov, Randautsova R., Shevela S.

This study presents a framework for evaluating the effectiveness of language models (LLMs) in detecting disagreement across a wide range of pragmatic strategies, from mitigated forms to overt verbal aggression. Special attention is given to complex cases of implicit manifestations of irony and sarcasm, which pose significant challenges for both automated analysis and interpersonal communication. Experimental testing of LLMs was conducted in two types of tasks: binary classification for identifying disagreement and classification of specific strategies for its expression. The results showed that large multilingual models outperformed other models, especially in binary classification. However, models that focus primarily on the Russian language, such as GigaChat and YaGPT, tend to interpret irony and sarcasm more accurately and have a higher result density. Comparative analysis with human judgments revealed that, despite progress, the accuracy of sarcasm detection by LLMs still lags significantly behind human judgments. The results suggest a need for further optimization of LLMs to improve their pragmatic competence in real communicative situations.

Keywords: прагматика pragmatics ирония irony бенчмарк disagreement sarcasm benchmark LLMs evaluation QUD оценка БЯМ несогласие сарказм вопрос над дискуссией

In book

Computational Linguistics and Intellectual Technologies: Papers from the Annual International Conference “Dialogue” (2025)

Issue 23. , [б.и.], 2025.

Ирония в пьесе Ватсараджи «Киратарджуния» (XII в.)

Минаева М. Д., Вестник Института востоковедения РАН 2025 № 6 С. 143–155

This article examines the rhetorical device of “irony” in the Sanskrit poetic tradition, using examples from the medieval playwright Vatsarāja’s Kirātārjunīya (“The Kirāta and Arjuna,” 12th century). This play belongs to the rare vyāyoga genre, which is characterized by the depiction of a great battle between two renowned heroes accompanied by a verbal duel filled with ...

Added: April 30, 2026

Христианская лексика в сфере IT: ирония и пафос

Комышкова А. Д., В кн.: Проблемы семантики и прагматики языковых единиц разных уровней в эпоху больших языковых данных. Сборник трудов Международной научной конференции, посвященной памяти доктора филологических наук, профессора Ольги Павловны Ермаковой.: Калуга: ФБГОУ ВПО "Калужский государственный университет им. К.Э.Циолковского", 2025. С. 219–227.

The article examines the lexical representation of the Christian metaphor in texts that are subjectively and thematically related to the field of information technologies, as well as addressed to all those who are professionally associated with this field. The study focuses on the pragmatic shifts in semantics that occur in words such as геенна (gehenna) ...

Added: April 3, 2026

Коммуникативная концепция Т. Г. Винокур в контексте прагматической социологии (на примере пьесы Д. Данилова «Сережа очень тупой»)

Nikishina E., В кн.: Говорящий и пишущий: К 100-летию со дня рождения Татьяны Григорьевны Винокур.: М.: Институт русского языка им. В.В. Виноградова РАН, 2024. С. 238–258.

The book is dedicated to the memory of a remarkable Russian language scholar, Tatyana Grigoryevna Vinokur (1924–1992). The range of issues addressed in the collected scholarly articles reflects the breadth of Tatyana Grigoryevna's research interests: the history of language, poetics, the language of fiction, stylistics, speech culture, problems of communication studies, and many other topics. ...

Added: March 8, 2026

HoTPP benchmark: Are we good at the long horizon events forecasting?

Karpukhin I., Shipilov F., Savchenko A., Neurocomputing 2026 Vol. 672 Article 132771

Forecasting multiple future events within a given time horizon is essential for applications in finance, retail, social networks, and healthcare. This problem is typically addressed using Marked Temporal Point Processes (MTPP), which provide a principled framework for modeling both event timing and event labels. While most existing research focuses on predicting only the next event, forecasting distant future ...

Added: February 25, 2026

Речевое воздействие в разных контекстах

Гданьск: Wydawnictwo Uniwersytetu Gdanskiego, 2021.

Сборник научных трудов «Речевое воздействие в разных дискурсах» посвящён комплексному исследованию механизмов языкового влияния в различных типах коммуникации. В центре внимания авторов находятся стратегии и тактики речевого воздействия, прагматические и когнитивные механизмы формирования убеждения, манипуляции, аргументации и эмоционального воздействия в институциональных и неинституциональных дискурсах. В статьях рассматриваются особенности функционирования средств речевого воздействия в политическом, медиадискурсе, образовательном, ...

Added: February 23, 2026

Explorations in Applied Ethnolinguistics: Words, Cultures, and Global Perspectives

Palgrave Macmillan, 2025.

This volume contributes to the growing body of cutting-edge research into the Natural Semantic Metalanguage (NSM) approach in linguistics. It explores the broad range of possible applications enabled by the NSM approach, from linguistic studies of semantics and culture to cross-cultural studies, psychology and childhood education. The volume builds on previous studies, bringing a diversity ...

Added: January 28, 2026

ComputAgeBench: Epigenetic Aging Clocks Benchmark

Kriukov D., Efimov E., Kuzmina E. et al., , in: KDD '25: Proceedings of the 31th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Volume 2.: Association for Computing Machinery (ACM), 2025. P. 5560–5570.

The success of clinical trials of longevity drugs relies heavily on identifying integrative health and aging biomarkers, such as biological age. Epigenetic aging clocks predict the biological age of individuals using their DNA methylation profiles, commonly retrieved from blood samples. However, there is no standardized methodology to validate and compare epigenetic clock models. We propose ComputAgeBench, ...

Added: January 12, 2026

Прагматика в цифровую эпоху: база данных «Рутиникон»

Rakhilina E. V., Гюласарян С. М., Бычкова П., Слово.ру: балтийский акцент 2025 Т. 16 № 2 С. 28–52

This study focuses on the Routinicon database as a digital tool for describing routines — a distinct class of formulaic phraseological units that represent reactions to or comments on standard extralinguistic situations. For instance, the formula Kogo ya vizhu! (Whom do I see!) serves as a reaction to an unexpected meeting, while Kto tam? (Who’s ...

Added: October 29, 2025

The immediate and the naive metaphysics

Ivan B. Mikirtumov, Epistemology and Philosophy of Science 2025 Vol. 62 No. 3 P. 126–131

In this article, I discuss Pirmin Stekeler-Weithofer’s ideas about the nature of language and the metaphysical residue that seems to be present in the realm of immediate experience, despite all the criticism and success of positive knowledge. This includes, first and foremost, the ability to perceive objects, facts, and possible worlds which humans have from ...

Added: September 1, 2025