• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • Automatic Annotation of Discourse and Speech Formulas in Internet Communication: A Telegram Comment Corpus
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
May 22, 2026
HSE Graduates AI Project Wins at TECH & AI Awards
Daria Davydova, graduate of the HSE Graduate School of Business and Head of the AI Implementation Unit at the Artificial Intelligence Department of Alfa-Bank, received a prize at the TECH & AI Awards. She was awarded for the best AI solution for optimising business processes. The winners were determined as part of the VII Russian Summit and Awards on Digital Transformation (CDO/CDTO Summit & Awards).
May 20, 2026
HSE University Opens First Representative Office of Satellite Laboratory in Brazil
HSE University-St Petersburg opened a representative office of the Satellite Laboratory on Social Entrepreneurship at the University of Campinas in Brazil. The platform is going to unite research and educational projects in the spheres of sustainable development, communications and social innovations.
May 18, 2026
The 'Second Shift' Is Not Why Women Avoid News
Women are more likely than men to avoid political and economic news, but the reasons for this behaviour are linked less to structural inequality or family-related stress than to personal attitudes and the emotional perception of news content. This conclusion was reached by HSE researchers after analysing data from a large-scale survey of more than 10,000 residents across 61 regions of Russia. The study findings have been published in Woman in Russian Society.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Automatic Annotation of Discourse and Speech Formulas in Internet Communication: A Telegram Comment Corpus

P. 278–292.
Maslenikova A., Tatiana I. Popova

This article presents a system for the automatic processing of user comments aimed at annotating speech and discourse formulas that actively function in everyday interaction, including digital communication. A Python-based program using the Telegram API was developed to automate the collection, filtering, and annotation of empirical data. In addition to building a user corpus, the study also included the evaluation of automatic processing results. The source material was drawn from the Telegram news channel Fontanka SPB Online. As a result of automatic processing, 70 speech and discourse formulas were extracted and grouped based on their source lexicons. The classification of the examined multiword units was grounded in the findings of two research projects: the construction of the Pragmaticon in Moscow and the annotation of stable multiword units in Saint Petersburg. The implementation of automatic annotation enabled the identification of formulas with a high pragmatic load and captured their specific functions in internet communication. For example, semantic irony was observed in the use of formulas such as ‘khorosho’ (‘fine’) and ‘bez problem’ (‘no problem’), which traditionally indicate agreement. The study identified the most frequent types of user responses reflected by the formulas: affirmation and negation. The results demonstrate the potential of the automatic approach for describing speech and discourse formulas in digital discourse and highlight the need to refine existing classifications of speech act.

Language: English
DOI
Keywords: corpus linguisticscomputational linguisticsDiscourse Analysis

In book

27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part I. Speech and Computer. Lecture Notes in Artificial Intelligence 16187
Vol. 16187: Lecture Notes in Artificial Intelligence. , Springer, 2025.
Similar publications
Эволюция нарративов о международной системе во внешнеполитическом дискурсе Си Цзиньпина в 2013−2024 годах
Милькова А. А., Вестник Пермского университета. Серия: Политология 2026 Т. 20 № 1 С. 104–114
The article is devoted to strategic narratives about the international system in the foreign policy dis-course of the President of the People's Republic of China Xi Jinping. An analysis of their evolution revealed the growing influence of Chinese political thought in shaping the modern world order, which is a topic of great academic and practical ...
Added: May 6, 2026
Российская социология в условиях цифровизации общества: результаты анализа корпуса научных текстов
Smirnov A., Социологические исследования 2023 № 4 С. 39–50
Using the analysis of a corpus of texts from eight leading Russian sociological journals, the article examines the impact of the digitalization of society on sociology in 2000–2021. Frequency analysis of 13.8 thousand scientific texts tracked the introduction of concepts related to digitalization into academic circulation. The article reveals the differences between the journals, due ...
Added: March 18, 2026
Promotional adjectives in grant proposal abstracts: a corpus study
Dmitriy S. Tulyakov, Tatiana M. Permyakova, Ekaterina A. Balezina, Вестник Волгоградского государственного университета. Серия 2: Языкознание 2025 Vol. 24 No. 6 P. 58–67
By effectively integrating promotional discourse into grant proposal abstracts, researchers can more compellingly present their ideas and increase their chances of securing funding. Implications of promotional adjectives in grant writing might differ across various research fields. This study aims to explore the use of promotional adjectives in abstracts of research grant proposals in six research ...
Added: March 2, 2026
Динамика восприятия площадей в пространстве города носителями русского языка (сравнительный анализ по данным НКРЯ)
Belova P., В кн.: Актуальные вопросы лингвистики и литературоведения: сборник научных статей по материалам международной научной конференции памяти доктора филологических наук, профессора Л.А. Араевой (6–8 февраля 2025).: Кемеровский государственный университет, 2025. С. 155–160.
This article contains research results on the dynamics of squares’ perception in the city space in the Russian language picture of the world over time, starting from the second half of the XXth century to the present. Turning to the subcorpus of literary texts of the second half of the XXth century and the XXIst ...
Added: February 4, 2026
Internarrativity of Authoritarian Rule: A Discourse Analysis of Vladimir Putin’s Speeches between 2012 and 2019
Vershinin I., Canadian Journal of European and Russian Studies 2025 Vol. 18 No. 3 P. 28–64
This research examines how Russia, with a focus on Putin, shapes its messages by analyzing his interactions with foreign journalists between 2012 and 2019. To understand how these messages work together, the study employs the concept of ‘internarrativity,’ which explores how narratives are interlinked and reinforce one another. The author argues that it is essential ...
Added: December 13, 2025
Preposition drop in Russian spoken by Mari and Beserman bilinguals
Yakovleva A., Kosheliuk N., Moroz G., International Journal of Bilingualism 2025 P. 1–19
Aims and Research Questions: In this paper, we present a corpus-based study of preposition drop (p-drop) in the speech of Mari-Russian and Beserman-Russian bilinguals compared to the speech of Russian monolinguals. Based on data from spoken corpora, we demonstrate that the prepositions v ‘in’, k ‘to’, s ‘with’ are omitted in the speech of bilinguals ...
Added: November 26, 2025
Вариативность годов vs. лет в русских говорах: корпусное исследование
Zemicheva S., Moroz G., Naccarato C., Вопросы языкознания 2025 № 6 С. 7–34
Наличие супплетивной формы лет в парадигме существительного год отличает русский язык от других восточнославянских. При этом в русских говорах вместо лет может использоваться вариант годов. Данные панхронического подкорпуса НКРЯ показывают, что форма годов, зафиксированная впервые в XV в., на всем протяжении истории русского языка была периферийной, в XVII–XVIII вв. использовалась преимущественно в нехудожественных текстах, а в ...
Added: November 12, 2025
27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part II. Speech and Computer. Lecture Notes in Artificial Intelligence 16188
Springer, 2025.
This work is subject to copyright. All rights are solely and exclusively licensed by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or ...
Added: October 19, 2025
Employing computational linguistic technologies and oculography to develop diagnostic tool for detecting autoaggressive tendencies in young people: a riveted gaze into “get rid of the shackles of this world”
Khomenko A., Kasimova L., Sychugov E. et al., Psychiatria Danubina 2025 Vol. 37 No. Suppl. 1 P. 213–223
Background: Early recognition of autoaggressive tendencies in young people is essential for diagnostic screening and reducing suicidality risks. This can be achieved through psycholinguistic approaches such as corpus analysis and eye-tracking studies. Corpus research helps to develop generalized speech patterns of those at risk of suicide, while oculographic methods examine perceptual cues linked to suicidal ...
Added: October 19, 2025
Computational linguistics and intellectual technologies. Papers from the Annual International Conference "Dialogue" (2025)
[б.и.], 2025.
This collection includes 39 papers from the Dialogue 2025 International Conference on Computational Linguistics and Intelligent Technologies, representing a wide range of theoretical and applied research in the fields of natural language description, modeling language processes, and the development of practical computational linguistic technologies. This publication is intended for specialists in theoretical and applied linguistics and ...
Added: October 19, 2025
27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part I. Speech and Computer. Lecture Notes in Artificial Intelligence 16187
Springer, 2025.
Added: October 13, 2025
Variation in a Narrative Corpus of Mano and Kpelle: Contact-Induced or Not?.
Khachaturyan M., Konoshenko M., Moroz G. et al., , in: N’yng-dyuumgu, n’yng-ngafq: Festschrift for Ekaterina GruzdevaVol. 126.: Helsinki: Studia Orientalia, 2025. P. 35–59.
This paper explores a corpus of spontaneous narratives and narrative retellings told by children and adults in Mano and Kpelle, two contacting Mande languages. It focuses on quotative constructions as a key point of grammatical dissimilarity between Mano and Kpelle. In the Mano speech of some bilingual children, however, these constructions are found to manifest ...
Added: September 5, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics
Wien: Association for Computational Linguistics, 2025.
Added: August 26, 2025
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Association for Computational Linguistics, 2025.
Originally named the Association for Machine Translation and Computational Linguistics (AMTCL), the Association for Computational Linguistics was founded in 1962 and renamed the ACL in 1968. The ACL is run by some 20 volunteers overseeing the administration of the Association (organising elections, deciding on new actions, adapting to the fast changing trends of our fields), ...
Added: July 17, 2025
Переписка Н. С. Хрущева и Ф. Кастро периода Карибского кризиса: опыт компьютеризованного анализа
Герцен А. С., В кн.: Четвёртая зимняя школа по гуманитарной информатике.: Балтийский федеральный университет им. Иммануила Канта, 2020. С. 92–97.
The article analyzes the 1st Secretary of the Central Committee of the CPSU and Chairman of the Council of Ministers of the USSR N. S. Khrushchev and the leader of the Cuban revolution F. Castro Ruz’s letters written in the period from October 26 to 31, 1962 on the topic of the Caribbean crisis and ...
Added: July 15, 2025
An overview of morphosyntactic variation in the speech of Russian-Chuvash bilinguals: number, gender, case assignment and preposition drop
Grishanova A., Russian linguistics 2025 Vol. 49 Article 10
The purpose of this study is to present a summary of morphosyntactic variation and a detailed analysis of the phenomenon of preposition drop in the Russian speech of Chuvash bilinguals. Specifically, I investigate what underlying factors might condition the variation. I conduct a qualitative analysis of the data extracted from the corpus of Russian spoken ...
Added: July 10, 2025
Effects of the Belt and Road Initiative: Impact of the “Rise of China” on Russian foreign policy regarding Central-Eastern Europe (2013-2020)
Dontsov A., Leipzig: Universitätsbibliothek Leipzig, 2024.
By looking at the reactions of Russian actors regarding the development of the Belt and Road Initiative of the People’s Republic of China in Central and Eastern Europe, this dissertation presents new theoretical and empirical findings that can be used extensively in the fields of Area and Global Studies, as well as practical policymaking. In ...
Added: June 25, 2025
Computational Linguistics and Intellectual Technologies. Papers from the Annual International Conference “Dialogue” (2025)
., 2025.
The volume includes 9 papers from the international conference on computational linguistics and intelligent technologies “Dialogue 2025,” representing a wide range of theoretical and applied research in the fields of natural language description, modeling of linguistic processes, and the development of practically applicable computational linguistic technologies. Intended for specialists in theoretical and applied linguistics and intelligent ...
Added: April 28, 2025
Do Formal Stance Strategies Reveal Disciplinary Variation in Professional Scientific Writing?
Smirnova E. A., Pérez-Guerra J., International Journal of Applied Linguistics 2025 Vol. 35 No. 3 P. 1242–1261
Stance in academic discourse has been extensively studied, with numerous investigations indicating that its expression varies across disciplines, depending on the authors’ intention to either enhance or diminish their voice or presence (e.g. It seems fairly certain versus This is based on the belief that...). This paper hypothesises that stance can be viewed as a ...
Added: April 10, 2025
Computational Linguistics and Intellectual Technologies: Proceedings of the International Conference “Dialogue 2020"
., 2020.
Added: April 10, 2025
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit