• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • Pragmatic Markers and Parts of Speech: on the Problems of Annotation of the Speech Corpus
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
June 5, 2026
Neural Network Maps as a Method for Constructing Mathematical Models
Scientists from HSE University–Nizhny Novgorod and the Institute of Physics Belgrade, Serbia, are jointly exploring the application of machine learning techniques and neural networks to the study of nonlinear dynamics. Natalya Stankevich, Leading Research Fellow at the Laboratory of Topological Methods in Dynamics of the Faculty of Informatics, Mathematics, and Computer Science at HSE University–Nizhny Novgorod, spoke to the HSE News Service about this international project.
June 5, 2026
‘In the Age of Technology, It Is Interesting to Look into the Past and Think about What We Can Take from It
Polina Tabakova decided to apply for a Philology degree at HSE in Nizhny Novgorod because she grew up in Mari El and did not want to move far away from the Russian forests. In an interview for the Young Scientists of HSE University project, she spoke about the genre of the campus novel, the existential drama of Kolobok, and a blackout version of Eugene Onegin.
June 5, 2026
HSE Scientists Develop Method to Compress Large Language Models Without Losing Quality
Researchers from the AI and Digital Science Institute at the HSE Faculty of Computer Science have developed a new compression method for large language models such as GPT and LLaMA that reduces their size by 25–36% without additional training or significant loss of accuracy. This is the first approach to use mathematical transformations—specifically, rotations of model weights—to make models more amenable to compression with structured matrices. The study results have been published in ACL Findings 2025. The code is available on GitHub.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Pragmatic Markers and Parts of Speech: on the Problems of Annotation of the Speech Corpus

P. 129–139.
Bogdanova-Beglarian Natalia, Zaides K.
Language: English
Text on another site
Keywords: part of speechspeech corpusspoken speechpragmaticalizationpragmatic markermodel of formation

In book

CEUR Workshop Proceedings (Proceedings of the International Conference "Internet and Modern Society" IMS-2020, 17-20 June 2020, ITMO University, St. Petersburg, Russia)
CEUR Workshop Proceedings, 2020.
Similar publications
Особенности функционирования существительных с опустошенной семантикой в русской разговорной речи
Nikishina E., Труды института русского языка им. В.В. Виноградова 2025 № 3(45) С. 231–244
The article examines placeholder nouns (штука, фигня, хреновина, etc.) in Russian colloquial speech. These words can function similarly to pronouns by substituting for other words, but their range of functions is much broader than that of pronouns. The study analyzes two groups of vague reference words: initially neutral ones (штука, вещь, дело) and initially evaluative ...
Added: March 8, 2026
Multiword Units in Russian Everyday Speech: Empirical Classification and Corpus-Based Studies
Natalia V. Bogdanova-Beglarian, Olga V. Blinova, Khokhlova M. et al., , in: Speech and Computer: 26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25–28, 2024, Proceedings, Part I.: Springer, 2024. P. 187–200.
Added: November 9, 2024
Speech and Computer: 26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25–28, 2024, Proceedings, Part I
Springer, 2024.
The article is dedicated to the results of a research project describing the classes and functioning of multiword units in contemporary Russian everyday speech. The concept of multiword units encompasses quite diverse linguistic phenomena, making the creation of a working typology one of the project's central tasks. This typology is necessary for annotating corpus material ...
Added: November 9, 2024
Наш круг час от часу редеет: о диахронии семантики и сочетаемости конструкции «час от часу»
Панова А. В., Budennaya E., Русский язык в научном освещении 2024 № 2(48) С. 150–184
In the article the evolution of semantics and combinability of the construction čas ot času ‘hour from hour’ is considered using the framework of diachronic construction grammar. This construction belongs to a more general schema X ot ‘X-a ‘X from X-a’ with the meaning of time (den’ oto dnja ‘day from day’, god ot goda ...
Added: October 30, 2024
Multilingual Pragmaticon: Database of Discourse Formulae
Buzanov A., Bychkova P., Molchanova A. et al., , in: Proceedings of the Thirteenth Language Resources and Evaluation Conference.: European Language Resources Association (ELRA), 2022. P. 3331–3336.
The paper presents a multilingual database aimed to be used as a tool for typological analysis of response constructions called discourse formulae (DF), cf. English ‘No way¡ or French ‘Ça va¡(‘all right’). The two primary qualities that make DF of theoretical interest for linguists are their idiomaticity and the special nature of their meanings (cf. ...
Added: October 14, 2022
Рragmatic Markers in the Corpus “Оne Day of Speech”: Approaches to the Annotation
Zaides K., Popova T., Bogdanova-Beglarian Natalia, , in: Proceedings of Computational Models in Language and Speech Workshop (CMLS 2018) co-located with the 15th TEL International Conference on Computational and Cognitive Linguistics (TEL-2018)Vol. 2303: Computational Models in Language and Speech 2018.: Kazan: CEUR Workshop Proceedings, 2018. P. 128–143.
Added: February 3, 2022
Прагматические маркеры предикативного типа в устной спонтанной речи: подходы к описанию
Zaides K., Коммуникативные исследования 2019 Т. 6 № 2 С. 375–396
В статье обсуждается и обосновывается необходимость введения в коллоквиалистику понятия прагматического маркера (ПМ) предикативного типа, а также описывается инвентарь прагматических единиц русской устной спонтанной речи, в частности, те из них, которые, несмотря на свой по преимуществу хезитативный статус, обладают формальной предикативностью. С точки зрения синтаксической структуры все ПМ могут быть классифицированы на маркеры-слова, маркеры-словосочетания и ...
Added: February 3, 2022
Прагматические маркеры предикативного типа в устной спонтанной речи представителей разных социальных групп
Zaides K., Социо- и психолингвистические исследования 2020 № 8 С. 40–47
В статье рассматриваются особенности употребления прагматических маркеров предикативного типа (знаешь/те, (я) не знаю, (я) (не) думаю (что), представь/те и т. п.) в устной спонтанной речи представителей разных социальных групп. Материалом для исследования послужил рабочий подкорпус, сформированный из 150 000 токенов корпуса повседневной русской речи (фактически – диалогов) «Один речевой день» и 150 000 токенов корпуса ...
Added: February 3, 2022
Прагматический маркер ИЛИ ТАМ: свой среди чужих, чужой среди своих
Zaides K., Русская речь 2021 № 1 С. 22–36
В статье описываются функции и специфика употребления одного из прагматических маркеров, встречающихся в устной спонтанной речи, – или там. Данный маркер формально схож по модели построения с рефлексивными маркерами – или как его/её/их, или как это, или что и под. Однако, в отличие от этих маркеров, единица или там, как показано в статье, выполняет в устной речи принципиально иные функции – аппроксимативную ...
Added: February 3, 2022
Pragmatic Markers of Russian Everyday Speech: Invariants in Dialogue and Monologue
Bogdanova-Beglarian N., Blinova O. V., Sherstinova T. et al., , in: Speech and Computer. 23rd International Conference, SPECOM 2021, St. Petersburg, Russia, September 27–30, 2021Vol. 12997.: St. Petersburg: Springer, 2021. P. 81–90.
The paper presents the distribution of pragmatic markers (PM) of Russian everyday speech in two types of discourse: dialogical and monologic. PMs are an essential part of any oral discourse, therefore, quantitative data on their distribution are necessary for solving both theoretical and practical tasks related to studies of speech communication, as well as for ...
Added: October 31, 2021
Технологии эмоциональной демократии в эпоху постправды для манипуляции гражданским обществом в Республике Корея
Vishnyakova V., В кн.: Современные проблемы Корейского полуострова.: ИДВ РАН, 2021. Гл. 11 С. 116–126.
The article deals with the development of Korean linguistics and the formation of their linguistic tradition. Four main periods are distinguished such as origin, formation, division of Korea and the modern period, which are represented by the Korean linguists’ landmark achievements. The Korean linguistic tradition developed evolutionarily, and in a hundred years formed into an ...
Added: October 20, 2021
Artie Bias Corpus: An Open Dataset for Detecting Demographic Bias in Speech Applications
Meyer J., Rauchenstein L., Eisenberg J., , in: Proceedings of The 12th Language Resources and Evaluation ConferenceVol. 12.: European Language Resources Association (ELRA), 2020. P. 6462–6468.
We describe the creation of the Artie Bias Corpus, an English dataset of expert-validated <audio, transcript> pairs with demographic tags for age, gender, accent. We also release open software which may be used with the Artie Bias Corpus to detect demographic bias in Automatic Speech Recognition systems, and can be extended to other speech technologies. ...
Added: April 20, 2021
Позиционные свойства русских апеллятивов: формат описания в речевом корпусе
Blinova O. V., Компьютерная лингвистика и интеллектуальные технологии 2018 Т. 2 № 17(24) С. 96–109
The article suggests a way of modelling the linear position of appellatives in Russian. Under the name «appellatives» are combined the units with similar functions and syntactic properties, namely truncated vocative forms and discursive markers of the type «slushaj» (lit. ‘listen-Imp.2P’). The model assumes distinction between accented and non-accented uses in three positions (initial, middle, ...
Added: November 1, 2020
Pragmatic Markers in Dialogue and Monologue: Difficulties of Identification and Typical Formation Models
Natalia Bogdanova-Beglarian, Blinova O. V., Tatiana Sherstinova et al., , in: Speech and Computer. 22nd International Conference, SPECOM 2020Vol. 12335.: Springer, 2020. P. 68–78.
The paper deals with new research findings on pragmatic markers (PMs) use in spoken Russian. The study is based on two speech corpora: “One Day of Speech” (ORD, which contains mainly dialogues), and “Balanced Annotated Collection of Texts” (SAT, which contains only monologues). We explored two annotated subcorpora consisting of 321,504 tokens and 50,128 tokens respectively. ...
Added: November 1, 2020
Russian Pragmatic Markers Database: Developing Speech Technologies for Everyday Spoken Discourse
Sherstinova T., Blinova O. V., Богданова-Бегларян Н. В. et al., , in: Proceedings of the 26th Conference of Open Innovations Association FRUCT.: IEEE, 2020. P. 60–66.
The paper presents recent results obtained within the ongoing project dedicated to the study of Russian pragmatic markers. Pragmatic markers are obligatory elements of natural speech in any language; moreover, they are considered to be functionally important for speech production and overcoming inevitable speech difficulties. A correct understanding of use and functions of pragmatic markers ...
Added: November 1, 2020
Развитие языкознания в Республике Корея
Vishnyakova V., В кн.: Корейский полуостров: история и современность.: ИДВ РАН, 2020. Гл. 1 С. 407–416.
Added: October 28, 2020
Свойства дискурсивных формул на примере русских конструкций ты что и что ты
Bychkova P., Русский язык в научном освещении 2020 № 2 (40) С. 88–111
The paper discusses semantic description of the so-called discourse formulae, idiomatic expressions used as speaker's reactions in a dialogue. They are considered in the framework of construction grammar, as a peripheral class of constructions with its specific properties. A case study of two synonymous Russian discourse formulae TY ČTO and ČTO TY provides an account ...
Added: September 23, 2020
Pragmatic Markers of Russian Everyday Speech: the Revised Typology and Corpus-Based Study
Богданова-Бегларян Н. В., Blinova O. V., Sherstinova T. et al., , in: Proceedings of the 25th Conference of Open Innovations Association FRUCT, University of Helsinki, Helsinki, Finland.: Helsinki: IEEE, 2019. P. 57–63.
Pragmatic markers (PMs) mainly have an influence on a pragmatic aspect of communication and are mostly devoid of their own referential meaning. These markers are indispensable elements of oral communication in any language. The article suggests a typology of pragmatic markers for Russian everyday speech that includes 10 basic types. The frequency study for the ...
Added: October 31, 2019
Pragmatic Markers Distribution in Russian Everyday Speech: Frequency Lists and Other Statistics for Discourse Modeling
Богданова-Бегларян Н. В., Sherstinova T., Blinova O. V. et al., , in: Speech and Computer. 21st International Conference, SPECOM 2019, Istanbul, Turkey, August 20–25, 2019, ProceedingsVol. 11658.: Switzerland: Springer, 2019. P. 433–443.
Pragmatic markers (PMs) are discourse units (words and multiword expressions) with a weakened referential meaning, which perform a variety of pragmatic tasks. For example, in English the common PMs are “well”, “you know”, “I think”, and many others. PMs are integral elements of spoken discourse in every language. According to the results obtained from the ...
Added: October 29, 2019
Audible Paralinguistic Phenomena in Everyday Spoken Conversations: Evidence from the ORD Corpus Data
Sherstinova T., , in: Language, Music and Computing. Second International Workshop, LMAC 2017, St. Petersburg, Russia, April 17–19, 2017, Revised Selected PapersVol. 943.: Switzerland: Springer, 2019. P. 131–145.
Paralinguistic phenomena are non-verbal elements in conversation. Paralinguistic studies are usually based on audio or video recordings of spoken communication. In this article, we will show what kind of audible paralinguistic information may be obtained from the ORD speech corpus of everyday Russian discourse containing long-term audio recordings of conversations made in natural circumstances. This linguistic resource provides rich authentic ...
Added: October 29, 2019
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit