• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • Multiword Units in Russian Everyday Speech: Empirical Classification and Corpus-Based Studies
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
June 5, 2026
Neural Network Maps as a Method for Constructing Mathematical Models
Scientists from HSE University–Nizhny Novgorod and the Institute of Physics Belgrade, Serbia, are jointly exploring the application of machine learning techniques and neural networks to the study of nonlinear dynamics. Natalya Stankevich, Leading Research Fellow at the Laboratory of Topological Methods in Dynamics of the Faculty of Informatics, Mathematics, and Computer Science at HSE University–Nizhny Novgorod, spoke to the HSE News Service about this international project.
June 5, 2026
‘In the Age of Technology, It Is Interesting to Look into the Past and Think about What We Can Take from It
Polina Tabakova decided to apply for a Philology degree at HSE in Nizhny Novgorod because she grew up in Mari El and did not want to move far away from the Russian forests. In an interview for the Young Scientists of HSE University project, she spoke about the genre of the campus novel, the existential drama of Kolobok, and a blackout version of Eugene Onegin.
June 5, 2026
HSE Scientists Develop Method to Compress Large Language Models Without Losing Quality
Researchers from the AI and Digital Science Institute at the HSE Faculty of Computer Science have developed a new compression method for large language models such as GPT and LLaMA that reduces their size by 25–36% without additional training or significant loss of accuracy. This is the first approach to use mathematical transformations—specifically, rotations of model weights—to make models more amenable to compression with structured matrices. The study results have been published in ACL Findings 2025. The code is available on GitHub.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Multiword Units in Russian Everyday Speech: Empirical Classification and Corpus-Based Studies

P. 187–200.
Natalia V. Bogdanova-Beglarian, Olga V. Blinova, Khokhlova M., Tatiana Y. Sherstinova, Tatiana I. Popova
Language: English
DOI
Keywords: statistical analysisspeech corpuscollocation

In book

Speech and Computer: 26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25–28, 2024, Proceedings, Part I
Springer, 2024.
Similar publications
Analysis of the Method for Statistical Analysis of the Distribution Characteristics of Contact Radio Interference
Utkin B., Nikolay Grachev, Utkin M., , in: 2025 International Ural Conference on Electrical Power Engineering (UralCon).: IEEE, 2025. P. 241–245.
This article discusses the problem of statistical analysis of contact radio interference that occurs in mobile objects and affects the operation of radio receiving devices. The proposed method is based on a statistical analysis of the envelope distribution of the total level of contact interference using an electronic computer in real time. The method makes ...
Added: January 29, 2026
Психологические исследования креативности в России (2000 – 2017 гг.). Часть II. Методические рекомендации для исследователей
Мирошник К. Г., Sherbakova O., Психологический журнал 2020 Т. 41 № 3 С. 32–42
This article is concerned with analysis and discussion of empirical results of the study of the most common methodological practices in the Russian field of creativity research. In the present study, methodological practices are defined as research methodology, statistical data processing, and description of results in scientific papers. Based on the obtained results, we composed ...
Added: March 31, 2025
Психологические исследования креативности в России (2000 – 2017 гг.). Часть I. Анализ эмпирических работ
Мирошник К. Г., Sherbakova O., Психологический журнал 2020 Т. 41 № 2 С. 15–25
This article reports the results of the study of the most common methodological practices in the field of creativity research in Russia. In this study, methodological practices are understood as research methodology, statistical data processing, and description of results in scientific papers. Using the search query “creativity”, 369 articles with empirical data (N = 377) ...
Added: March 31, 2025
Speech and Computer: 26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25–28, 2024, Proceedings, Part I
Springer, 2024.
The article is dedicated to the results of a research project describing the classes and functioning of multiword units in contemporary Russian everyday speech. The concept of multiword units encompasses quite diverse linguistic phenomena, making the creation of a working typology one of the project's central tasks. This typology is necessary for annotating corpus material ...
Added: November 9, 2024
Macroeconomic indicators of Russia's media communication industry in 2000-2020: Quantitative analysis
Vartanov S., Vardanyan E., World of Media. Journal of Russian Media and Journalism Studies 2024 No. 1 P. 5–29
Technological and social processes of last years inspired by the information and communication technologies development, such as processes of digital transformation of society and the convergence of (mass) media, have led to the formation of a new macro-social entity - the media communication industry, integrated into the national and global economy and interacting in various ...
Added: April 1, 2024
Statistical model for assessing the reliability of non-destructive testing systems by solving inverse problems
Alexandrov A. E., S. P. Borisov, Bunina L. V. et al., Российский технологический журнал 2023 Vol. 11 No. 3 P. 56–69
Objectives. The wear monitoring of metal structural elements of power plants-in particular, pipelines of nuclear power plants-is an essential means of ensuring safety during their operation. Monitoring the state of the pipeline by direct inspection requires a considerable amount of labor, as well as, in some cases, the suspension of power plant operation. In order ...
Added: March 21, 2024
Статистические методы анализа экономики и общества. 14-я Международная научно-практическая конференция студентов и аспирантов (16-19 мая 2023 г.).
М.: Издательский дом НИУ ВШЭ, 2023.
В сборнике представлены отобранные оргкомитетом труды участников 14-й Международной научно-практической конференции студентов и аспирантов «Статистические методы анализа экономики и общества» из Азербайджана, Бангладеша, Беларуси, России, представляющих 22 вуза из 16 городов: Баку, Владивостока, Дакка, Екатеринбурга, Йошкар-Олы, Магнитогорска, Махачкалы, Минска, Москвы, Оренбурга, Ростова-на-Дону, Санкт-Петербурга, Саранска, Саратова, Ульяновска, Уфы. Исследования посвящены вопросам статистической методологии, применению математико-статистических и эконометрических ...
Added: January 16, 2024
Эмпирические вызовы и методологические подходы в сравнительной политологии (сквозь призму “Политического атласа современного мира 2.0”)
Melville A. Y., Мальгин А. В., Mironyuk M. et al., Полис. Политические исследования 2023 № 5 С. 153–171
In recent decades, the expanding volume, diversity and coverage of data have created new or have transformed existing areas of research. They have also turned data into a key element of politics today. In this context, the status of empirical research that became the political science mainstream at the turn of the 20th - 21st ...
Added: September 29, 2023
Профессиональный и социальный статусы как предикторы потребления алкоголя работающими россиянами
Eltsov N., Khorkina N., Вопросы статистики 2023 Т. 30 № 3 С. 92–108
The authors of the article analyze the characteristics of alcohol consumption by working residents of Russia, depending on their professional and social status. The study is based on data from the Russian Longitudinal Monitoring Survey of the National Research University Higher School of Economics for the period 2013-2019. The sample of the analysis included working ...
Added: March 24, 2023
Еще раз об анти-антропоморфизмах в Септуагинте: опыт статистического анализа
Seleznev M., Индоевропейское языкознание и классическая филология 2013 Т. 17 С. 800–812
The article deals with the treatment of the so-called anthropomorphisms of the Hebrew Bible in the Greek translation of the Hebrew Scriptures (the Septuagint). By anthropomorphism we mean attribution of human physical form or psychological characteristics to God (e.g. speaking about God’s eyes, God’s hand, God’s repentance etc.). Within the Septuagint some scholars find a ...
Added: December 2, 2022
Statistical Analysis and Modeling of User Micromobility for THz Cellular Communications
Stepanov N., Moltchanov D., Begishev V. et al., IEEE Transactions on Vehicular Technology 2022 Vol. 71 No. 1 P. 725–738
Terahertz (THz, 0.3!!-!!3 THz) wireless access is nowadays considered as a major enabling technology for sixth generation (6G) cellular systems. To compensate for high propagation losses, these systems will utilize antenna arrays with extremely directional beams. The performance of such systems will thus be heavily affected by micromobility such as shakes and rotations of user ...
Added: October 27, 2022
Pragmatic Markers and Parts of Speech: on the Problems of Annotation of the Speech Corpus
Bogdanova-Beglarian Natalia, Zaides K., , in: CEUR Workshop Proceedings (Proceedings of the International Conference "Internet and Modern Society" IMS-2020, 17-20 June 2020, ITMO University, St. Petersburg, Russia).: CEUR Workshop Proceedings, 2020. P. 129–139.
Added: February 3, 2022
Прагматический маркер ИЛИ ТАМ: свой среди чужих, чужой среди своих
Zaides K., Русская речь 2021 № 1 С. 22–36
В статье описываются функции и специфика употребления одного из прагматических маркеров, встречающихся в устной спонтанной речи, – или там. Данный маркер формально схож по модели построения с рефлексивными маркерами – или как его/её/их, или как это, или что и под. Однако, в отличие от этих маркеров, единица или там, как показано в статье, выполняет в устной речи принципиально иные функции – аппроксимативную ...
Added: February 3, 2022
Применение статистических тестов NIST для анализа выходных последовательностей блочных шифров
Perov A., Научный вестник Новосибирского государственного технического университета 2019 Т. 76 № 3 С. 87–96
Modern iterative block ciphers are one of the most popular methods for providing a secure information exchange in internet networks. A widespread use of this technology and the development of computing power give rise to a whole list of threats to cryptanalysis of ciphers. Ensuring cryptographic security is in this case one of the key ...
Added: November 22, 2021
СТАТИСТИЧЕСКОЕ ТЕСТИРОВАНИЕ СОВРЕМЕННЫХ ИТЕРАТИВНЫХ БЛОЧНЫХ ШИФРОВ С ПОМОЩЬЮ ПРОГРАММНОЙ БИБИЛОТЕКИ "УНИБЛОКС-2015"
Perov A., Инновации в жизнь 2016 № 2(17) С. 89–97
The statistical analysis of iterative block ciphers is carried out for detection of dependences of statistical properties of output sequence depending on number of rounds. It is reasonable to utilize source codes available in the Internet, but their integration into own programs is impeded by at least the following reasons. Firstly, different implementations have different ...
Added: November 1, 2021
Artie Bias Corpus: An Open Dataset for Detecting Demographic Bias in Speech Applications
Meyer J., Rauchenstein L., Eisenberg J., , in: Proceedings of The 12th Language Resources and Evaluation ConferenceVol. 12.: European Language Resources Association (ELRA), 2020. P. 6462–6468.
We describe the creation of the Artie Bias Corpus, an English dataset of expert-validated <audio, transcript> pairs with demographic tags for age, gender, accent. We also release open software which may be used with the Artie Bias Corpus to detect demographic bias in Automatic Speech Recognition systems, and can be extended to other speech technologies. ...
Added: April 20, 2021
Зависимость запасов древесины в лесах России от климатических параметров
Грабовский В., Zamolodchikov D., Лесоведение 2019 № 2 С. 83–92
Mean timber storages across different species and age groups of forests based on the State Forest Inventory 2013 data were correlated by means of regression analysis to climatic variables, averaged over 1981–2000. The following species categories were predefined: all species, conifers, hardwoods, softwoods, and others. The following age groups were predefined: all ages, young growth, ...
Added: April 17, 2021
Позиционные свойства русских апеллятивов: формат описания в речевом корпусе
Blinova O. V., Компьютерная лингвистика и интеллектуальные технологии 2018 Т. 2 № 17(24) С. 96–109
The article suggests a way of modelling the linear position of appellatives in Russian. Under the name «appellatives» are combined the units with similar functions and syntactic properties, namely truncated vocative forms and discursive markers of the type «slushaj» (lit. ‘listen-Imp.2P’). The model assumes distinction between accented and non-accented uses in three positions (initial, middle, ...
Added: November 1, 2020
Russian Pragmatic Markers Database: Developing Speech Technologies for Everyday Spoken Discourse
Sherstinova T., Blinova O. V., Богданова-Бегларян Н. В. et al., , in: Proceedings of the 26th Conference of Open Innovations Association FRUCT.: IEEE, 2020. P. 60–66.
The paper presents recent results obtained within the ongoing project dedicated to the study of Russian pragmatic markers. Pragmatic markers are obligatory elements of natural speech in any language; moreover, they are considered to be functionally important for speech production and overcoming inevitable speech difficulties. A correct understanding of use and functions of pragmatic markers ...
Added: November 1, 2020
Semantic Coherence in Schizophrenia in Russian Written Texts
Panicheva P., Litvinova T., , in: Proceedings of the 25th Conference of Open Innovations Association FRUCT, University of Helsinki, Helsinki, Finland.: Helsinki: IEEE, 2019. P. 241–249.
Schizophrenia is widely known to manifest in language disturbance. Namely, speech incoherence, tangentiality, derailment are indicative of thought disorder characteristic of schizophrenia. Recent advances in distributional semantics have made it possible to measure coherence in text in a unified and objective manner. It has been shown that semantic coherence measures based on distributional semantic models ...
Added: October 29, 2020
Early Study of Transistor and Circuit Parameter Variation for 180 nm High-Temperature SOI CMOS Production Technology
Lev M. Sambursky, Mamed R. Ismail-zade, Nina V. Blokhina, , in: 2020 Moscow Workshop on Electronic and Networking Technologies (MWENT).: IEEE, 2020. P. 1–7.
In this paper, we do an early study of circuit parameter variation for temperature-resistant SOI CMOS production technology on the examples of several standard circuit fragments. Circuits electrical characteristics are simulated at several values of temperature (in the range +27…+300 °C) and with account for MOSFET parameter mismatch figures derived from measurement data. The method ...
Added: May 7, 2020
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit