• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Book chapter
  • The impact of syntactic structure on verb-noun collocation extraction
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
May 12, 2026
‘Any Real-Economy Company Can Use Our Products
The HSE Centre for Financial Research and Data Analytics combines fundamental and applied work, including in areas unique to Russia such as the connection between sentiment in the media and social networks and financial markets. The HSE News Service spoke with the centre’s director, Professor Tamara Teplova, about its work.
May 7, 2026
Researchers Find More Effective Approach to Revealing Majorana Zero Modes in Superconductors
An international team of researchers, including physicists from HSE MIEM, has demonstrated that nonmagnetic impurities can help more accurately reveal Majorana zero modes—quantum states considered promising building blocks for quantum computing. The researchers found that these impurities shift the energy levels that typically obscure the Majorana signal, while leaving the mode itself largely unaffected, thereby making its spectral peak more distinct. The study has been published in Research.
May 6, 2026
The Future of Cardiogenetics Lies in Artificial Intelligence
Researchers from the AI and Digital Science Institute at the HSE Faculty of Computer Science have developed a program capable of analysing regions of the human genome that were previously inaccessible for accurate interpretation in genetic testing. The program adapts large generative AI (GenAI) models for cardiogenetics to predict how specific mutations affect the function of individual genes.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

The impact of syntactic structure on verb-noun collocation extraction

P. 2–17.
Toldova S. Y., Akinina Y. S., Kuznetsov I. O.

Automatic verb-noun collocation extraction is an important natural language processing task. The results obtained in this area of research can be used in a variety of applications including language modeling, thesaurus building, semantic role labeling, and machine translation. Our paper de-scribes an experiment aimed at comparing the verb-noun collocation lists extracted from a large corpus using a raw word order-based and a syntax-based approach. The hypothesis was that the latter method would result in less noisy and more exhaustive collocation sets. The experiment has shown that the collocation sets obtained using the two methods have a surprisingly low degree of correspondence. Moreover, the collocate lists extracted by means of the window-based method are often more complete than the ones obtained by means of the syntax-based algorithm, despite its ability to filter out adjacent collocates and reach the distant ones. In order to interpret these differences, we provide a qualitative analysis of some common mismatch cases.

Language: English
Full text
Text on another site
Keywords: corpus analysiscollocationstreebanksyntactic dependenciesautomatic collocation extractionparsing

In book

Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной Международной конференции «Диалог» (Бекасово, 29 мая - 2 июня 2013 г.). В 2-х т.
Т. 1: Основная программа конференции. Вып. 12 (19). , М.: РГГУ, 2013.
Similar publications
A genre-based model of rhetorical structure in scoping review introductions
Elena V. Tikhonova, Kosycheva M. A., Training, Language and Culture 2025 Vol. 9 No. 4 P. 35–55
As genre modelling advances, describing research articles rhetorical structures becomes crucial. Though secondary to empirical studies, scoping reviews shape scholarly communication by framing analysis and setting epistemological benchmarks. Their introductions act as conceptual lenses, defining interpretive frameworks. However, most rhetorical models, designed for empirical articles, appear to be inadequate for scoping reviews. We propose a ...
Added: February 9, 2026
Репрезентация фрейма ГОРОД в текстах почтовой переписки: корпусное исследование
Куликова В. А., Человек: образ и сущность. Гуманитарные аспекты 2026 № 1 С. 64–81
The verbal representation of a city is studied using the material of 993 contexts containing a description of a city in postcards of the pre-revolutionary, Soviet and post-Soviet periods. The object of the analysis is the frame-structure CITY, whilst the subject of the analysis is the features of its verbalization in the corpus of postal ...
Added: November 2, 2025
Правовой режим персональных данных в социальных сетях: проблемы квалификации и практические аспекты обработки
Kovaleva N. N., Zhirnova N., Закон 2025 № 9 С. 61–68
The study is devoted to a comprehensive analysis of legal conflicts that arise during the processing of personal data on social networks. Based on a systematic study of Russian legislation and the evolution of case law, key problems of qualifying the status of such data have been identified. The erroneous identification of the concepts of ...
Added: October 1, 2025
Медиаконцепт «вакцинация» в дискурсе немецких СМИ во время пандемии COVID-19
Balakina Y. V., Вестник Томского государственного университета 2024 № 509 С. 23–34
The relevance of the research is justified by the influence of the media on the consciousness and behavior of people during the crisis, allowing to form discursive phenomena that have specific characteristics. In addition, it seems particularly relevant to use linguistic tools to describe media and political phenomena, as well as to apply media and ...
Added: December 12, 2024
Teaching Russian Through STEM: Contexts, Tools, and Approaches
L.: Routledge, 2024.
Routledge Russian Language Pedagogy and Research features edited volumes and monographs on the advances in the research and pedagogy of teaching the Russian language. Written by experts from across the world, the series brings together diverse schools of thought and serves as an inclusive discussion forum showcasing state-of-the-art advances in teaching and researching Russian. ...
Added: November 8, 2024
Academic English melting pot: Reconsidering the use of lexical bundles in academic writing
Gritsenko E.S, Kamou O.M., Russian Journal of Linguistics 2024 Vol. 28 No. 3 P. 615–632
Many studies addressing the differences in the use of lexical bundles in academic English by L1 and L2 writers interpret these differences as a deficiency or deviation that L2 writers need to eliminate. In this paper, we argue that this “deviant” use is not essentially the product of insufficient knowledge of English and/or Anglophone norms ...
Added: October 31, 2024
Hedges in Written Academic Discourse: A Corpus Analysis of L2 Students’ Project Proposals
Nuriiat Omarovna Omarova, , in: The Youth in Science: Challenges and Prospects.: [б.и.], 2024. P. 64–72.
Hedges, expressions of doubt used in writing to tone down claims, are crucial in academic discourse, as they weaken assertions and make texts less categorical. Even though hedging in academic writing has attracted scholars’ attention, the number of works focusing on disciplinary variation in the use of hedges in L2 learners’ texts remains relatively scarce. ...
Added: October 30, 2024
Оценочная лексика в почтовой коммуникации: динамический аспект
Куликова В. А., В кн.: Динамика коммуникативных практик в почтовой переписке (на материале корпуса «Пишу тебе»).: М.: Издательство РОИФН, 2024. Гл. 3 С. 92–130.
В главе 3 «Оценочная лексика в почтовой коммуникации: динамический аспект» (В.А. Куликова) изучается динамика на примере отдельной лексической подсистемы – качественных прилагательных  с оценочной семантикой. Изучено функционирование оценочных прилагательных и их дериватов в текстах дореволюционных, советских, постсоветских открыток: вхождение оценочных лексем в списки ключевых слов и уникальных слов, изменения в частотности и функционировании оценочных лексем. ...
Added: October 28, 2024
Exploring collocational complexity in L2 Russian: A corpus-driven contrastive analysis
Kopotev M., Klimov A., Kisselev O., International Journal of Bilingualism 2025 Vol. 29 No. 2 P. 439–455
Objective: The objective of this article is to discuss the pedagogical and practical need for automated assessment tools that enable teachers, researchers, and other language practitioners to relatively quickly and automatically assess the general proficiency of second language (L2) speakers according to a number of different linguistic parameters, specifically the use of collocations. Introduction: The Introduction discusses existing ...
Added: September 9, 2024
Словами героев русского рассказа: речевая картина XX века
Kirina M., Лукьянчикова А. С., В кн.: Русская и зарубежная филология в диалоге культур : материалы Всероссийской научно-практической конференции с международным участием (г. Ростов-на-Дону, 19–21 октября 2023 г.).: Издательство Южного федерального университета, 2024. С. 16–20.
Added: December 10, 2023
Семантическое наполнение понятия «популизм» в английском языке (опыт лексикографического и корпусного анализа)
Gritsenko E., Галочкин А. Е., Вопросы лексикографии 2023 № 27 С. 29–46
The aim of the article is to reveal the semantic content of the concept “populism” in modern English. The need to address this topic is driven by the fact that a significant part of the research is dedicated to the analysis of specific forms of populism or populist parties in the aspect of political science, discourse theory, political rhetoric, ...
Added: May 6, 2023
Изражавања ауторског става у уводницима и онлајн коментарима на руском језику
Trnavac R., Зборник Матице српске за филологију и лингвистику 2020 Vol. 63 No. 2 P. 139–153
AUTHORIAL STANCE EXPRESSED WITH GRAMMATICAL STRUCTURES IN EDITORIALS AND ONLINE COMMENTS IN RUSSIAN S u m m a r y This paper describes major patterns of register variation between editorials and online news comments based on the use of a wide range of lexico-grammatical structures that express stance in Russian. While applying the methodology of ...
Added: December 28, 2022
Плеонастические причастия в современной русской речи: функции и тенденции развития
Ю. М. Кувшинская, Н. А. Зевахина, Acta Linguistica Petropolitana. Труды института лингвистических исследований 2023 Т. 19 № 1 С. 138–192
The paper studies tendencies in the use of full single (i.e. without their arguments)  redundant participles in the attributive position in the Russian written discourse. Relying upon the data of the Russian National Corpus and the Corpus of Russian Student Texts, as well as a number of the examples collected from various written sources, the ...
Added: December 8, 2022
Clausal complexity of expert and student writing: a corpus-based analysis of papers in social sciences
Smirnova E. A., Language Learning in Higher Education 2022 Vol. 12 No. 2 P. 453–475
Syntactic complexity has been extensively approached in the fields of corpus linguistics and academic discourse studies. However, works focusing on disciplinary variation in terms of linguistic complexity and comparison of professional and novice academic writing are scarce. Addressing these issues is likely to have important implications for EAP/ESP practitioners in terms of selection of target ...
Added: December 7, 2022
Terminology of Migration Studies: A Corpus Analysis of Research Papers in Social Sciences
Elizaveta Smirnova, Tatiana Permyakova, Migration Letters 2022 Vol. 19 No. 4 P. 401–412
Migration studies is a new, rapidly developing research area whose terminology is being established at the intersection of various social sciences. This article undertakes a quantitative and qualitative analysis of terms associated with migration, conducted on a 281,000-word corpus of research articles in social sciences, published in leading academic journals. Our analysis involves corpus processing ...
Added: August 1, 2022
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit