• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Articles
  • Cross-Level Semantic Similarity in Newswire Texts and Software Code Comments: Insights from Serbian Data in the AVANTES Project
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
April 30, 2026
HSE Researchers Compile Scientific Database for Studying Childrens Eating Habits
The database created at HSE University can serve as a foundation for studying children’s eating habits. This is outlined in the study ‘The Influence of Age, Gender, and Social-Role Factors on Children’s Compliance with Age-Based Nutritional Norms: An Experimental Study Using the Dish-I-Wish Web Application.’ The work has been carried out as part of the HSE Basic Research Programme and was presented at the XXVI April International Academic Conference named after Evgeny Yasin.
April 30, 2026
New Foresight Centre Study Identifies the Most Destructive Global Trends for Humankind
A team of researchers from the HSE International Research and Educational Foresight Centre has examined how global trends affect the quality of human life—from life expectancy to professional fulfilment. The findings of the study titled ‘Human Capital Transformation under the Influence of Global Trends’ were published in Foresight.
April 28, 2026
Scientists Develop Algorithm for Accurate Financial Time Series Forecasting
Researchers at the HSE Faculty of Computer Science benchmarked more than 200,000 model configurations for predicting financial asset prices and realised volatility, showing that performance can be improved by filtering out noise at specific frequencies in advance. This technique increased accuracy in 65% of cases. The authors also developed their own algorithm, which achieves accuracy comparable to that of the best models while requiring less computational power. The study has been published in Applied Soft Computing.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Cross-Level Semantic Similarity in Newswire Texts and Software Code Comments: Insights from Serbian Data in the AVANTES Project

Proceedings of the Conference on Language Technologies and Digital Humanities, Ljubljana, Slovenia. 2022. P. 124–131.
Miličević Petrović М., Batanović V., Trnavac Radoslava, Kovačević B.

This paper presents the Serbian datasets developed within the project Advancing Novel Textual Similarity-based Solutions in Software Development – AVANTES, intended for the study of Cross-Level Semantic Similarity (CLSS). CLSS measures the level of semantic overlap between texts of different lengths, and it also refers to the problem of establishing such a measure automatically. The problem was first formulated about a decade ago, but research on it has been sparse and limited to English. The AVANTES project aims to change this through the study of CLSS in Serbian, focusing on two different text domains – newswire and software code comments – and on two text length combinations – phrase-sentence and sentence-paragraph. We present and compare two newly created datasets, describing the process of their annotation with fine-grained semantic similarity scores, and outlining a preliminary linguistic analysis. We also give an overview of the ongoing detailed linguistic annotation targeted at detecting the core linguistic indicators of CLSS.

Research target: Humanities Philology and Linguistics
Language: English
Text on another site
Keywords: Serbian languageCross-Level Semantic Similarity
Similar publications
Class and Conjuncture in Television, Cinema and Literature
Mylonas Y., L.: Routledge, 2026.
This book presents a critical examination of how cultural forms, ranging from cinema and TV to literature, address class within the overarching context of a crisis conjuncture, specifically the period following the 2008 financial crash. It demonstrates how culture serves as a crucial site for capturing the contemporary "structure of feeling", publicly mediating the period's ...
Added: April 30, 2026
XI Международная конференция молодых исследователей "Текстология и историко-литературный процесс": сборник статей
М.: Издательские решения, 2025.
В настоящий сборник вошли работы участников XI Международной конференции «Текстология и историко-литературный процесс» на филологическом факультете МГУ имени М. В. Ломоносова. Статьи, представленные в книге, посвящены вопросам текстологии и истории литературы. ...
Added: April 30, 2026
«Подснежник. Журнал для детского и юношеского возрастов» (Санкт-Петербург, 1858 –1862). Роспись содержания
Фатеева М. С., Литературный факт 2022 Т. 26 № 4 С. 248–277
Работа представляет собой роспись содержания журнала для детского и юношеского чтения «Подснежник», выходившего в Санкт-Петербурге в 1858–1862 гг. под редакцией В.Н. Майкова. В издании журнала принимали участие многие хорошо известные литераторы середины XIX в. (И.А. Гончаров, Д.В. Григорович, А.Н. Майков и др.). Во вступительной статье кратко обрисована история издания «Подснежника», охарактеризованы появлявшиеся в нем материалы ...
Added: April 30, 2026
Ирония в пьесе Ватсараджи «Киратарджуния» (XII в.)
Минаева М. Д., Вестник Института востоковедения РАН 2025 № 6 С. 143–155
This article examines the rhetorical device of “irony” in the Sanskrit poetic tradition, using examples from the medieval playwright Vatsarāja’s Kirātārjunīya (“The Kirāta and Arjuna,” 12th century). This play belongs to the rare vyāyoga genre, which is characterized by the depiction of a great battle between two renowned heroes accompanied by a verbal duel filled with ...
Added: April 30, 2026
Новостной медианарратив в соцсети «ВКонтакте»: дискурсивные особенности
Алевизаки О. Р., Александрова И. Б., Кара-Мурза Е. С. et al., Вестник Московского университета. Серия 10: Журналистика 2021 № 3 С. 74–102
Особую роль на современном этапе развития журналистики играют социальные сети, которые используются как площадка для размещения материалов СМИ. Свои страницы в разных соцсетях имеют многие ведущие российские СМИ. Поскольку пользователи соцсетей прежде всего обращаютсяк новостным материалам, объектом данного исследования стал новостной дискурс, представленный в наиболее популярной в России соцсети – «ВКонтакте». На примере размещенных здесь ...
Added: April 29, 2026
Прецедентные феномены как когнитивная база для метафор (на примере современного англоязычного дискурса беременных)
Chermoshentseva K., Вестник Томского государственного университета 2025 № 520 С. 57–68
Precedent as a linguistic phenomenon is a complex phenomenon that can create a comprehensive image with minimal linguistic costs due to an extensive cognitive base. The mechanism involved in the use of precedent phenomena is similar to the process of metaphorization, since two cognitive spheres are involved, one of which is a resource base for describing ...
Added: April 29, 2026
"Возможно, мы оба заблуждаемся...": доклад В.М. Лавровского об англо-советской конференции историков 1958 года
Zemlyakov M., Средние века 2026 № 87 (1) С. 168–175
The publication establishes a full text of a report made by Professor Vladimir M. Lavrovsky at Lomonosov Moscow State University (Faculty of History). This report focuses on the results of his scientific mission to the Anglo-Soviet Conference (London, September 1958). Lavrovsky describes in details the discussion on his own report and accentuates the advantages of ...
Added: April 29, 2026
Семантика конструкций со значением всеобщности в малокарачкинском говоре чувашского языка в типологической перспективе
Russkih A., Урало-алтайские исследования 2026 Т. 60 № 1 С. 42–65
This paper examines constructions that express universal quantification in the Poshkart variety of the Chuvash language. This variety has five quantifier words for universals: por, mënbor, pëdëm, kaʐni, and veɕ. These words may combine with an additive or emphatic clitic, a possessive marker, or the instrumental case. The paper describes the semantic distribution of universal ...
Added: April 29, 2026
Паузы хезитации в педагогическом дискурсе: перцептивный аспект
Zubov V., Осадчая М. А., Риехакайнен Е. И., Вестник Санкт-Петербургского университета. Язык и литература 2026 Т. 23 № 1 С. 99–119
The article is a part of a comprehensive study of the linguistic characteristics of teacher’s speech, which contribute to the success of the pedagogical discourse. Based on a survey of secondary school students and an analysis of previous research in the field, non-syntactic pauses of hesitation were chosen as the object of the study, i. ...
Added: April 29, 2026
Метафора как инструмент организации воспоминаний в дискурсивном мнемическом нарративе
Chermoshentseva K., Социальные и гуманитарные науки на Дальнем Востоке 2026 Т. 23 № 1 С. 264–272
The article explores the use of metaphor as a tool for structuring and manifesting memories and events in speech. The relevance of this study stems from the rapid pace of research into memory and its manifestation at the narrative level through linguistic means. Its novelty lies in demonstrating the ways to utilize the two-sided nature ...
Added: April 28, 2026
Детерминологизация в языке СМИ и сопутствующие семантические процессы
Kolchina O., Romanova T. V., Журнал Сибирского федерального университета. Серия: Гуманитарные науки 2026 № 19(3) С. 605–617
This article examines the semantic and pragmatic transformations of cognitive linguistic terms influenced by mass media. When introduced into non-specialized discourse, a term can lose its connection with its scientific concept and develop new, common meanings through processes of narrowing, broadening, differentiation, attraction, metaphorical and metonymic transfer, and more. This results in an incorrect representation ...
Added: April 27, 2026
ЗООЛОГИЧЕСКАЯ МЕТАФОРА КАК СРЕДСТВО РЕПРЕЗЕНТАЦИИ ПСИХОЭМОЦИ-ОНАЛЬНЫХ СОСТОЯНИЙ (НА МАТЕРИАЛЕ ПРОИЗВЕДЕНИЙ Ч. ПАЛАНИКА)
Tsygunova M., Этнопсихолингвистика 2026 № 1(24) С. 81–98
This paper examines the use of animalistic metaphors in Chuck Palahniuk’s prose to describe psychoemotional states. The relevance of the study lies in uncovering the mechanisms by which complex emotional experiences are conveyed through animal metaphors. The aim of the research is to report on the way the author conceptualizes the interconnection between human behaviour ...
Added: April 27, 2026
Способы введения специальной терминологии в научно-популярный нарратив медицинской тематики (на материале произведений Г. Марша)
Nagornaya A., Пинчукова А. Е., Мир науки. Социология, филология, культурология 2025 Т. 16 № 4 С. 1–13
The article examines the use of specialized terminology in popular science texts on medical topics. It focuses on the phenomenon of popularizing medical knowledge in contemporary English-language culture, identifies the reasons for the widespread demand for this type of literature (the need for reliable information, increased personalization of medical texts, and the diversity of genres ...
Added: April 27, 2026
Японский язык в вузе: актуальные проблемы преподавания. Сборник научных работ. Материалы Второго международного форума «Языки и культуры Восточной Азии в образовательном пространстве» (МГПУ, 23–26 апреля 2025). Выпуск 30
МГПУ, Языки народов мира, 2025.
30-й выпуск сборника «Японский язык в вузе» продолжает серию публикаций, посвященных различным вопросам теории и практики преподавания японского языка, лингвистики, культурологии. Данный выпуск содержит материалы Второго международного форума «Языки и культуры Восточной Азии в образовательном пространстве», проходившего в МГПУ, 23–26 апреля 2025 года. ...
Added: April 26, 2026
What we do in the shadows of the pear tree: Tense switching in Shughni Pear Stories
Melenchenko M., Indo-Iranian Languages 2026 Vol. 2 No. 1 P. 74–99
This article presents the results of a study on the narrative functions of verb tenses in Shughni. Shughni is an Eastern Iranian language with a compact TAME system, which has tensed evidentials (with Preterite being the direct past and Perfect, the indirect past) and lacks grammaticalized aspect. The current study analyzes five narrations of the ...
Added: April 25, 2026
Новый большой сербско-русский словарь (общая концепция и проблемы лексикографического описания)
Драгичевич Р., Королькова М. Д., Ryzhova D. et al., Вопросы лексикографии 2024 № 32 С. 43–60
Added: January 31, 2025
ИЗРАЖАВАЊЕ ЕВАЛУАЦИЈЕ И ЕМОЦИЈА У ЈЕЗИЧКОМ СИСТЕМУ
Trnavac R., Јужнословенски филолог 2019 No. LXXV P. 59–81
The purpose of this work is to analyze the functions of emotions in the Appraisal Framework  (MARTIN & WHITE 2005). On the basis of an analysis of online comments from Nezavisimaya.ru and Politika online in Russian and  Serbian, the paper analyzes the relationship among the sub-systems of Appraisal. ...
Added: December 31, 2022
Реторичка структурална теорија и елементарне јединице дискурса у српском и руском језику
Trnavac R., Наш језик 2019 Vol. 50 No. 2 P. 551–570
In this paper, we develop the analysis of discourse relations on the basis of Rhetorical Structure Theory (MannThompson 1988) on the corpus of online news comments which are published in the Serbian broadsheet Politika Online and the Russian site Russia Today. We look at the way and the frequency of discourse relations are signalized in ...
Added: December 30, 2022
Кохерентност и евалуација у тексту у српском и руском језику
Trnavac R., Beograd: САНУ, Одбор за српски језик и књижевност у поређењу са другим језицима и књижевностима, 2018.
The primary goal of this study on the topic of coherence is to examine how coherence relations are signaled in discourse and which signals are used to present coherence relations in the Serbian and Russian languages. Another goal within the same topic is to investigate the question of whether coherence relations are more frequent as ...
Added: December 28, 2022
Семантичка анализа неколико придева са афективним и искуственим значењем у српском и руском језику
Trnavac R., Slavistika 2020 Vol. 24 No. 2 P. 92–114
The objective of this paper is to demonstrate the semantic analysis of twelve adjectives with affective and experiential meanings in Serbian and in Russian based on the Natural Semantic Metalanguage (Goddard and Wierzbicka 2014; Goddard et al. 2019). We suggest that a basic distinction between the two above-mentioned groups of adjectives can be established thanks ...
Added: December 28, 2022
Сигнализација дискурсних односа у новинском тексту на српском и руском језику
Trnavac R., , in: Језици и културе у времену и просторуVol. 9.: Универзитет у Новом Саду, Филозофски факултет, 2020. P. 223–234.
SIGNALIZATION OF DISCOURSE RELATIONS IN THE SERBIAN AND RUSSIAN NEWSPAPER TEXTS Summary This paper presents a collection of groups of signals that mark coherence relations within the corpus of newspaper articles on cultural events from the broadsheet newspapers Politika online and Nezavisimaya Gazeta in Serbian and in Russian. We find the following groups of signals: ...
Added: December 28, 2022
Језици и културе у времену и простору
Универзитет у Новом Саду, Филозофски факултет, 2020.
SIGNALIZATION OF DISCOURSE RELATIONS IN THE SERBIAN AND RUSSIAN NEWSPAPER TEXTS Summary This paper presents a collection of groups of signals that mark coherence relations within the corpus of newspaper articles on cultural events from the broadsheet newspapers Politika online and Nezavisimaya Gazeta in Serbian and in Russian. We find the following groups of signals: ...
Added: December 28, 2022
Некоторые особенности глагольной акцентуации в староштокавских памятниках XV в.
Pekunova I., В кн.: Accent matters. Papers on Baltic and Slavic accentology / STUDIES IN SLAVIC AND GENERAL LINGUISTICS, vol. 37Vol. 37: Accent matters. Papers on Baltic and Slavic accentology.: Editions Rodopi B.V., 2011. С. 295–308.
Previously it has been found that while the a.p.a in Serbian manuscriptsis is stable, reflection of the a.p.b shows diversity. It has been established in particular (Dybo 1983) that in Ev.-apr. and Sborn. the etymological root length is relevant for the accentuation in present forms of a.p.b j-praesentia: in 1SgPrae the long-root prefixed verbs demonstrate ...
Added: September 24, 2018
О некоторых акцентуационных особенностях существительных а.п. c в старосербских памятниках
Ирина С. Пекунова, В кн.: Stressing the past. Papers on Baltic and Slavic accentology / STUDIES IN SLAVIC AND GENERAL LINGUISTICS, vol. 35Vol. 35: Stressing the past. Papers on Baltic and Slavic accentology.: Amsterdam: Editions Rodopi B.V., 2009. С. 93–100.
Added: September 24, 2018
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit