• A
  • A
  • A
  • АБВ
  • АБВ
  • АБВ
  • A
  • A
  • A
  • A
  • A
Обычная версия сайта
  • RU
  • EN
  • HSE University
  • Publications
  • Articles
  • Тематическая разметка антропологического корпуса: методика классификации шахтерских нарративов
  • RU
  • EN
Расширенный поиск
Высшая школа экономики
Национальный исследовательский университет
Priority areas
  • business informatics
  • economics
  • engineering science
  • humanitarian
  • IT and mathematics
  • law
  • management
  • mathematics
  • sociology
  • state and public administration
by year
  • 2027
  • 2026
  • 2025
  • 2024
  • 2023
  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016
  • 2015
  • 2014
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991
  • 1990
  • 1989
  • 1988
  • 1987
  • 1986
  • 1985
  • 1984
  • 1983
  • 1982
  • 1981
  • 1980
  • 1979
  • 1978
  • 1977
  • 1976
  • 1975
  • 1974
  • 1973
  • 1972
  • 1971
  • 1970
  • 1969
  • 1968
  • 1967
  • 1966
  • 1965
  • 1964
  • 1963
  • 1958
  • More
Subject
News
May 14, 2026
Resource Race and Green Transition: Three Unexpected Conclusions from Foresight Centres Research on Climate and Poverty
Beneath the surface of green energy—which most people associate with solar panels, electric vehicles, and reduced CO2 emissions—lies a complex web of geopolitical interests, international inequality, and resource constraints. Researchers from the Laboratory for Science and Technology Studies (LST) at the HSE ISSEK Foresight Centre have published a series of articles in leading international journals on hidden and overt conflicts surrounding critically important metals and minerals, as well as related processes in the energy sector.
May 13, 2026
Immersion in Second Language Environment Influences Bilinguals Perception of Emotions
Researchers at the Cognitive Health and Intelligence Centre at the HSE Institute for Cognitive Neuroscience have discovered how bilingual individuals process emotional words in their native (first) and non-native (second) languages. It was found that the link between word meaning and bodily sensations is weaker in a second language than in a first language. However, the more a person is immersed in a language environment, the smaller this difference becomes. The article has been published in Language, Cognition and Neuroscience.
May 12, 2026
‘Any Real-Economy Company Can Use Our Products
The HSE Centre for Financial Research and Data Analytics combines fundamental and applied work, including in areas unique to Russia such as the connection between sentiment in the media and social networks and financial markets. The HSE News Service spoke with the centre’s director, Professor Tamara Teplova, about its work.

 

Have you spotted a typo?
Highlight it, click Ctrl+Enter and send us a message. Thank you for your help!

Publications
  • Books
  • Articles
  • Chapters of books
  • Working papers
  • Report a publication
  • Research at HSE

?

Тематическая разметка антропологического корпуса: методика классификации шахтерских нарративов

Вестник Самарского университета. История, педагогика, филология. 2024. Т. 30. № 4. С. 156–164.
Мазитова Л. Л., Panteleeva L.

The article describes the methodology for creating an anthropological corpus of texts that are united by
belonging to the mining profession. The content of the work correlates with three research tasks: development of a
thematic classification, introduction of conventions for highlighting narratives in the text, 3) determination of principles
for organizing the corpus according to the themes of narratives. Thematic classification of narratives was the result of
the analysis of several «control» texts. It represents a multi-level systematization of topics of cultural and professional
nature: when the main (basic) topics can have internal detail, which leads to the emergence of micro-topics. The number of such microtopics can be different and is determined by the specifics of the topic itself, i.e. the ability to characterize the corresponding phenomenon of reality in various aspects. Fragments of text in which a particular topic or subtopic is implemented are highlighted with square brackets. On both sides of the brackets, the numerical designations of the corresponding topic/subtopic from the previously given thematic classification are indicated. During thematic marking, the principles of compliance of the narrative with the main theme of the corpus, incompleteness of the thematic classification, integrity of the narrative, non-rigid marking, taking into account «zero» topics are maintained. The article describes particular problems of marking, when a particular topic is not developed by the informant, despite the fact that the context allows for different interpretations of the topic of the narrative. In such situations, during the marking process, the topic purposefully represented by the informant is separated from facts that may have only an indirect relation to any topic, but may not have any. Thus, the described methodology represents the first approach to developing a meta-markup standard. The type of marking used can be called extra-linguistic based on the specifics of the subject information, analytical based on the method of identification, narrative based on the object, multi-level based on the depth of classification, manual based on the method of assigning labels, and neutral in relation to a certain theory. 

Research target: Philology and Linguistics
Language: Russian
Full text
DOI
Text on another site
Keywords: компьютерная лингвистиканарративcomputational linguistics narrativeантропологический корпусanthropological corpusextralinguistic markupthematic classificationэкстралингвистическая разметкатематическая классификация
Publication based on the results of:
Индустриальная культура: цифровые решения для исследовательской экосистемы (2025)
Similar publications
Proceedings of the 9th Student Research Workshop associated with the International Conference Recent Advances in Natural Language Processing
Velichkov B., Nikolova-Koleva I., Slavcheva M., INCOMA Ltd, 2025.
The RANLP 2025 Student Research Workshop (RANLPStud’2025) is a special track of the established international conference Recent Advances in Natural Language Processing (RANLP’2025). The RANLPStud is being organised for the 9th time and this year is running in parallel with the other tracks of the main RANLP 2025 conference. The target of RANLPStud’25 is to be a ...
Added: May 12, 2026
«Плоский мир» Т. Пратчетта глазами русскоязычного фандома
Кульков А. Н., Tsvetkova M. V., Вестник Томского государственного университета. Филология 2026 № 100 С. 158–173
Впервые делается попытка рассмотреть особенности фанфикшн как акта продуктивной рецепции, возникшего на основе цикла романов Терри Пратчетта о Плоском мире в России. Проведенный анализ показывает, что прежде всего авторы фанфиков стремятся передать стилистику и комическое начало оригинального цикла Пратчетта, вне зависимости от жанра и формата создаваемых ими произведений. Фикрайтеры наиболее часто обращаются к таким форматам, ...
Added: May 10, 2026
Вселенная Достоевского
Pershkina A., М.: Альпина нон-фикшн, 2026.
Филолог Анастасия Першкина рассказывает о том, как писатель создавал свой мир, кем его населил, какие законы установил и почему этот мир так ярко действует на нас. Кроме того, вы узнаете, кто помогал Федору Михайловичу работать, как писатель связывал между собой произведения, что думали о его текстах современники и что же такое достоевщина. ...
Added: May 6, 2026
The hypothesis of dependence of the lexical nature of mixed languages on the patterns of their emergence
Gridneva E., Vestnik Tomskogo Gosudarstvennogo Universiteta, Filologiya 2026 No. 100 P. 38–52
This study investigates mixed languages, with a specific focus on their lexical characteristics. It proposes and substantiates the hypothesis that the degree of lexical mixing in such languages — reflected in the prevalence of doublets and the distribution of vocabulary between source languages — is linked to the specific pattern of their emergence, rather than ...
Added: May 6, 2026
Арест писателя Гюнтера Хофе на франкфуртской книжной ярмарке в 1963 г.: конкурирующие образы в медийном пространстве ГДР и ФРГ
Керимов Р. Э., Новое прошлое 2026 № 1 С. 148–162
The arrest of East German writer and publishing director Günter Hofé at the 1963 Frankfurt Book Fair became a unique episode of ideological confrontation between East and West Germany. Hofé is primarily known for his documentary-fiction trilogy about World War II, in which he actively participated as a Wehrmacht soldier. The analysis of the writer’s ...
Added: May 5, 2026
Семантический ореол сакрального в четырехстопном амфибрахии: механизмы культурной памяти в поэзии Ольги Седаковой
Максимов И. В., Новый филологический вестник 2025 Т. 73 № 2 С. 187–196
The majority of studies on the metrical aspects of Olga Sedakova’s poetry focus on the formal elements of versification, rarely exploring the substantive possibilities of the chosen metres. This paper fills this gap by analyzing the unified narrative of the four-foot amphibrach, tracing its development in Russian poetry from V.A. Zhukovsky to O.A. Sedakova. At ...
Added: May 5, 2026
Кубанская стела (Musée des Beaux Arts Grenoble, Collection égyptienne, inv. 1937, 1969, 3565)
Крол А. А., Кузнецов Д. А., Ladynin I. A., Восток. Афро-азиатские общества: история и современность 2026 Т. 1 С. 244–261
The publication presents a new translation and commentary of the Quban Stela of Ramesses II (Musée des beaux-arts Grenoble, Collection égyptienne, inv. 1937, 1969, 3565). This monument dates to the beginning of his reign (ca. 1287 BC); it was found near the ruins of the fortress of Baki, close to the Nubian village of Kuban. The composers of the ...
Added: May 5, 2026
Царь Рамсес и Бактрия. Об одном мотиве позднеегипетского историописания
Ladynin I. A., Вестник древней истории 2024 Т. 84 № 1 С. 5–26
The article analyses a set of Classical evidence reflecting the Egyptian conquest of Bactria or its attempt (Diod. I. 46–47; Tac. Ann. II. 60. 3; Strabo XVII. 1. 46), a statement of Manetho of Sebennytos on the vast conquests of king Sethos-Ramesses (I) (Manetho. Frg. 50 = Ios. C.Ap. I. 15. § 98–102), and the ...
Added: May 5, 2026
Цикл И. Бабеля «Великая Криница»: темпоральная структура в свете модерна.
Гендлина В. В., Новый филологический вестник 2025 № 1 С. 144–154
В статье анализируются две новеллы Исаака Бабеля начала 1930-х гг. о коллективизации -- «Гапа Гужва» и «Колывушка». Новеллы должны были стать частью цикла о коллективизации под общим названием «Великая Криница», однако замысел книги о преобразованиях в советской деревне оказался невоплощенным. В обеих новеллах Бабель показывает грандиозный проект модернизации колхозов как процесс, разрушающий существующий порядок и жизнь отдельно ...
Added: May 4, 2026
К вопросу о частеречной принадлежности и именовании нефинитных форм в лесном ненецком языке
Starchenko A., Kozlov A., Белов П. А., Известия РАН. Серия литературы и языка 2026 Т. 85 № 1 С. 77–97
The article examines the problem of part-speech classification and the terminological description of non-finite forms in Forest Nenets, drawing on new data from the Pur dialect. The study analyzes the system of Forest Nenets non-finite forms, which includes action nouns, participles, the gerund, the conditional form, and the supine. The analysis is carried out within ...
Added: May 4, 2026
РЕЧЕВЫЕ АКТЫ С ВЕЖЛИВЫМИ ДИМИНУТИВАМИ: ЖАНРОВЫЕ И ДИСКУРСИВНЫЕ ОСОБЕННОСТИ
Fufaeva I., Вестник Волгоградского государственного университета. Серия 2: Языкознание 2025 Т. 24 № 4 С. 78–90
The study delves into speech acts with diminutives used for politeness, focusing on their discursive and genre-related aspects. It draws on authorial recordings of colloquial speech, data from the National Corpus of the Russian Language, and recordings of urban speech from the 1970s and late twentieth century. The research highlights the potential usage of polite ...
Added: May 2, 2026
Искусственный интеллект как инструмент дифференциации при обучении иностранному языку
Bogolepova S., Smirnova A., Иностранные языки в школе 2026 № 4 С. 5–11
Differentiation in foreign language teaching is essential for accommodating individual trajectories of communicative competence development; however, its implementation is hindered by teachers' lack of time, resources, and training. Artificial intelligence (AI) helps overcome these barriers by enabling differentiation across content, process, and product. The article illustrates practical techniques supported by AI, including sample prompts and ...
Added: May 1, 2026
XI Международная конференция молодых исследователей "Текстология и историко-литературный процесс": сборник статей
М.: Издательские решения, 2025.
В настоящий сборник вошли работы участников XI Международной конференции «Текстология и историко-литературный процесс» на филологическом факультете МГУ имени М. В. Ломоносова. Статьи, представленные в книге, посвящены вопросам текстологии и истории литературы. ...
Added: April 30, 2026
«Подснежник. Журнал для детского и юношеского возрастов» (Санкт-Петербург, 1858 –1862). Роспись содержания
Фатеева М. С., Литературный факт 2022 Т. 26 № 4 С. 248–277
Работа представляет собой роспись содержания журнала для детского и юношеского чтения «Подснежник», выходившего в Санкт-Петербурге в 1858–1862 гг. под редакцией В.Н. Майкова. В издании журнала принимали участие многие хорошо известные литераторы середины XIX в. (И.А. Гончаров, Д.В. Григорович, А.Н. Майков и др.). Во вступительной статье кратко обрисована история издания «Подснежника», охарактеризованы появлявшиеся в нем материалы ...
Added: April 30, 2026
Ирония в пьесе Ватсараджи «Киратарджуния» (XII в.)
Минаева М. Д., Вестник Института востоковедения РАН 2025 № 6 С. 143–155
This article examines the rhetorical device of “irony” in the Sanskrit poetic tradition, using examples from the medieval playwright Vatsarāja’s Kirātārjunīya (“The Kirāta and Arjuna,” 12th century). This play belongs to the rare vyāyoga genre, which is characterized by the depiction of a great battle between two renowned heroes accompanied by a verbal duel filled with ...
Added: April 30, 2026
Способы введения специальной терминологии в научно-популярный нарратив медицинской тематики (на материале произведений Г. Марша)
Nagornaya A., Пинчукова А. Е., Мир науки. Социология, филология, культурология 2025 Т. 16 № 4 С. 1–13
The article examines the use of specialized terminology in popular science texts on medical topics. It focuses on the phenomenon of popularizing medical knowledge in contemporary English-language culture, identifies the reasons for the widespread demand for this type of literature (the need for reliable information, increased personalization of medical texts, and the diversity of genres ...
Added: April 27, 2026
What we do in the shadows of the pear tree: Tense switching in Shughni Pear Stories
Melenchenko M., Indo-Iranian Languages 2026 Vol. 2 No. 1 P. 74–99
This article presents the results of a study on the narrative functions of verb tenses in Shughni. Shughni is an Eastern Iranian language with a compact TAME system, which has tensed evidentials (with Preterite being the direct past and Perfect, the indirect past) and lacks grammaticalized aspect. The current study analyzes five narrations of the ...
Added: April 25, 2026
Автоматическое выявление побуждений в тексте: применение методов компьютерной лингвистики в работе эксперта-лингвиста
П.Е. Белова, А.К. Сафарян, В кн.: Научно-практическая конференция с международным участием "Национальные и международные тенденции и перспективы развития судебной экспертизы". Сборник докладов.: Н. Новгород: Изд-во ННГУ им. Н.И. Лобачевского, 2024.
В данной статье представлено описание системы автоматического поиска и извлечения побуждений из текстов на русском языке FindImper, основанной на поиске глагольных форм и синтаксических связей. Алгоритм реализован на языке программирования Python с использованием библиотек для морфологического и синтаксического анализа и набора правил. Данный инструмент направлен на оптимизацию работы эксперта-лингвиста и доступен к использованию через веб-сайт ...
Added: January 30, 2026
Дискурсивные возможности больших языковых моделей при решении задач генерации новых текстов
Mylnikova A., Гасимов А. Р., Научно-техническая информация. Серия 2: Информационные процессы и системы 2025 № 9 С. 33–38
На основе изучения функционирования больших языковых моделей (LLMs) и специфических характеристик машинной обработки дискурса показано применение экспериментального метода компьютерного и лингвистического анализа для статистического исследования и интерпретации лингвистических характеристик текстов. В качестве материалов исследования использован лингвистический корпус текстов Brown, а также корпуса искусственно сгенерированных текстов с применением Claude Sonnet 3.7 и Grok-3. В механизмах обработки ...
Added: November 19, 2025
Эскалация как осознанное и бессознательное.
Filippov A. F., Россия в глобальной политике 2025 Т. 23 № 1 С. 36–44
Frames and narratives also operate in politics. This means that political figures don't simply observe each other's actions. In interpreting the actions, intentions, and intentions of another politician, they use frames and narratives, often without realizing it. ...
Added: November 12, 2025
«Детям моим»: понимание символов природы в эпистолярном нарративе П.А. Флоренского
Щедрина Т. Г., Schedrina I., Человек 2025 Т. 36 № 5 С. 177–184
The speed at which the world around a person and their own perception of it are changing is currently increasing more and more. This statement fully applies to the processes affecting the phenomenon of science. Methods change, materials change, and the scientist themselves changes. These changes in science entail the necessity of rethinking research approaches ...
Added: October 26, 2025
Automatic Annotation of Discourse and Speech Formulas in Internet Communication: A Telegram Comment Corpus
Maslenikova A., Tatiana I. Popova, , in: 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part I. Speech and Computer. Lecture Notes in Artificial Intelligence 16187Vol. 16187: Lecture Notes in Artificial Intelligence.: Springer, 2025. P. 278–292.
This article presents a system for the automatic processing of user comments aimed at annotating speech and discourse formulas that actively function in everyday interaction, including digital communication. A Python-based program using the Telegram API was developed to automate the collection, filtering, and annotation of empirical data. In addition to building a user corpus, the ...
Added: October 19, 2025
27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part II. Speech and Computer. Lecture Notes in Artificial Intelligence 16188
Springer, 2025.
This work is subject to copyright. All rights are solely and exclusively licensed by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or ...
Added: October 19, 2025
Employing computational linguistic technologies and oculography to develop diagnostic tool for detecting autoaggressive tendencies in young people: a riveted gaze into “get rid of the shackles of this world”
Khomenko A., Kasimova L., Sychugov E. et al., Psychiatria Danubina 2025 Vol. 37 No. Suppl. 1 P. 213–223
Background: Early recognition of autoaggressive tendencies in young people is essential for diagnostic screening and reducing suicidality risks. This can be achieved through psycholinguistic approaches such as corpus analysis and eye-tracking studies. Corpus research helps to develop generalized speech patterns of those at risk of suicide, while oculographic methods examine perceptual cues linked to suicidal ...
Added: October 19, 2025
  • About
  • About
  • Key Figures & Facts
  • Sustainability at HSE University
  • Faculties & Departments
  • International Partnerships
  • Faculty & Staff
  • HSE Buildings
  • HSE University for Persons with Disabilities
  • Public Enquiries
  • Studies
  • Admissions
  • Programme Catalogue
  • Undergraduate
  • Graduate
  • Exchange Programmes
  • Summer University
  • Summer Schools
  • Semester in Moscow
  • Business Internship
  • Research
  • International Laboratories
  • Research Centres
  • Research Projects
  • Monitoring Studies
  • Conferences & Seminars
  • Academic Jobs
  • Yasin (April) International Academic Conference on Economic and Social Development
  • Media & Resources
  • Publications by staff
  • HSE Journals
  • Publishing House
  • iq.hse.ru: commentary by HSE experts
  • Library
  • Economic & Social Data Archive
  • Video
  • HSE Repository of Socio-Economic Information
  • HSE1993–2026
  • Contacts
  • Copyright
  • Privacy Policy
  • Site Map
Edit