

Digital Resources for the Shughni Language

P. 61–64.
Makarov Y., Melenchenko M., Novokshanov D.

This paper describes the Shughni Documentation Project, which consists of the Online Shughni Dictionary, a morphological analyzer, an orthography converter, and a Shughni corpus. The online dictionary offers not only basic functions such as word lookup but also supports more complex tasks. Representing a lexeme as a network of database sections makes it possible to search in particular domains (e.g., in meanings only), and the system of labels facilitates conditional search queries. In addition, users can make search queries and view entries in different orthographies of the Shughni language, and they can send feedback if they spot mistakes. Editors can add, modify, or delete entries without programming skills via an intuitive interface. In the future, this website architecture could be applied to building a lexical database of Iranian languages. The morphological analyzer performs automatic analysis of Shughni texts, which is useful for linguistic research and documentation. Once the analysis is complete, homonymy resolution must be conducted so that the annotated texts are ready to be uploaded to the corpus. The analyzer makes use of the orthography converter, which helps to tackle the problem of spelling variability in Shughni, a language with no standard literary tradition.
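
The idea of searching within one section of a lexeme, combined with label conditions, can be illustrated with a minimal Python sketch. This is not the project's actual implementation: the `Lexeme` class, its field names, the `search` function, and the toy entries are all hypothetical and only show the general technique of domain-restricted search with label filters.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class Lexeme:
    """Hypothetical entry: each attribute stands for one searchable section."""
    headword: str
    meanings: List[str] = field(default_factory=list)
    examples: List[str] = field(default_factory=list)
    labels: Set[str] = field(default_factory=set)


def search(entries, query, domain="headword", required_labels=frozenset()):
    """Return entries whose `domain` section contains `query`
    and whose labels include every label in `required_labels`."""
    query = query.lower()
    hits = []
    for entry in entries:
        # Conditional part of the query: skip entries missing a required label.
        if not set(required_labels) <= entry.labels:
            continue
        section = getattr(entry, domain)
        texts = [section] if isinstance(section, str) else section
        # Domain part of the query: look only inside the chosen section.
        if any(query in text.lower() for text in texts):
            hits.append(entry)
    return hits


# Toy data: the headwords are placeholders, not real Shughni dictionary entries.
ENTRIES = [
    Lexeme("word1", meanings=["to mix, to stir"], labels={"verb"}),
    Lexeme("word2", meanings=["mixture"], examples=["a mix of herbs"], labels={"noun"}),
    Lexeme("mix", meanings=["unrelated sense"], labels={"noun"}),
]

# Search in meanings only, restricted to entries labelled as verbs.
verbs_meaning_mix = search(ENTRIES, "mix", domain="meanings", required_labels={"verb"})
print([e.headword for e in verbs_meaning_mix])  # → ['word1']
```

Keeping each section as a separate field means the same query string can match a headword in one search and a meaning in another, which is the behaviour the abstract describes as searching "in meanings only".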

Language: English
Keywords: computational linguistics, digital resources, Shughni

In book

Proceedings of The Workshop on Resources and Technologies for Indigenous, Endangered and Lesser-resourced Languages in Eurasia within the 13th Language Resources and Evaluation Conference
European Language Resources Association (ELRA), 2022.
Similar publications
What we do in the shadows of the pear tree: Tense switching in Shughni Pear Stories
Melenchenko M., Indo-Iranian Languages 2026 Vol. 2 No. 1 P. 74–99
This article presents the results of a study on the narrative functions of verb tenses in Shughni. Shughni is an Eastern Iranian language with a compact TAME system, which has tensed evidentials (with Preterite being the direct past and Perfect, the indirect past) and lacks grammaticalized aspect. The current study analyzes five narrations of the ...
Added: April 25, 2026
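Морфосинтаксический статус и семантика шугнанского показателя -ard: к развитию новых падежных маркеров в иранских языках [Morphosyntactic status and semantics of the Shughni marker -ard: toward the development of new case markers in Iranian languages]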
Падалка П. В., Ryzhova D., Чистякова Д. Г., Вопросы языкознания 2026 № 1 С. 40–58
The article is dedicated to the morphosyntactic properties and grammatical functions of the Shughni marker -ard. Shughni, like many other Iranian languages, has a reduced case system that seems to be gradually evolving and expanding. We demonstrate that the marker -ard is one of the main candidates for the status of a new case marker ...
Added: January 21, 2026
Базисная лексика шугнанского и бартангского языков [Basic vocabulary of the Shughni and Bartangi languages]
Armand E., Бадеев А. О., Родной язык: лингвистический журнал 2025 № 2 С. 153–175
The article analyzes the basic vocabulary (Swadesh list) of two closely related languages of the Shughni-Rushani subgroup of the East Iranian group of languages. The lists were collected by the authors using the elicitation method during field research in the summer of 2025. Special attention is paid to borrowings from Tajik, as well as to ...
Added: January 18, 2026
Глаголы поля ‘мешать’ в шугнанском языке [Verbs of the semantic field ‘mixing’ in Shughni]
Armand E., Ryzhova D., Ризвоншоева Н. Н., Известия РАН. Серия литературы и языка 2025 Т. 84 № 5 С. 99–111
The article presents a system of verbs of the semantic field of ‘mixingʼ in Shughni. Based on dictionary data and the results of speaker interviews using a typological questionnaire, we show that there are about 13 verbs in this semantic zone, including both simple and complex verbs. The paper considers the peculiarities of use of ...
Added: October 30, 2025
Automatic Annotation of Discourse and Speech Formulas in Internet Communication: A Telegram Comment Corpus
Maslenikova A., Popova T. I., in: 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part I. Speech and Computer. Lecture Notes in Artificial Intelligence, Vol. 16187. Springer, 2025. P. 278–292.
This article presents a system for the automatic processing of user comments aimed at annotating speech and discourse formulas that actively function in everyday interaction, including digital communication. A Python-based program using the Telegram API was developed to automate the collection, filtering, and annotation of empirical data. In addition to building a user corpus, the ...
Added: October 19, 2025
27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part II. Speech and Computer. Lecture Notes in Artificial Intelligence 16188
Springer, 2025.
This work is subject to copyright. All rights are solely and exclusively licensed by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or ...
Added: October 19, 2025
Employing computational linguistic technologies and oculography to develop diagnostic tool for detecting autoaggressive tendencies in young people: a riveted gaze into “get rid of the shackles of this world”
Khomenko A., Kasimova L., Sychugov E. et al., Psychiatria Danubina 2025 Vol. 37 No. Suppl. 1 P. 213–223
Background: Early recognition of autoaggressive tendencies in young people is essential for diagnostic screening and reducing suicidality risks. This can be achieved through psycholinguistic approaches such as corpus analysis and eye-tracking studies. Corpus research helps to develop generalized speech patterns of those at risk of suicide, while oculographic methods examine perceptual cues linked to suicidal ...
Added: October 19, 2025
Computational linguistics and intellectual technologies. Papers from the Annual International Conference "Dialogue" (2025)
[s.n.], 2025.
This collection includes 39 papers from the Dialogue 2025 International Conference on Computational Linguistics and Intelligent Technologies, representing a wide range of theoretical and applied research in the fields of natural language description, modeling language processes, and the development of practical computational linguistic technologies. This publication is intended for specialists in theoretical and applied linguistics and ...
Added: October 19, 2025
27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part I. Speech and Computer. Lecture Notes in Artificial Intelligence 16187
Springer, 2025.
Added: October 13, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics
Wien: Association for Computational Linguistics, 2025.
Added: August 26, 2025
Стандартизация шугнанского алфавита: теория и практика [Standardization of the Shughni alphabet: theory and practice]
Melenchenko M., В кн.: Шугнанские этюды. Сборник статей о шугнанском языке.: Хорог: Институт гуманитарных наук НАНТ, 2025. С. 18–38.
The article is devoted to the problem of standardization of the alphabet for the Shughni language, a burning issue for the Shughni people of Tajikistan. Although de facto Shughni has been used in writing for a long time now, it still does not have a stable orthography. One of the reasons for this is not ...
Added: August 2, 2025
Шугнанские этюды. Сборник статей о шугнанском языке [Shughni Etudes. Collection of Papers on the Shughni Language]
Хорог: Институт гуманитарных наук НАНТ, 2025.
The book "Shughni Etudes. Collection of Papers on the Shughni Language" (2025) is a reprint of articles published by the HSE Pamiri research team in 2020–2023. ...
Added: August 2, 2025
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Association for Computational Linguistics, 2025.
Originally named the Association for Machine Translation and Computational Linguistics (AMTCL), the Association for Computational Linguistics was founded in 1962 and renamed the ACL in 1968. The ACL is run by some 20 volunteers overseeing the administration of the Association (organising elections, deciding on new actions, adapting to the fast changing trends of our fields), ...
Added: July 17, 2025
Computational Linguistics and Intellectual Technologies. Papers from the Annual International Conference “Dialogue” (2025)
., 2025.
The volume includes 9 papers from the international conference on computational linguistics and intelligent technologies “Dialogue 2025,” representing a wide range of theoretical and applied research in the fields of natural language description, modeling of linguistic processes, and the development of practically applicable computational linguistic technologies. Intended for specialists in theoretical and applied linguistics and intelligent ...
Added: April 28, 2025
Цифровые ресурсы по уральским языкам Сибири: обзор, оценка и применение [Digital resources for the Uralic languages of Siberia: overview, evaluation, and application]
Кошелюк Н. А., Урало-алтайские исследования 2025 № 56 С. 60–93
Over the recent years, the growing trend of digitalization has given rise to many independent linguistic projects worldwide. However, there is no roadmap on how to navigate between the resources and what type and amount of information one can get. This paper overviews the openly available online resources which feature the Uralic languages of Siberia, comprising the ...
Added: April 20, 2025
Computational Linguistics and Intellectual Technologies: Proceedings of the International Conference “Dialogue 2020"
., 2020.
Added: April 10, 2025
О принципах глагольной колексификации [On the principles of verbal colexification]
Rakhilina E. V., Ryzhova D., Вопросы языкознания 2025 № 2 С. 7–25
The paper is devoted to the basic principles of verbal colexification defined at the cross-linguistic level. We understand colexification as a non-random association of two or more meanings within one and the same lexical item. We argue that these associations, reproducing in different languages, should follow certain principles and form clear patterns. Drawing on our extensive experience of ...
Added: March 11, 2025
Findings of the Association for Computational Linguistics: EACL 2024
Association for Computational Linguistics, 2024.
The 18th Conference of the European Chapter of the Association for Computational Linguistics. EACL is the flagship European conference dedicated to European and international researchers, covering a wide spectrum of research in Computational Linguistics and Natural Language Processing. ...
Added: February 17, 2025
Синтаксические свойства недостаточного глагола жӣwҷ ‘любить’ в шугнанском языке [Syntactic properties of the defective verb жӣwҷ ‘love’ in Shughni]
Melenchenko M., Типология морфосинтаксических параметров 2024 Т. 7 № 1 С. 64–88
“Defective” verb žīwǰ ‘love’ in Shughni only has the Perfect stem, but lacks Present and Preterite stems. Instead, to express these tenses, Shughni uses complex verbs with auxiliary verbs čīdow ‘do’ and vidow ‘be’. This paper attempts to describe the distribution of three constructions: the independent žīwǰ and the two complex verbs which use it, ...
Added: February 5, 2025
Тематическая разметка антропологического корпуса: методика классификации шахтерских нарративов [Thematic annotation of an anthropological corpus: a method for classifying miners' narratives]
Мазитова Л. Л., Panteleeva L., Вестник Самарского университета. История, педагогика, филология 2024 Т. 30 № 4 С. 156–164
The article describes the methodology for creating an anthropological corpus of texts that are united by belonging to the mining profession. The content of the work correlates with three research tasks: 1) development of a thematic classification, 2) introduction of conventions for highlighting narratives in the text, 3) determination of principles for organizing the corpus according to the themes of ...
Added: January 18, 2025
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Association for Computational Linguistics, 2024.
Added: January 2, 2025
Плюсквамперфект в шугнанском и других памирских языках [The pluperfect in Shughni and other Pamir languages]
Melenchenko M., Индоиранские языки 2025 Т. 1 № 1 С. 50–88
The article presents a comparative review of Pluperfects in Pamir languages, describing both structural and semantic properties of these forms. The data comes from the existing grammatical descriptions as well as recent work with speakers of contemporary Shughni and Ishkashimi conducted in Tajikistan. In the majority of Pamir languages (Bartangi, Roshorvi, Wakhi, Ishkashimi, Sanglechi, Yazghulami), ...
Added: December 27, 2024
Findings of the Association for Computational Linguistics: ACL 2024
Association for Computational Linguistics, 2024.
ACL 2024 invites the submission of long and short papers featuring substantial, original, and unpublished research in all aspects of Computational Linguistics and Natural Language Processing. As in recent years, some of the presentations at the conference will be of papers accepted by the Transactions of the ACL (TACL) and by the Computational Linguistics (CL) ...
Added: December 24, 2024
27th International Conference, IMS 2024, St. Petersburg, Russia, June 24–26, 2024, Selected Papers. Internet and Modern Society. Human-Computer Communication. CCIS, volume 2534
Springer, 2025.
International conference “Internet and Modern Society” (IMS-2024) is mainly organized by ITMO University, held in St. Petersburg, during the Information Society Week. Important tasks of the IMS-2024 are contribution to the formation of specialists’ international community and promotion of research and development in the field of information society technologies. ...
Added: November 29, 2024