Proto-Indo-European-Uralic Comparison from the Probabilistic Point of View

G. Starostin; Kassian A.; Zhivlov M.

?

Proto-Indo-European-Uralic Comparison from the Probabilistic Point of View

The Journal of Indo-European Studies. 2015. Vol. 43. No. 3/4. P. 301–347.

Starostin G., Kassian A., Zhivlov M.

In this paper we discuss the results of an automated compari-son between two 50-item groups of the most generally stable elements on the so-called Swadesh wordlist as reconstructed for Proto-Indo-European and Proto-Uralic. Two forms are counted as potentially related if their first two consonantal units, transcribed in simplified consonantal class notation (a rough variant of the Levenshtein distance method), match up with each other. Next to all previous attempts at such a task (Ringe 1998; Oswalt 1998; Kessler & Lehtonen 2006; Kessler 2007), our automated algorithm comes much closer to emu-lating the traditional procedure of cognate search as em-ployed in historical linguistics. “Swadesh slots” for protolan-guages are filled in strict accordance with such principles of reconstruction as topology (taking into consideration the structure of the genealogical tree), morphological transpar-ency, typology of semantic shifts, and areal distribution of particular items. Altogether we have counted 7 pairs where Proto-Indo-European and Proto-Uralic share the same bi-consonantal skeleton (the exact same pairs are regarded as cognates in traditional hypotheses of Indo-Uralic relation-ship). To verify the probability of arriving at such a result by chance we have applied the permutation test, which yielded a positive result: the probability of 7 matched pairs is equal to 1.9% or 0.5%, depending on the constituency of the conso-nantal classes, which is lower than the standard 5% threshold of statistical significance or even lower than the strong 1% level. Standard methodology suggests that we reject the null hypothesis (accidental resemblance) and offer a more plau-sible explanation for the observed similarities. Since the known typology of language contacts does not speak in favor of explaining the observed Indo-Uralic matches as old lexical borrowings, the optimal explanation is seen in the hypothesis of an Indo-Uralic genetic relationship, with the 7 matching pairs in question representing archaic retentions, left over from the original Indo-Uralic protolanguage.

Research target: Philology and Linguistics

Priority areas: humanitarian

Language: English

Text on another site

Keywords: Indo-European languages lexicostatistics Historical Linguistics Indo-Uralic hypothesis

Между дилетантизмом и диссидентством: переводы рассказов Бориса Виана в «Митином журнале»

Balakireva M., Новое литературное обозрение 2026 № 2 (198) С. 225–237

The article focuses on the study of unofficial translations from French, specifically the translation of Boris Vian’s short stories, published in «Mitin Journal». By examining the features of these translations, we can better understand the role of language in samizdat and rethink the position of the unofficial translator, who is opposed to the official translator ...

Added: June 1, 2026

Анализ культурных референций в творчестве А. Вознесенского: цифровое исследование имен персоналий

Tyuryakova-Matveeva D., Цифровые гуманитарные исследования 2026 № 1 С. 4–26

The article explores cultural references in the works of Andrei Voznesensky by analyzing the personalities he mentions. A total of 1,678 works were processed, including poetry, prose, and early unpublished poems. NER methods based on Natasha, spaCy, and LLM Grok tools made it possible to study the frequency of mentions of famous people and their ...

Added: May 31, 2026

ФУНКЦИОНИРОВАНИЕ ВИДА ГЛАГОЛОВ В НАУЧНЫХ ТЕКСТАХ ПОСТНЕКЛАССИЧЕСКОГО ПЕРИОДА

Ильина Е. А., Международный научно-исследовательский журнал 2024 № 6(144)

The present article examines the issue of the changing nature of the scientific style of speech of the Russian literary language in the postnonclassical period. It is shown that in scientific speech works (scientific articles on three branches: artificial intelligence, synergetics and cosmology) of the present period the tendency to choose the most abstract grammatical ...

Added: May 31, 2026

УПОТРЕБЛЕНИЕ ОТЫМЕННЫХ ПРЕДЛОГОВ В НАУЧНЫХ ТЕКСТАХ ПОСТНЕКЛАССИЧЕСКОГО ПЕРИОДА

Ильина Е. А., Russian linguistic Bulletin 2024 № 12 С. 1–7

The present article examines the issue of the changing nature of the scientific style of speech of the Russian literary language in the postnonclassical period. The analysis of denominative prepositions in scientific speech works (scientific articles on three branches: artificial intelligence, synergetics and cosmology) has demonstrated the continuation of the process of crystallization of one of the most ...

Added: May 31, 2026

Почему растущие доходы не делают людей счастливее: эмоциональное объяснение парадокса Истерлина (Why Growing Incomes Do Not Make People Happier: an Emotional Explanation of the Easterlin Paradox)

Vorchik A., / SSRN. Серия Social Science Research Network "Social Science Research Network". 2026.

This work is devoted to a theoretical explanation of the Easterlin paradox, according to which long-term economic growth does not make average level of people's happiness increasing. By happiness, we mean the intensity of emotions people experience while comparing their new income with its expected value, or the target income with its original value. In the first case, ...

Added: May 31, 2026

Стратегия оперативного информирования адресата в англоязычном жанре футбольного комментария

Тырыгина В. А., Кабанова И. Н., Вестник Нижегородского государственного лингвистического университета им. Н.А. Добролюбова 2023 № 4 С. 180–191

The focus of this article is the genre of European football (soccer) commentary, considered from the point of view of implementing the strategy of informing the addressee. The authors identify and describe the strategy of informing the addressee and its corresponding tactics in the texts of this genre, using video broadcasts of English team football ...

Added: May 29, 2026

Сборник студенческих работ «Восточная перспектива»

М.: ООО «Адвансед солюшнз», 2026.

Данный выпуск сборника студенческих статей .Восточная перспектива. включает в себя статьи победителей и призеров XI Международной научной студенческой конференции "Восточная перспектива", состоявшейся 18 мая 2024 года. В 2024 году на конференцию было подано 115 заявок, офлайн и онлайн в конференции приняли участие докладчики и слушатели из различных вузов России и ближнего и дальнего Зарубежья. ...

Added: May 29, 2026

Сборник студенческих работ «Восточная перспектива»

М.: ООО «Адвансед солюшнз», 2026.

Данный выпуск сборника студенческих статей «Восточная перспектива» включает в себя статьи победителей и призеров X Международной научной студенческой конференции «Восточная перспектива», состоявшейся 15 апреля 2023 года. Юбилейная конференция стала знаковым событием для студентов различных подразделений НИУ ВШЭ и других вузов России, занимающихся подготовкой востоковедческих кадров. ...

Added: May 29, 2026

Litera

NOTA BENE, 2025.

he article is devoted to the description of a voice message in order to introduce a definition of this communicative phenomenon. Despite the contradictory nature of this phenomenon, its popularity has been growing since 2013 to the present day. However, a voice message still does not have a clear definition, as it has specific characteristics ...

Added: May 27, 2026

Лингвистический анализ рекламы парфюма в англоязычном и русскоязычном дискурсах

Gabrielova E., Шевякова Ю. С., Вестник Удмуртского университета 2026 Vol. 36 No. 2 P. 344–354

In today's globalized world, the effectiveness of sales and the success of products largely rely on well-crafted advertising texts. Influenced by this factor and the growing competition, advertising continuously evolves, incorporating various linguistic, psychological, and cross-cultural techniques. This study focuses on the linguistic and stylistic analysis of perfume advertising texts within English and Russian discourses, ...

Added: May 25, 2026

On the Curse Formula in Wˁzb’s Inscription (RIÉ 192 B, ll. 5-9)

Bulakh M., Aethiopica 2025 Vol. 28 P. 39–52

The article deals with the curse formula belonging to the sixth-century inscription by an Aksumite king Wʿzb (RIÉ 192 B, ll. 5–9). After summarizing the extant interpretations, the author proposes a new reading and interpretation, arguing that the text under scrutiny follows the same pattern and employs the same rhetoric devices as the curse formulas ...

Added: May 23, 2026

Practicamos el Subjuntivo

Bocharov Y., M.: -, 2025.

This textbook is designed for students improving their Spanish proficiency at levels B1-B2. It consists of five topics and a selection of texts to reinforce them. The first topic covers the morphology of the four tenses (present, perfect, imperfect, subjunctive perfect) and exercises on the formation of forms. The remaining topics are devoted to exploring ...

Added: May 23, 2026

Эстетика аудиовизуальной журналистики. Учебное пособие. 2-е издание

Бережная М. А., Novikova A., Кирия И. В., КноРус, 2026.

The aesthetics of journalism is substantiated as a necessary component in the professional training of specialists in audiovisual media. The factors and trends of historical and current changes in the aesthetics of journalism are presented, and the aesthetic practices of audiovisual journalism are characterized in terms of their social functioning. Criteria for aesthetic evaluation are ...

Added: May 22, 2026

Juxtapositional vs. possessive-like encoding in Russian specificational constructions

Logvinova N., Russian linguistics 2026 Vol. 50 Article 11

This paper presents the first in-depth corpus-based study of a previously overlooked syntactic variation in Russian: the competition between juxtapositional (Nominative) and possessive-like (Genitive) encoding of the second noun (the term) in specificational constructions (e.g., ponjatie čest’ (notion.NOM honor.NOM) vs. ponjatie česti (notion.NOMhonor.GEN) ‘the notion of honor’). While typological research has established cross-linguistic preferences for one encoding strategy over another, intralinguistic variation ...

Added: May 18, 2026

FOCUS ON VOCABULARY Экономика материальных и нематериальных активов: корпусный словарь и ИИ-упражнения по английскому языку

Gorina O. G., Kucherenko S., Larisa K. et al., St. Petersburg: Asterion, 2026.

This textbook is an integrated teaching and learning resource for English for Specific Purposes (ESP) in the field of economics of tangible and intangible assets. Its design employs (i) modern corpus linguistics methods, including frequency analysis and keyword extraction based on authentic texts reflecting current trends in professional discourse, and (ii) artificial intelligence technologies for ...

Added: May 16, 2026

КОГНИТИВНО-АССОЦИАТИВНОЕ ПОЛЕ ОНИМОВ САНКТ-ПЕТЕРБУРГА И ВЕНЫ

Зелинская Ю. Ю., Когнитивные исследования языка 2025 № 4(65) С. 180–186

The article focuses on the study of the onym as a cognitive stimulus that facilitates the decoding of the language of urban space across two ethnic groups. The research is grounded in the analysis of results from an onomastic associative experiment, aimed at identifying the dominant types of associative responses to anthroponyms, oikodonyms, hodonyms, and ...

Added: May 16, 2026

Лично-числовая асимметрия: согласование пассивных миративов в казымском диалекте хантыйского языка

Starchenko A., Toldova S., Типология морфосинтаксических параметров 2023 Т. 6 № 1 С. 130–148

The study focuses on a previously unrecorded model of split agreement in the mirative paradigm in Kazym Khanty. Split agreement is found when comparing active and passive mirative constructions, as well as in a limited set of uses of non-finite forms. In the passive voice, unlike the active voice, the 3rd person is unmarked and the ...

Added: May 14, 2026

Глаголы перемещения веществ в славянских языках

Fedorov D., Jezikoslovni Zapiski 2026 Т. 32 № 1 С. 23–52

This article describes verbs denoting motion of liquid and dry substances in Slavic languages. The research explores how Slavic languages lexicalize different situations within the semantic field of substance motion and identifies the parameters that drive this lexicalization (e.g., type of substance, intensity and quantization of flow, and causation). Adjacent grammatical phenomena such as argument ...

Added: May 13, 2026

Школьный литературный канон эмиграции 1918–1939 гг.

Strizhkova D., / Институт русской литературы (Пушкинский Дом) РАН. Серия B001 "Репозиторий открытых данных по русской литературе и фольклору". 2026.

В базе данных представлена роспись русскоязычных литературных произведений и отрывков, напечатанных в учебниках по словесности, хрестоматиях, книгах для чтения, сборниках стихотворений и рассказов, выходивших во Франции, Германии, Латвии, Эстонии, Болгарии, Сербии в период первой волны русской эмиграции с 1918 по 1939 гг. Датасет представляет интерес для исследователей школьного литературного канона, эмиграции и детского чтения ...

Added: April 22, 2026

Современная российская мультипликация как инструмент воспитания традиционных духовно-нравственных ценностей

Жигунов А. Ю., / Basic Research Programme. Серия HUM "Humanities". 2026. № 1.

The article attempts to describe the features of the educational potential of Russian animation programmes in aspect of the representation of traditional spiritual and moral values. Based on media and semiotic analysis, the method of cultural and historical interpretation, animated Russian projects created from 2000 to the 2025, which were translated on television channels or streaming ...

Added: April 19, 2026

Lexicostatistical studies in Khoisan III/II: Reconstructing a Swadesh wordlist for Proto-Khoe (items 26–50)

Starostin G., Journal of Language Relationship 2025 No. 23/1-2 P. 97–119

This paper is the second part of a lexicostatistical analysis of the basic lexicon for languages belonging to the Khoe family of South Africa, revised and expanded in comparison to the author’s previously published attempt. This section concentrates on the etymological analysis of the second half of the “ultra-stable” sub-section of the Swadesh wordlist, following ...

Added: November 12, 2025

Политическая аккомодация культурных различий в индустриально развитых обществах (Political Accommodation of Cultural Differences in Industrialized Societies)

Малахов В. С., Симон М. Е., Летняков Д. Э. et al., / SSRN. Серия Social Science Research Network "Social Science Research Network". 2020.

The notion of “political accommodation” applied to the theory and practice of managing cultural diversity could enrich the Russian academic dictionary. Liberal democratic states invented specific mechanisms for political accommodation of cultural differences. Thanks to these mechanisms, the part of the population of a democratic state that is not ready to dissolve into the ethnocultural ...

Added: September 26, 2025

Национальная мощь современных государств: сравнительный анализ. Аналитический доклад

Melville A. Y., Каберник В. В., Mironyuk M. et al., / МГИМО МИД России. 2024.

Данный аналитический доклад является одним из результатов исследований в рамках консорциума НИУ ВШЭ и МГИМО. В нем прежде всего раскрыты вопросы концептуализации национальной мощи и сопутствующих категорий и дается обзор прецедентов. Далее рассматриваются вопросы операционализации предлагаемых нами компонентов национальной мощи. В следующих разделах доклада предлагается анализ вопросов методологии, используемой в докладе. На этой основе предложен ...

Added: September 19, 2025

Quantifying lexical distances among Nudiz, Mahmudi, and Verin Dvin Urmi (North-Eastern Neo-Aramaic)

Shvedova E., Koryakov Y., Elizaveta Zabelina, Journal of Language Relationship 2025 Vol. 23 No. 3–4 P. 207–275

This study documents and analyzes lexical data from four Christian North-Eastern Neo-Aramaic varieties: Mahmudi, Nudiz, Verin Dvin Urmi, and Urmia Urmi, focusing on the previously undescribed Mahmudi and Nudiz. We provide correspondences from these lects for an extended 226-item basic vocabulary list collected for this study with etymologies, cognates from earlier Aramaic, and loanword sources. ...

Added: September 4, 2025