Semantic verification of statistically based hypotheses: Russian verbs of volition хотеть and хотеться

A. A. Bonch-Osmolovskaya

?

Semantic verification of statistically based hypotheses: Russian verbs of volition хотеть and хотеться

НИУ ВШЭ , 2015. No. WP BRP 22/LNG/2015.

This paper investigates the semantic distribution of two morphological forms of the Russian verb of volition хотеть. Three hypotheses for semantic grounds for the grammatical distribution are proposed. The hypotheses are checked by statistical analysis of distribution of the infinitive made on the data obtained from the National Russian Corpus. The analysis reveals groups of verbs which show more deviation in their behavior when used with one of the volition verbs than would be expected. The results of the analysis do not favor any of the preliminary hypotheses. A close-up look at the semantics of the constructions with the infinitives that show non-standard behavior helps to understand the semantic opposition realized by the two constructions.

Priority areas: humanitarian

Language: English

Full text

Keywords: корпусная лингвистика corpus linguistics linguistic semantics лингвистическая семантика статистические методы в лингвистике quantitative methods in linguistics

Publication based on the results of:

Применение методов автоматического анализа естественного языка для теоретического исследования семантико-грамматических конструкций в русском языке (2014)

Квантитативные методы в диахронических корпусных исследованиях: конструкции с предикативами и дативным субъектом

Bonch-Osmolovskaya A. A., Компьютерная лингвистика и интеллектуальные технологии 2015 Т. 1 № 14(21) С. 80-95

The paper proposes new approaches to the problem of Russian dative subjects in predicative and adjective constructions. The core idea of the research is to study the distribution of dative subject constructions with predicative and adjective forms that potentially can be used in such constructions. The methodological novelty of the approach is manifested in the ...

Added: April 15, 2015

Славный корабль - омулевая бочка... К микро-истории семантических переходов

Rakhilina E. V., Ryzhova D., Труды института русского языка им. В.В. Виноградова 2019 № 20 С. 241-256

The paper proposes a corpus analysis of a Russian adjective slavnyj. Its semantic evolution is analyzed through its distribution in XVIII-XXI centuries texts, including the main types of its usages, its main meanings, and possible shifts from one meaning to another. It is shown that the initial semantics of ‘being famous’ that the adjective slavnyj ...

Added: October 17, 2017

Корпус в обучении иностранному языку (на материале английского языка)

Gorina O. G., СПб. : Свое Издательство, 2014

В настоящем издании наглядно иллюстрируются широкие лингводидактические возможности корпусной лингвистики при обучении профессионально-ориентированному общению на английском языке. Обширный языковой материал специально разработанного корпуса профессионального дискурса и других корпусных ресурсов лег в основу вариативных упражнений, заданий, исследований, которые использовались для развития лексических навыков в устной и письменной речи студентов специальности «Регионоведение». Рекомендуется специалистам – филологам, лингводидактам, ...

Added: February 20, 2017

Russian constructions with the reflexive pronoun sebia

Smolovskaya E., Eremina O., / НИУ ВШЭ. Series WP BRP "Linguistics". 2015.

Added: October 19, 2015

Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10), Valletta, Malta, 17-23 May 2010

Valletta : ELRA, 2010

Added: December 17, 2012

Leo philologiae. Фестшрифт в честь 70-летия Льва Иосифовича Соболева

М. : [б.и.], 2016

Anniversary collection of articles in honor of L.I.Sobolev includes works by his disciples and colleagues covering a broad range of the phililogical issues: the problems of Russian literature, European literature of the Middle Ages and of the 19 -20 centures, corpus linguistics, linguistic analysis of the literary texts, the questions of teaching of Russian literature ...

Added: December 13, 2016

Using TXM Platform for Research on Language Changes over Time: The Dynamics of Vocabulary and Punctuation in Russian Literary Texts

Lavrentiev A. M., Sherstinova T., Chepovskiy A. et al., Vestnik Tomskogo Gosudarstvennogo Universiteta, Filologiya 2021 Vol. 70 P. 69-89

The purpose of this paper is to test the methodological tools provided by TXM platform for research on dynamics of vocabulary and punctuation marks in diachronic corpora. TXM is a powerful text analysis software which provides both quantitative and qualitative features in a transparent open-source implementation. In this paper, we demonstrate how it can be ...

Added: June 24, 2021

TaLC 12 - Teaching and Language Corpora Conference

[б.и.], 2016

Various issues relating to the questions of learner corpus researches and their use in teaching are presented. These include the issue of a norm in corpora whether the norm should necessarily be native and what problems a native norm may present. Learners who behave differently from native speakers do not necessarily use language incorrectly as ...

Added: December 10, 2016

Корпусные инструменты в грамматических исследованиях русского языка

Lyashevskaya O., М. : Языки славянской культуры, 2016

Corpus linguistics can be broadly defined in terms of two partially overlapping research dimensions . On the one hand, corpus linguistics is knowledge of how to compile and annotate linguistic corpora. On the other hand, corpus linguistics is a family of qualitative and quantitative methods of language study based on corpus data. The book presents ...

Added: March 26, 2015

Международная конференция «Slavicorp»

Orekhov B., Вопросы языкознания 2011 № 3 С. 153-155

The article deals with the conference «Slavicorp» in Warsaw in November 2010. ...

Added: September 28, 2013

Материалы к корпусной грамматике русского языка

СПб. : Издательство Нестор-История, 2018

The volume is the third issue of a corpora-based grammar of Russian. The volume deals with the issues of parts of speech and, more generally, with formal classes of lexicon, It comprises descriptive papers of separate POS and lesser world classes. ...

Added: November 4, 2018

О способах и средствах выражения чувства удовлетворения в русском языке

Botchkarev A., Известия Уральского федерального университета. Серия 2: Гуманитарные науки 2017 Т. 19 № 3 С. 191-202

This article explores the ways and means of expressing satisfaction in the Russian language in order to show how the analysed emotional concept varies in everyday cognitive and volitional states of consciousness. According to the Russian National Corpus, satisfaction may be moral, mental, physical, sexual, narcissistic, sadistic or masochistic by nature; profound, deep, full or ...

Added: October 27, 2017

Proceedings of EURALEX 15 (7-11 August 2012, Oslo, Norway)

Oslo : Oslo University, 2012

Added: December 10, 2012

Еще раз об исследовательском потенциале поэтического корпуса: метр, лексика, формула

Orekhov B., Труды института русского языка им. В.В. Виноградова 2015 № 6 С. 449-463

The article continues the trend of other researchers’ publications that demonstrate the opportunities of the poetic subcorpus of the Russian National corpus. The question is, what issues related to the history of Russian poetry can be solved with the help of the corpus. In the first part of the article there is a pilot study ...

Added: March 16, 2016

Computational Linguistics and Intellectual Technologies. Papers from the Annual International Conference “Dialogue” (2015)

M. : Russian State University for the Humanitie, 2015

Added: April 28, 2015

Looking for contextual cues to differentiating modal meanings: A corpus-based study

Lyashevskaya O., Ovsjannikova M., Szymor N. et al., , in : Quantitative approaches to the Russian language. : Abingdon : Routledge, 2018. P. 51-78.

The domain of modality is structurally diverse and may be described in multiple ways (for example, see Perkins, 1983; Wierzbicka, 1987; Hengeveld, 1988/2004; Sweetser, 1990; Bondarko, 1990; Bybee et al., 1994; van der Auwera and Plungian, 1998; Palmer, 2001; Hansen, 2004; Nuyts, 2006; Khrakovsky, 2007). The article reports on the Russian part of a larger survey ...

Added: October 24, 2017

Прогностическая валидность глагольных форм длительного аспекта в корпусной лингвистике английского языка

Popkova E., Социосфера 2010 № 4 С. 74-81

The article discusses the most recent trends in the development of the English progressive. A corpus-based approach to linguistic research is seen as an effective means of determining reliability of the data retrieved and helps track the major diachronic dynamic in the increasing frequency of the progressive aspect that has taken place since the beginning ...

Added: November 6, 2012

Adverbial phrases in Hasidic Yiddish

Arkhangelskiy T., Panova T., International Journal of the Sociology of Language 2014

The purpose of our study is to investigate the lexicalization of so-called adverbial phrases, such as fun a mol, in modern Hasidic Yiddish in comparison with written literary Yiddish of the 20th century. The phenomenon in question is a historical process in which several lexemes forming a frequent collocation (including nouns, adjectives, adverbs, prepositions and ...

Added: December 11, 2014

О способах и средствах выражения страха в русской языковой картине мира

Botchkarev A., Вестник Новосибирского государственного университета. Серия: Лингвистика и межкультурная коммуникация 2016 Т. 14 № 3 С. 5-14

This article explores the ways of displaying fear in the Russian language image of the world. According to the National Corpus of the Russian language, in its most usual manifestation, fear covers and paralyzes; this distressing emotion is caused by somebody, apprehension to lose something or somebody as well as by exposure to an imminent ...

Added: November 28, 2016

Russian challenges for quantitative research

Kopotev M., Lyashevskaya O., Mustajoki A., , in : Quantitative approaches to the Russian language. : Abingdon : Routledge, 2018. P. 3-29.

The Russian language, despite being one of the most studied in the world, until recently has been little explored quantitatively. After a burst of research activity in the years 1960–1980, quantitative studies of Russian vanished. They are now reappearing in an entirely different context. Today, we have large and deeply annotated corpora available for extended ...

Added: October 24, 2017

Referential Choice: Predictability and Its Limits

Kibrik A. A., Khudyakova M., Dobrov G. B. et al., Frontiers in Psychology 2016 Vol. 7 No. 1429 P. 1-21

We report a study of referential choice in discourse production, understood as the choice between various types of referential devices, such as pronouns and full noun phrases. Our goal is to predict referential choice, and to explore to what extent such prediction is possible. Our approach to referential choice includes a cognitively informed theoretical component, ...

Added: September 28, 2016

Труды международной конференции "Корпусная лингвистика - 2019"

СПб. : Издательство Санкт-Петербургского университета, 2019

Сборние содержит материалы докладов, представленных на Международной научной конференции "Корпусная лингвистика-2019" 24-28 июня 2019 г. в Санкт-Петербурге. ...

Added: July 8, 2019

Корпусный анализ русского стиха

М. : Азбуковник, 2013

В настоящий сборник вошли статьи, подготовленные с использованием материалов поэтического корпуса Национального корпуса русского языка. Авторы статей прослеживают на обширном материале историю отдельных слов в языке поэзии, анализируют разные аспекты поэтической грамматики и семантики, рассматривают некоторые формальные параметры русского стиха. Сборник предназначен для специалистов в области лингвистической поэтики, стиховедения, а также для тех, кто интересуется современными ...

Added: September 28, 2013

Новый комплекс инструментов автоматической обработки текста для платформыTXM и его апробация на корпусе для анализа экстремистских текстов

Лаврентьев А. М., Соловьев Ф. Н., Суворова М. И. et al., Вестник Новосибирского государственного университета. Серия: Лингвистика и межкультурная коммуникация 2018 Т. 16 № 3 С. 19-31

ПлатформаTXM предоставляет широкие возможности корпусного анализа, такие как анализ соответствий, кластеризация, построение лексических таблиц, поиск сложных лексических конструкций, выделение подкорпу-сов по различным параметрам. По умолчанию платформа работает со словоупотреблениями в качестве структур-ных единиц анализа. Она интегрирована с единственным расширениемTreeTagger, позволяющим проводить лишь морфологический анализ и лемматизацию словоупотреблений. Однако пользователь может сопроводить каждое словоупотребление набором дополнительных характеристик, ...

Added: September 8, 2018