О перспективах филологического корпуса

Publications

?

О перспективах филологического корпуса

Труды Отделения историко-филологических наук РАН. 2016. С. 145-155.

Orekhov B.

Research target: Philology and Linguistics

Language: Russian

Full text

Keywords: корпусная лингвистика

Russian CliPS corpus as a clinical tool for aphasiologists

Khudyakova M., Akinina Y., Bergelson M. et al., Aphasiology 2018

In this paper we present a multimedia corpus of Pear film retellings by people with aphasia, right hemisphere damage, and healthy speakers of Russian. Discourse abilities of brain-damaged individuals are still being a matter of discussion, and Russian CliPS was created for the thorough analysis of micro- and macro-linguistic levels of narratives by PWA and ...

Added: September 29, 2016

О принципах нормализации тематической разметки Корпуса русского рассказа XX века

Kirina M., Социо- и психолингвистические исследования 2023 № 11 С. 28-38

The article discusses the problem of normalization of the thematic annotation of the Corpus of Russian Short Stories of the 20th century. The aim of the research was to develop a methodology that combines linguistic and literary approaches to text analysis, in order to standardize the "theme" parameter, identified by expert. The study proposes to ...

Added: December 10, 2023

Еще раз об исследовательском потенциале поэтического корпуса: метр, лексика, формула

Orekhov B., Труды института русского языка им. В.В. Виноградова 2015 № 6 С. 449-463

The article continues the trend of other researchers’ publications that demonstrate the opportunities of the poetic subcorpus of the Russian National corpus. The question is, what issues related to the history of Russian poetry can be solved with the help of the corpus. In the first part of the article there is a pilot study ...

Added: March 16, 2016

Автоматическое определение частей речи для русского языка с помощью обучения трансформаций.

Kitov V. V., Научные труды Вольного экономического общества России 2014 Т. 186 С. 228-235

This paper describes the application of well-known «transformation-based learning» algorithm of automatic rule generation for the task of part-of-speech tagging. Algorithm is applied to corpora of annotated Russian texts and accuracy as well as most significant rules are shown. ...

Added: March 16, 2016

Компьютерная лингвистика и интеллектуальные технологии. По материалам ежегодной международной конференции «Диалог». Вып. 22. Дополнительный том

[б.и.], 2023

Сборник включает 17 докладов международной конференции по компьютерной лингвистике и интеллектуальным технологиям «Диалог 2023», представляющих широкий спектр теоретических и прикладных исследований в области описания естественного языка, моделирования языковых процессов, создания практически применимых компьютерных лингвистических технологий. Для специалистов в области теоретической и прикладной лингвистики и интеллектуальных технологий. ...

Added: September 14, 2023

Computational Linguistics and Intellectual Technologies. Papers from the Annual International Conference “Dialogue” (2015)

M. : Russian State University for the Humanitie, 2015

Added: April 28, 2015

О способах и средствах выражения страха в русской языковой картине мира

Botchkarev A., Вестник Новосибирского государственного университета. Серия: Лингвистика и межкультурная коммуникация 2016 Т. 14 № 3 С. 5-14

This article explores the ways of displaying fear in the Russian language image of the world. According to the National Corpus of the Russian language, in its most usual manifestation, fear covers and paralyzes; this distressing emotion is caused by somebody, apprehension to lose something or somebody as well as by exposure to an imminent ...

Added: November 28, 2016

После, через, спустя во временны́х контекстах: из наблюдений над текстами казахско-русских билингвов

Rakhilina E. V., Казкенова А. К., Akhapkina Y., Вестник Томского государственного университета. Филология 2021 Т. 73 С. 93-113

Рассматриваются случаи нестандартного употребления казахско-русскими билингвами предлогов после, через и спустя во временны́х контекстах. Доказывается, что отклонения обусловлены грамматическими различиями между родным и русским языками. Анализ отклонений выявил специфические черты предлогов: способность указывать на завершение событий и отрезков времени, как единичных, так и повторяющихся, а также неоднозначность через в составе сочетаний с названиями разных временны́х интервалов. ...

Added: December 1, 2021

Экстралингвистические факторы функционирования имён политиков в диалектной коммуникации (корпусное исследование)

Светлана Сергеевна Земичева, Попова Д. П., Вестник Томского государственного университета 2023 № 486 С. 17-28

На основе контент-анализа материалов Томского диалектного корпуса и Корпуса бассейна реки Устья Архангельской области (всего более 3 000 000 словоупотреблений) описаны черты восприятия политических деятелей, обусловленные спецификой народно-речевой культуры: эгоцентризм, персонализм, патернализм. Влияние социально-политического фактора проявляется в изменении частотности имен, трансформации оценки власти в целом (при переходе от советского периода к постсоветскому) и отдельных политиков. ...

Added: May 6, 2023

Using TXM Platform for Research on Language Changes over Time: The Dynamics of Vocabulary and Punctuation in Russian Literary Texts

Lavrentiev A. M., Sherstinova T., Chepovskiy A. et al., Vestnik Tomskogo Gosudarstvennogo Universiteta, Filologiya 2021 Vol. 70 P. 69-89

The purpose of this paper is to test the methodological tools provided by TXM platform for research on dynamics of vocabulary and punctuation marks in diachronic corpora. TXM is a powerful text analysis software which provides both quantitative and qualitative features in a transparent open-source implementation. In this paper, we demonstrate how it can be ...

Added: June 24, 2021

Adverbial phrases in Hasidic Yiddish

Arkhangelskiy T., Panova T., International Journal of the Sociology of Language 2014

The purpose of our study is to investigate the lexicalization of so-called adverbial phrases, such as fun a mol, in modern Hasidic Yiddish in comparison with written literary Yiddish of the 20th century. The phenomenon in question is a historical process in which several lexemes forming a frequent collocation (including nouns, adjectives, adverbs, prepositions and ...

Added: December 11, 2014

Статистика языка

Piperski A., Квант 2019 № 11 С. 9-16

В данной статье рассматриваются применения математики в компьютерной и корпусной лингвистике. ...

Added: January 16, 2020

Психологические основы обучения иностранному языку специальности с опорой на языковой корпус

Gorina O. G., Вестник Ленинградского государственного университета имени А.С. Пушкина. Серия: Экономика 2014 Т. 7 № 1 С. 172-179

The article deals with pedagogical and psychological grounds for using corpora in the classroom. The description of psychological principles behind data-driven approach and discovery learning with corpus data is given. The cognitive studies such as schemata theory and personal-construct theory are observed to show the advantages of concordance techniques. Some examples of new types of ...

Added: February 14, 2017

Когнитивный термин «фрейм»: создание словарной статьи на базе специализированного текстового корпуса

Khomenko A., Куликова В. А., Babiy A. et al., Вестник Новосибирского государственного университета. Серия: Лингвистика и межкультурная коммуникация 2022 Т. 20 № 4 С. 17-34

The study is devoted to the testing of a specialized texts corpus on the example of a group of cognitive linguistics terms with the hypernym frame. The corpus includes a subcorpus of scientific texts and a subcorpus of journalistic texts. The first one is represented by 15 journals indexed in the RSCI; the second one ...

Added: November 17, 2022

Russian Minority Languages on the Web: Descriptive Statistics

Orekhov B., Krylova I., Popov I. et al., Компьютерная лингвистика и интеллектуальные технологии 2016 No. 15 (22) P. 452-461

Статья о малых языках России в Интернете ...

Added: November 7, 2017

Опыт использования корпусных онлайн инструментов и Google форм при подготовке к государственному экзамену по английскому языку студентов бакалавров факультета БИ НИУ ВШЭ

Kuzmina T. A., Ученые записки национального общества прикладной лингвистики 2013 № 1(1) С. 26-35

В работе представлены результаты изучения инструментов корпусной лингвистики, которые представляют широкий спектр возможностей для развития навыков академического письма. Исследуются некоторые практические аспекты названных технологий, которые позволяют существенно улучшить точность и правильность изложения материала. ...

Added: April 24, 2013

Referential Choice: Predictability and Its Limits

Kibrik A. A., Khudyakova M., Dobrov G. B. et al., Frontiers in Psychology 2016 Vol. 7 No. 1429 P. 1-21

We report a study of referential choice in discourse production, understood as the choice between various types of referential devices, such as pronouns and full noun phrases. Our goal is to predict referential choice, and to explore to what extent such prediction is possible. Our approach to referential choice includes a cognitively informed theoretical component, ...

Added: September 28, 2016

Материалы 21-й Международной конференции по компьютерной лингвистике "Диалог"

М. : Изд-во РГГУ, 2015

Сборник содержит труды 21-й Международной конференции по компьютерной лингвистике. ...

Added: May 20, 2015

Корпусный анализ русского стиха

М. : Азбуковник, 2013

В настоящий сборник вошли статьи, подготовленные с использованием материалов поэтического корпуса Национального корпуса русского языка. Авторы статей прослеживают на обширном материале историю отдельных слов в языке поэзии, анализируют разные аспекты поэтической грамматики и семантики, рассматривают некоторые формальные параметры русского стиха. Сборник предназначен для специалистов в области лингвистической поэтики, стиховедения, а также для тех, кто интересуется современными ...

Added: September 28, 2013

Прагматические маркеры предикативного типа в устной спонтанной речи представителей разных социальных групп

Zaides K., Социо- и психолингвистические исследования 2020 № 8 С. 40-47

В статье рассматриваются особенности употребления прагматических маркеров предикативного типа (знаешь/те, (я) не знаю, (я) (не) думаю (что), представь/те и т. п.) в устной спонтанной речи представителей разных социальных групп. Материалом для исследования послужил рабочий подкорпус, сформированный из 150 000 токенов корпуса повседневной русской речи (фактически – диалогов) «Один речевой день» и 150 000 токенов корпуса ...

Added: February 3, 2022

Literature, Language and Computing: Russian Contribution

Springer, 2023

This book brings together selected revised papers representing a multidisciplinary approach to language and literature. The collection presents studies performed using the methods of computational linguistics in accordance with the traditions of Russian linguistic and literary studies, primarily in line with the Leningrad (Petersburg) philological school. The book comprises the papers allocated into 2 sections ...

Added: September 15, 2023

Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022)

Marseille : European Language Resources Association (ELRA), 2022

The proceedings are organised on the basis of the 22 Tracks of the Conference on Language Resources and Evaluation (LREC) held in Marseille, France, from 20 to 25 June 2022. Major topics include corpora and annotation (including tools, systems, treebanks), information extraction and information retrieval (including ner, qa, text mining, document classification, text categorisation), applications involving lrs and evaluation (including ...

Added: February 22, 2023

Прогностическая валидность глагольных форм длительного аспекта в корпусной лингвистике английского языка

Popkova E., Социосфера 2010 № 4 С. 74-81

The article discusses the most recent trends in the development of the English progressive. A corpus-based approach to linguistic research is seen as an effective means of determining reliability of the data retrieved and helps track the major diachronic dynamic in the increasing frequency of the progressive aspect that has taken place since the beginning ...

Added: November 6, 2012

Конструирование образа Перми в комментариях социальных медиа

Matkin N., Культура и технологии 2021 Т. 6 № 1 С. 26-32

There were a lot of changes during 2019 and 2020 in Perm such as transport reform, zoo construction, change of governor and mayor. All changes reflect on the image of the city, which is constructing in the residents’ mind. From one hand the image of the city is formed by media, on the other hand ...

Added: October 23, 2021