Classification of Texts by Genre Using Machine Learning Algorithms
Научно-техническая информация. Серия 2: Информационные процессы и системы. 2018. No. 8. P. 34-38.
Builova N.
This review examines the problem of classifying documents by genre. It highlights the main text features used to recognize genre and describes the most widely used machine learning algorithms. The methods considered are used to classify scientific, technical, journalistic, and literary texts.
Toldova S., Lyashevskaya O., Вопросы языкознания 2014 No. 1 P. 120-145
This paper is an overview of current issues and trends in computational linguistics. The overview is based on the materials of the COLING’2012 conference on computational linguistics. Modern approaches to traditional NLP domains such as POS tagging, syntactic parsing, and machine translation are discussed. The highlights of automated information extraction, such as fact extraction, ...
Added: October 15, 2013
Romanov A., Ломотин К. Е., Козлова Е. С., Информационные технологии 2017 Vol. 23 No. 6 P. 418-423
The paper examines the applicability of modern machine learning methods to the problem of automatically generating UDC codes for scientific articles. Artificial neural networks, logistic regression, and boosting are considered as classifiers. Graph algorithms and a prototype software module for generating UDC codes are designed. ...
Added: July 30, 2017
Bonch-Osmolovskaya A. A., Вопросы языкознания 2016 No. 2 P. 100-120
The article reviews recent studies in which a theoretical research problem is solved using methods or tools from computational linguistics. The review analyzes in detail how applying a particular tool or method can yield new knowledge about the nature of language. In particular, two main directions are identified, whose development within ...
Added: April 14, 2015
Kibrik A. A., Khudyakova M., Dobrov G. B. et al., Frontiers in Psychology 2016 Vol. 7 No. 1429 P. 1-21
We report a study of referential choice in discourse production, understood as the choice between various types of referential devices, such as pronouns and full noun phrases. Our goal is to predict referential choice, and to explore to what extent such prediction is possible. Our approach to referential choice includes a cognitively informed theoretical component, ...
Added: September 28, 2016
Фокина М. А., Политическая лингвистика 2014 No. 4(50) P. 188-193
The article describes the distinctive features of politicians' blogs in terms of topic, communicative aim, author’s image, communicative future, and linguistic expression, with a special emphasis on the functioning of precedent statements. It is argued that politicians' blogs should be granted the status of a special genre variety. From the point of ...
Added: October 24, 2014
Tuliakova N. A., Вестник Ишимского государственного педагогического университета им. П.П. Ершова 2013 No. 1 (7) P. 29-35
The article defines the genre canon of the legend within Mamin-Sibiryak's system of genres. ...
Added: April 18, 2013
Kolmogorova A., Калинин А. А., Маликова А. В. et al., Saratov : Ай Пи Ар Медиа, 2022
The monograph highlights the application in linguistics of several end-to-end technologies listed in the National Program "Digital Economy of the Russian Federation", such as big data storage and analysis technologies and artificial intelligence. Opinion extraction and emotion detection are topics in high demand in modern research carried out at the intersection of linguistics ...
Added: October 30, 2022
Association for Computational Linguistics, 2014
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics held 26–30 April 2014 in Gothenburg, Sweden. ...
Added: October 24, 2013
Berlin : Association for Computational Linguistics, 2016
The 2016 Conference on Computational Natural Language Learning is the twentieth in the series of annual meetings organized by SIGNLL, the ACL special interest group on natural language learning. CoNLL 2016 will be held on August 11-12, 2016, and is co-located with the 54th annual meeting of the Association for Computational Linguistics (ACL) in Berlin, ...
Added: November 12, 2016
Smetanin S., in: Компьютерная лингвистика и интеллектуальные технологии: по материалам ежегодной международной конференции «Диалог» (Moscow, June 17–20, 2020). Issue 19(26): supplementary volume. 2020. P. 1149-1159.
Added: November 30, 2020
Alieva O., Schole. Философское антиковедение и классическая традиция 2022 Vol. 16 No. 2 P. 693-705
This paper tests the effectiveness of Burrows’ Delta method on a corpus of selected prose writings in ancient Greek. When tested on corpora of fourteen and eight authors, the method yields good results with relatively small samples (1000, 3000, and 5000 words) and different word frequency vectors (100, 200, 500 words), but its performance ...
Added: February 9, 2022
Alieva O., Аристей. Aristeas: Вестник классической филологии и античной истории 2022 Vol. 25 P. 19-37
This paper considers the possibility of quantitative measurement of Plato’s style using Burrows’ Delta. The author concludes that the result is not very informative if a limited number of author’s profiles, each corresponding to the known or assumed fluctuations in the author’s style, is used for machine classification. Instead, minimal Delta distances calculated for the ...
Added: December 15, 2021
Vyrenkova A. S., Смирнов И. Ю., Вестник Новосибирского государственного университета. Серия: Лингвистика и межкультурная коммуникация 2021 Vol. 19 No. 3 P. 57-68
Learner corpora are among the most valuable sources of statistical data on learners' errors. For instance, data from foreign-language learner corpora can be used in second language acquisition research. However, a corpus's representativeness strongly depends on the quality of its error markup, which is most frequently carried out manually and thus presents a ...
Added: September 24, 2021
Braslavski P., Karpov Nikolay, Worring M. et al., ACM SIGIR Forum 2014 Vol. 48 No. 2 P. 105-110
The 8th Russian Summer School in Information Retrieval (RuSSIR 2014) was held on August 18-22, 2014 in Nizhniy Novgorod, Russia. The school was co-organized by the National Research University Higher School of Economics and the Russian Information Retrieval Evaluation Seminar (ROMIP) ...
Added: August 22, 2015
Kuznetsov I., Научно-техническая информация. Серия 2: Информационные процессы и системы 2012 No. 12
The paper presents a brief overview of theoretical and practical models used for the automatic labeling of semantic arguments. It considers the notion of a semantic role and the place of this construct in the overall language system, the problem of compiling role inventories, and various aspects of implementing automatic argument labeling with machine learning algorithms. It also considers a number of problems associated with this field, in particular the problem of the "language dependence" of modern ...
Added: December 23, 2013
Braslavski P., Markov I., Pardalos P. M. et al., ACM SIGIR Forum 2016 Vol. 49 No. 2 P. 72-79
This paper provides the reader with a report on the 9th Russian Summer School in Information Retrieval (RuSSIR 2015). ...
Added: February 27, 2017
Malafeev A., Nikolaev K., in: Analysis of Images, Social Networks and Texts. 8th International Conference, AIST 2019, Kazan, Russia, July 17–19, 2019, Revised Selected Papers. Communications in Computer and Information Science. Vol. 1086. Springer, 2020. P. 154-159.
In this paper, we study deep learning methods for a new multiclass text classification problem: identifying user interests from text messages. We used an original dataset of almost 90 thousand forum text messages, labeled for ten interests. We experimented with different modern neural network architectures: recurrent and convolutional, as well as simpler ...
Added: November 7, 2019
Sergey Smetanin, Mathematics 2022 Vol. 10 No. 16 Article 2947
Policymakers and researchers worldwide are interested in measuring the subjective well-being (SWB) of populations. In recent years, new approaches to measuring SWB have begun to appear, using digital traces as the main source of information, and show potential to overcome the shortcomings of traditional survey-based methods. In this paper, we propose the formal model for ...
Added: August 15, 2022
Lavrova A. A., Вестник Воронежского государственного университета. Серия: Лингвистика и межкультурная коммуникация 2013 No. 1 P. 134-139
The article deals with structures containing a break in a syntactic unit in presidential debates. Based on a comparative analysis of such structures and other types of segmentation in presidential debates and affective speech, correlations between the syntactic speech characteristics and the affective and pseudo-affective constituents of ...
Added: October 6, 2012
Potsar A. N., Медиаскоп 2012 No. 1
The Russian media environment is still undergoing technological and linguistic renewal. Under the influence of public politics, conducted mostly through the mass media, the journalistic genre of the column is being transformed. For a non-professional columnist, social position appears to be more important than literary ability and genre structure. This paper points out that ...
Added: March 15, 2013
Levchenko J., Новое литературное обозрение 2014 Vol. 128 No. 4 P. 125-143
The present paper is devoted to the transformations of the Russian Formalist theory of literature just after its declared cancellation in the well-known, notorious article "A Monument for Scientific Error" published by Victor Shklovsky in December 1930. Many researchers (from Richard Sheldon to Alexander Galushkin) share the opinion that the article was an ostensible gesture which ...
Added: May 6, 2014
Naidenova X., Ignatov D. I., Hershey : IGI Global, 2012
The consideration of symbolic machine learning algorithms as an entire class will make it possible, in the future, to generate algorithms, with the aid of some parameters, depending on the initial users’ requirements and the quality of solving targeted problems in domain applications.
Diagnostic Test Approaches to Machine Learning and Commonsense Reasoning Systems surveys, analyzes, and ...
Added: December 3, 2012
Khotinskaya A. I., Вопросы филологических наук 2005 No. 4 P. 62-68
A study of the problem "J. S. Le Fanu and the English sensation novel", drawing on the cultural context in which the English sensation novel took shape, its reception by Victorian critics, and material from the Victorian periodicals in which sensation novels were published. ...
Added: December 16, 2012
Alieva O., Вестник Православного Свято-Тихоновского гуманитарного университета. Серия 3: Филология 2011 No. 3 (25) P. 23-36
The paper revises the term paraenesis which has attracted a lot of scholarly attention recently. Drawing on a wide range of philosophical and rhetorical material the author argues for a more formal understanding of this literary phenomenon and outlines some formal criteria for it. Special attention is paid to the influence of sacred texts (Pythagoras ...
Added: April 1, 2013