?
Corpus-Based Text Retrieval and Adaptation for Learning System
International Journal of Advances in Computer Science and Its Applications. 2014. Vol. 4. No. 2. P. 38-43.
Karpov N.
The algorithm to adapt lexical complexity in the news article which can be used as materials for learning language presented in the paper. We consider words substitution retrieval according to wordnet-based and corpus-based semantic relatedness. Two corpus-based similarity measures empirically tested: Vector Space Model and Distributional Semantic Model. This language processing algorithm has created as a client-server application. It retrieves appropriate text from Web-resource. Next it performs adaptation procedure.
Publication based on the results of:
M. : Russian State University for the Humanitie, 2019
The book includes 64 papers submitted to the International conference in computer linguistics and intellectual technologies Dialogue 2019 and presents a broad spectrum of theoretical and applied research of natural language description, language simulation, and creation of applied computer technologies. ...
Added: October 16, 2019
Malafeev A., International Journal of Conceptual Structures and Smart Applications (IJCSSA) 2014 Vol. 2 No. 2 P. 20-35
This article presents an approach to the automatic generation of open cloze exercises based on arbitrary English text. The exercise format is similar to the open cloze test used in Cambridge English certificate exams (FCE, CAE, CPE). The presented method also makes it possible to adjust the difficulty of the resulting exercises to better suit ...
Added: November 29, 2014
M. : Russian State University for the Humanitie, 2019
The book includes 61 reports of the International conference on computer and intellectual technology "Dialogue-2019", representing a wide range of theoretical and applied research in the field of natural language description, modeling of language processes, creating practically applicable computer linguistic technologies. For specialists in the field of theoretical and applied linguistics and intellectual technologies. ...
Added: June 12, 2019
Denis Turdakov, Astrakhantsev N., Fedorenko D., Programming and Computer Software 2015 Vol. 41 No. 6 P. 336-349
Applications related to domain specific text processing often use glossaries and ontologies, and the main step of such resource construction is term recognition. This paper presents a survey of existing definitions of the term and its linguistic features, formulates the task definition for term recognition, and analyzes presently-available methods for automatic term recognition, such as ...
Added: August 26, 2016
Klyshinskiy E., Логачёва В. К., Карпик О. В. et al., Вестник Новосибирского государственного университета. Серия: Лингвистика и межкультурная коммуникация 2020 Т. 18 № 1 С. 5-21
The grammatical ambiguity (multiple sets of grammatical features for one word form or coinciding surface forms of different words) can be of different types. We describe six classes of grammatical ambiguity: unambiguous, ambiguous by grammatical features, by part of speech, by lemma, by lemma and part of speech, and out-of-vocabulary words. These classes are presented ...
Added: December 11, 2019
S.D. Kuznetsov, D.Yu. Turdakov, Астраханцев Н. А. et al., Programming and Computer Software 2014 Vol. 40 No. 5 P. 288-295
A framework for fast text analysis, which is developed as a part of the Texterra project, is described. Texterra provides a scalable solution for the fast text processing on the basis of novel methods that exploit knowledge extracted from the Web and text documents. For the developed tools, details of the project, use cases, and ...
Added: November 26, 2017
Klyshinskiy E., Kalachyov Y. B., Zhadnov V. V., Научно-техническая информация. Серия 2: Информационные процессы и системы 2014 № 5 С. 11-15
Рассматривается новый метод автоматизации определения соответствия технического задания и итогового отчета в ходе его приемки. Предложенный метод позволяет экспертам получить предварительную оценку степени соответствия отчета техническому заданию. Используются выделение значимых фрагментов технического задания,поиск соответствующих им элементов отчета и проверка степени его покрытия. Разработанный метод,в отличие, например,от косинусной меры сходства, дает лучшее разделение отчетов по критерию ...
Added: June 30, 2014
P. : European Language Resources Association (ELRA), 2018
Book of abstracts from the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) ...
Added: May 5, 2018
Springer, 2022
“Data Analytics and Management in Data Intensive Domains” conference (DAMDID) is planned as a multidisciplinary forum of researchers and practitioners from various domains of science and research promoting cooperation and exchange of ideas in the area of data analysis and management in data intensive domains. Approaches to data analysis and management being developed in specific data intensive domains of X-informatics (such as X = astro, bio, chemo, geo, medicine, neuro, physics, ...
Added: August 30, 2021
M. : Russian State University for the Humanitie, 2015
Added: April 28, 2015
Berlin, Heidelberg : Springer, 2012
Added: September 21, 2012
Savchenko A., Вестник компьютерных и информационных технологий 2012 № 8 С. 14-19
Ставится задача автоматического построения транскрипции слитной речи. Предложен новый критерий распознавания фонем на основе принципа минимума информационного рассогласования Кульбака-Лейблера и произвольных признаков - оценок спектральной плотности мощности речевого сигнала. Проведено сравнение предложенного критерия с традиционными мерами близости для популярных оценок спектра (периодограмма, авторегрессионная оценка, гребенка полосовых фильтров). Показано, что предложенный критерий характеризуется существенным повышением точности ...
Added: September 14, 2012
Денис Турдаков, Астраханцев Н. А., Недумов Я. Р. et al., Труды Института системного программирования РАН 2014 Т. 26 С. 421-438
he paper presents a framework for fast text analytics developed during the Texterra project. Texterra is a technology for multilingual text mining based on novel text processing methods that exploit knowledge extracted from user-generated content. It delivers a fast scalable solution for text mining without the expensive customization. Depending on use-cases Texterra could be utilized ...
Added: November 6, 2017
Association for Computational Linguistics, 2021
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies ...
Added: August 31, 2021
Association for Computational Linguistics, 2019
Added: September 15, 2020
М. : Издательский центр «Российский государственный гуманитарный университет», 2019
The book includes 64 papers submitted to the International conference in computer linguistics and intellectual technologies Dialogue 2019 and presents a broad spectrum of theoretical and applied research of natural language description, language simulation, and creation of applied computer technologies. ...
Added: October 16, 2019
Berlin : Springer, 2014
This book constitutes the proceedings of the Third International Conference on Analysis of Images, Social Networks and Texts, AIST 2014, held in Yekaterinburg, Russia, in April 2014. The 11 full and 10 short papers were carefully reviewed and selected from 74 submissions. They are presented together with 3 short industrial papers, 4 invited papers and ...
Added: November 13, 2014
Switzerland : Springer, 2015
This book constitutes the refereed proceedings of the 6th Conference on Knowledge Engineering and the Semantic Web, KESW 2015, held in Moscow, Russia, in September/October 2015. The 17 revised full papers presented together with 6 short system descriptions were carefully reviewed and selected from 35 submissions. The papers address research issues related to semantic web, ...
Added: September 16, 2015
Tikhonov A., Yamshchikov I. P., / Cornell University. Series Computer Science "arxiv.org". 2021.
Chekhov's gun is a dramatic principle stating that every element in a story must be necessary, and irrelevant elements should be removed. This paper presents a new natural language processing task — Chekhov's gun recognition or (CGR) — recognition of entities that are pivotal for the development of the plot. Though similar to classical Named Entity Recognition ...
Added: December 3, 2021
Dubov M., Mirkin B., Шаль А. А., Открытые системы. СУБД 2014 № 10 С. 15-17
Currently, automating of text processing and analysis is a main tendency of IT applications. As of this moment, there is no unified approach to the analysis and visualization of big volumes of text data. Our system LM Monitor (Latent Meaning Monitor) generates so-called reference graphs which can be considered part of the popular technology of ...
Added: December 16, 2014
Chepovskiy A., М. : Национальный открытый университет «ИНТУИТ», 2015
В монографии рассмотрены различные математические модели для решения практических задач обработки текстов на естественных языках. Предлагаются решения проблем, возникающих при организации индексации и последующего поиска данных. Методы компьютерной лингвистики применяются для прикладных исследований. Предназначена для разработчиков информационных систем, специалистов в области компьютерной лингвистики. ...
Added: May 23, 2015
Krylov V., Krylov S., Жигалов Г. М., Journal of Physics: Conference Series 2019 Vol. 1405(1) No. DOI: 10.1088/1742-6596/1405/1/012011б
In the paper the case is studied then semiotic signs can be represented as language constructs in the same language as the text for the interpretation. The goal is to obtain estimates of the depth of interpretability with the respect to each of the signs by finding the projections of the narrative on these language ...
Added: June 28, 2021
Springer, 2021
This book constitutes revised selected papers from the 9th International Conference on Analysis of Images, Social Networks and Texts, AIST 2020, held during October 15-16, 2020. The conference was planned to take place in Moscow, Russia, but changed to an online format due to the COVID-19 pandemic.
The 27 full papers and 4 short papers presented ...
Added: October 7, 2020
Северина Е. М., Ларионова М. Ч., Litera 2023 № 10 С. 211-222
The article considers a model of preparation of machine-readable (semantic) markup of texts for the Chekhov Digital project on the example of philological interpretation of individual significant elements of A. P. Chekhov's story "Death of an Official" and presentation of this information explicitly based on the standards of digital publication Text Encoding Initiative (TEI/XML). Based ...
Added: January 12, 2024