CLLS 2016. Computational Linguistics and Language Science. Proceedings of the Workshop on Computational Linguistics and Language Science. Moscow, Russia, April 26, 2016. Val. 1886.

?

CLLS 2016. Computational Linguistics and Language Science. Proceedings of the Workshop on Computational Linguistics and Language Science. Moscow, Russia, April 26, 2016

Vol. 1886. Aachen : CEUR Workshop Proceedings, 2017.

Academic editor: Artemova E., Ilvovsky D., Skorinkin D., Vybornova A.

As the number of digital texts increases rapidly, there is a pressing need for more advanced and diverse tools of natural language processing. While purely statistical approaches proved powerful and efficient for many NLP tasks, there are many applications that would benefit from the formal models and approaches traditional language science has to offer. With hopes to facilitate this interaction between theory and practical implementation, we are pleased to announce the workshop on Computational Linguistics and Language Science to be held in Moscow, Russia on April 25, 2016 (11 AM to 6 PM).

Automatic generation of lexical exercises

Kuzmenko E., Fenogenova A., , in: CLLS 2016. Computational Linguistics and Language Science. Proceedings of the Workshop on Computational Linguistics and Language Science. Moscow, Russia, April 26, 2016Vol. 1886. Aachen: CEUR Workshop Proceedings, 2017. P. 20–27.

We propose an approach towards automatic generation of lexical exercises for learners of English. The techniques and tools used for generation of five different exercise types are described. We provide examples and evaluate the quality of generated exercises. We also compare the exercises generated on the basis of two different corpora by conducting an experiment. In the experiment learners ...

Added: December 14, 2016

Quantum Logic and Natural Language Processing

Makarov I., Anastasia Frolenkova, Ivan Belov, , in: CLLS 2016. Computational Linguistics and Language Science. Proceedings of the Workshop on Computational Linguistics and Language Science. Moscow, Russia, April 26, 2016Vol. 1886. Aachen: CEUR Workshop Proceedings, 2017. P. 135–140.

The paper presents a short summary on the applications of the quantum logic categorical constructions to the natural language processing. We give a brief overview on the topic of quantum logic in general, and in natural language processing, in particular. As a result, we discuss comparison of sentences and their representation in quantum logic formalism. ...

Added: June 25, 2017

Big-data-augmented Approach to Emerging Technologies Identification: Case of Agriculture and Food Sector

Kuzminov I., Bakhtin P. D., Lavrynenko A. S., , in: CLLS 2016. Computational Linguistics and Language Science. Proceedings of the Workshop on Computational Linguistics and Language Science. Moscow, Russia, April 26, 2016Vol. 1886. Aachen: CEUR Workshop Proceedings, 2017. Ch. 15 P. 130–134.

The paper discloses a new approach to emerging technologies identi- fication, which strongly relies on capacity of big data analysis, namely text min- ing augmented by syntactic analysis techniques. The opportunities of the new big-data-augmented methodology are shown in comparison to existing results, both globally and in Russia. The integrated ontology of currently emerging tech- ...

Added: September 7, 2017

Annotated Suffix Tree Method for German Compound Splitting

Shishkova A., Artemova E., , in: CLLS 2016. Computational Linguistics and Language Science. Proceedings of the Workshop on Computational Linguistics and Language Science. Moscow, Russia, April 26, 2016Vol. 1886. Aachen: CEUR Workshop Proceedings, 2017. P. 42–47.

The paper presents an unsupervised and knowledge-free ap- proach to compound splitting. Although the research is focused on Ger- man compounds, the method is expected to be extensible to other com- pounding languages. The approach is based on the annotated suffix tree (AST) method proposed and modified by Mirkin et al. To the best of ...

Added: October 10, 2017

Formal Concept Lattices as Semantic Maps

Ryzhova D., Obiedkov S., , in: CLLS 2016. Computational Linguistics and Language Science. Proceedings of the Workshop on Computational Linguistics and Language Science. Moscow, Russia, April 26, 2016Vol. 1886. Aachen: CEUR Workshop Proceedings, 2017. Ch. 10 P. 78–87.

In this paper, we present an application for formal concept analysis (FCA) by showing how it can help construct a semantic map for a lexical typological study. We show that FCA captures typological regularities, so that concept lattices automatically built from linguistic data appear to be even more informative than traditional semantic maps. While sometimes ...

Added: October 14, 2017

Lexis Meets Meter: Attraction of Lexical Units in Russian Verse

Orekhov B., , in: CLLS 2016. Computational Linguistics and Language Science. Proceedings of the Workshop on Computational Linguistics and Language Science. Moscow, Russia, April 26, 2016Vol. 1886. Aachen: CEUR Workshop Proceedings, 2017. P. 110–121.

Статья о словах и метрах в русской поэзии ...

Added: November 7, 2017

Identification of Singleton Mentions in Russian

Toldova S., Max Ionov, , in: CLLS 2016. Computational Linguistics and Language Science. Proceedings of the Workshop on Computational Linguistics and Language Science. Moscow, Russia, April 26, 2016Vol. 1886. Aachen: CEUR Workshop Proceedings, 2017. Ch. 5 P. 33–41.

This paper describes a pilot study of the problem of detecting singleton mentions in Russian texts. A noun phrase is considered a singleton mention if it is the only referent of some entity. We discuss various morphosyntactic and lexical features, some of which were used for analogous tasks for English and propose new features derived ...

Added: November 9, 2017

Text classification based on deep textual parsing

Galitsky B., Ilvovsky D., , in: CLLS 2016. Computational Linguistics and Language Science. Proceedings of the Workshop on Computational Linguistics and Language Science. Moscow, Russia, April 26, 2016Vol. 1886. Aachen: CEUR Workshop Proceedings, 2017. P. 57–65.

The problem of classifying text based on the deep parsing structure is addressed. An algorithm for document classification tasks where counts of words or n-grams is insufficient is proposed. The parse tree kernel method at the level of paragraphs, based on anaphora, rhetoric structure relations and communicative actions linking phrases in the parse thicket is ...

Added: October 28, 2018

Research target: Philology and Linguistics Computer Science

Priority areas: humanitarian IT and mathematics

Language: English

Text on another site

Keywords: natural language processing Computational Linguistics and Language Science CLLS 2016

CLLS 2016. Computational Linguistics and Language Science. Proceedings of the Workshop on Computational Linguistics and Language Science. Moscow, Russia, April 26, 2016

Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing

Stroudsburg, PA: Association for Computational Linguistics, 2017.

This volume contains the papers presented at BSNLP-2017: the Sixth Workshop on Balto-Slavic Natural Language Processing. The Workshop is organized by SIGSLAV—Special Interest Group on NLP in Slavic Languages of the Association for Computational Linguistics. The Workshops have been convening for over a decade, with a clear vision and purpose. On one hand, the languages from ...

Added: June 13, 2017

Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH)

Osaka: [б.и.], 2016.

Language resources are increasingly used not only in Language Technology (LT), but also in other subject fields, such as the digital humanities (DH) and in the field of education. Applying LT tools and data for such fields implies new perspectives on these resources regarding domain adaptation, interoperability, technical requirements, documentation, and usability of user interfaces. ...

Added: November 12, 2016

Корпус татарского языка "Туган тел"

Arkhangelskiy T., Гильмуллин Р. А., Невзорова О. А. et al., Научно-техническая информация. Серия 2: Информационные процессы и системы 2013

В статье описывается электронный корпус татарского языка, созданный в рамках программы фундаментальных исследований Президиума РАН "Корпусная лингвистика", и методы, использованные авторами для создания этого корпуса. В частности, описываются текстовый состав и жанровая структура корпуса, принятые авторами решения о выделении морфологических характеристик, автоматическая морфологическая разметка текстов с помощью двухуровневой модели морфологии и анализатора PC-KIMMO и размещение ...

Added: October 25, 2013

Text, Speech and Dialogue 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings

Springer, 2014.

This book constitutes the refereed proceedings of the 17th International Conference on Text, Speech and Dialogue, TSD 2013, held in Brno, Czech Republic, in September 2014. The 70 papers presented together with 3 invited papers were carefully reviewed and selected from 143 submissions. They focus on topics such as corpora and language resources; speech recognition; ...

Added: September 15, 2014

Information Extraction Based on Deep Syntactic-Semantic Analysis

Skorinkin D.A., Budnikov E. A., Stepanova M. E. et al., Компьютерная лингвистика и интеллектуальные технологии 2016 No. 15 P. 721–733

This paper presents a rule-based approach to Information Extraction (IE) task within FactRuEval-2016 competition. Our system is based on ABBYY Compreno Technology. The technology uses the results of deep syntactic-semantic analysis, which leads to significant reduction of the number of necessary rules and makes them laconic. The evaluation was conducted on FactRuEval dataset. FactRuEval is ...

Added: August 28, 2016

Computational Linguistics and Intellectual Technologies Papers from the Annual International Conference “Dialogue” (2019)

M.: Russian State University for the Humanitie, 2019.

The book includes 64 papers submitted to the International conference in computer linguistics and intellectual technologies Dialogue 2019 and presents a broad spectrum of theoretical and applied research of natural language description, language simulation, and creation of applied computer technologies. ...

Added: October 16, 2019

Applying statistical tagging to Russian poetry

Starchenko A., Kazakevich L., Lyashevskaya O., / NRU HSE. Series WP BRP "Linguistics". 2018. No. 76.

The poetic texts pose a challenge to full morphological tagging and lemmatization since the authors seek to extend the vocabulary, employ morphologically and semantically deficient forms, go beyond standard syntactic templates, use non-projective constructions and non-standard word order, among other techniques of the creative language game. In this paper we evaluate a number of probabilistic ...

Added: December 12, 2018

Proceedings of the 4th workshop on NLP for Computer Assisted Language Learning at NODALIDA 2015, Vilnius, 11th May, 2015

Linköping University Electronic Press, 2015.

The workshop series on Natural Language Processing (NLP) for Computer-Assisted Language Learning (CALL) – NLP4CALL – is a meeting place for researchers working on the integration of Natural Language Processing and Speech Technologies in CALL systems and exploring the theoretical and methodological issues arising in this connection. ...

Added: May 31, 2015

Сборник статей по результатам семинара CLLS 2016

[б.и.], 2016.

Added: December 14, 2016

Количественная оценка грамматической неоднозначности некоторых европейских языков

Klyshinskiy E., Логачёва В. К., Карпик О. В. et al., Вестник Новосибирского государственного университета. Серия: Лингвистика и межкультурная коммуникация 2020 Т. 18 № 1 С. 5–21

The grammatical ambiguity (multiple sets of grammatical features for one word form or coinciding surface forms of different words) can be of different types. We describe six classes of grammatical ambiguity: unambiguous, ambiguous by grammatical features, by part of speech, by lemma, by lemma and part of speech, and out-of-vocabulary words. These classes are presented ...

Added: December 11, 2019

Computational Linguistics and Intelligent Text Processing, Lecture Notes in Computer Science

Springer, 2015.

16th International Conference, CICLing 2015, Cairo, Egypt, April 14-20, 2015, Proceedings, Part I ISBN: 978-3-319-18110-3 (Print) 978-3-319-18111-0 (Online) ...

Added: April 23, 2015

Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог» (Москва, 29 мая — 1 июня 2019 г.)

М.: Издательский центр «Российский государственный гуманитарный университет», 2019.

Added: October 16, 2019

Computational Linguistics and Intellectual Technologies

M.: Russian State University for the Humanitie, 2019.

The book includes 61 reports of the International conference on computer and intellectual technology "Dialogue-2019", representing a wide range of theoretical and applied research in the field of natural language description, modeling of language processes, creating practically applicable computer linguistic technologies. For specialists in the field of theoretical and applied linguistics and intellectual technologies. ...

Added: June 12, 2019

Извлечение сценарной информации из текстов. Часть 1. Постановка задачи и обзор методов

Суворова М. И., Кобозева М. В., Toldova S. et al., Искусственный интеллект и принятие решений 2020 № 1 С. 17–26

В статье обсуждается важность автоматического сценарного анализа для понимания текстов на естественном языке. Дан широкий обзор методов и подходов к описанию и извлечению сценариев. Рассмотрены теоретические подходы к формализации сценариев. Приведен список задач, для решения которых используется информация о сценарной структуре текста. Представлены популярные подходы к автоматическому извлечению сценариев из текстов и методы оценки их ...

Added: April 22, 2020

Проблемы обработки естественного языка в диалоговых системах

Klyshinskiy E., Жеребцова Ю., Чижик А., Системный администратор 2019 № 10 С. 82–91

Nowadays, a field of dialogue systems and conversational agents is one of the rapidly growing research areas in artificial intelligence applications. Business and industry are showing increasing interest in implementing intelligent conversational agents into their products. Many recent studies has tended to focus on possibility of developing task-oriented systems which are able to have long ...

Added: October 26, 2019

Universal Dependencies for Russian: A New Syntactic Dependencies Tagset

Lyashevskaya O., Droganova K., Zeman D. et al., / NRU HSE. Series WP BRP "Linguistics". 2016. No. 44.

This paper presents the Universal Dependencies tagset (UD v1) as a new annotation scheme for Russian treebanks. The universal list of dependency relations was adopted and extended to comply with certain language-specific syntactic constructions. The tagset was validated, converting two Russian treebanks into the UD format, UD-Russian-SynTagRus and UD-Russian-Google. ...

Added: December 14, 2016

Труды международной конференции "Корпусная лингвистика - 2019"

СПб.: Издательство Санкт-Петербургского университета, 2019.

Сборние содержит материалы докладов, представленных на Международной научной конференции "Корпусная лингвистика-2019" 24-28 июня 2019 г. в Санкт-Петербурге. ...

Added: July 8, 2019

Speech and Computer. 21st International Conference, SPECOM 2019, Istanbul, Turkey, August 20–25, 2019, Proceedings

Switzerland: Springer, 2019.

This volume contains a collection of submitted papers presented at the conference, which were thoroughly reviewed by members of the Program Committee consisting of more than 100 top specialists, as well as an invited paper by Prof. Scharenborg. Each paper was reviewed, single blind, by two to four committee members (three reviewers on the average) and then discussed by ...

Added: October 29, 2019

Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics

Sofia: Springer, 2013.

Welcome to the 2013 Conference of the Association for Computational Linguistics! Our community continues to grow, and this year’s conference has set a new record for paper submissions. We received 1286 submissions, which is 12% more than the previous record; we are particularly pleased to see a striking increase in the number of short papers ...

Added: October 1, 2014

Машина в духе. Гуссерлевское обоснование и ограничение искусственного интеллекта.

Холенштайн Э., Логос 2007 Т. 63 № 6 С. 176–196

В статье предпринимается попытка применить феноменологическуюй психологию Эдмунда Гуссерля к разрешению проблемы соотношения естественного и искусственного интеллекта. ...

Added: July 7, 2015

Insights into the web based English learning projects

Frolova N., Frolov E. S., The Kazakh-American Free University Academic Journal 2017 P. 179–184

The article reflects the practical experience of enhancing the process of Academic English Writing teaching to undergraduate students by means of web tools. Along with theoretical analysis of the integration scheme of blended learning into the curriculum the article features empirical survey to confirm the efficiency of the project. The article contains a detailed description ...

Added: June 5, 2018

Exploring the Effectiveness of Methods for Persona Extraction

Konstantin Zaitsev, / Series Computer Science "arxiv.org". 2024.

The paper presents a study of methods for extracting information about dialogue participants and evaluating their performance in Russian. To train models for this task, the Multi-Session Chat dataset was translated into Russian using multiple translation models, resulting in improved data quality. A metric based on the F-score concept is presented to evaluate the effectiveness ...

Added: September 26, 2024

Computational Linguistics and Intellectual Technologies. Papers from the Annual International Conference “Dialogue” (2015)

M.: Russian State University for the Humanitie, 2015.

Added: April 28, 2015

RUSSE2018: a Shared Task on Word Sense Induction for the Russian Language

Panchenko A., Lopukhina A., Ustalov D. et al., Компьютерная лингвистика и интеллектуальные технологии 2018 No. 17 P. 547–564

The paper describes the results of the first shared task on word sense induction (WSI) for the Russian language. While similar shared tasks were conducted in the past for some Romance and Germanic languages, we explore the performance of sense induction and disambiguation methods for a Slavic language that shares many features with other Slavic ...

Added: June 7, 2018