Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories (TLT 16)

?

Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories (TLT 16)

Association for Computational Linguistics, 2017.

Editor-in-chief: J. Hajič

The volume includes papers presented at the 16th International Workshop on Treebanks and Linguistic Theories (TLT), which brings together developers and users of linguistically annotated natural language corpora. As ‘treebanks’ we consider any pairing of natural language data (spoken or written) with annotations of linguistic structure at various levels of analysis, ranging from e.g. morpho-phonology to discourse. The articles address all aspects of treebank design, development, and use, including reflections on the design of linguistic annotations, methodology studies, resource announcements or updates, annotation or conversion tool development, and reports on treebank usage.

REALEC learner treebank: annotation principles and evaluation of automatic parsing

Lyashevskaya O., Пантелеева И. М., , in: Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories (TLT 16). Association for Computational Linguistics, 2017. P. 80–87.

The paper presents a Universal Dependencies (UD) annotation scheme for a learner English corpus. The REALEC dataset consists of essays written in English by Russian-speaking university students in the course of general English. The original corpus is manually annotated for learners’ errors and gives information on the error span, error type, and the possible correction ...

Added: December 11, 2018

Research target: Computer Science Philology and Linguistics

Priority areas: humanitarian IT and mathematics

Language: English

Text on another site

Keywords: corpus linguistics treebank data-driven approach annotation consistency

Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories (TLT 16)

Using TXM Platform for Research on Language Changes over Time: The Dynamics of Vocabulary and Punctuation in Russian Literary Texts

Lavrentiev A. M., Sherstinova T., Chepovskiy A. et al., Vestnik Tomskogo Gosudarstvennogo Universiteta, Filologiya 2021 Vol. 70 P. 69–89

The purpose of this paper is to test the methodological tools provided by TXM platform for research on dynamics of vocabulary and punctuation marks in diachronic corpora. TXM is a powerful text analysis software which provides both quantitative and qualitative features in a transparent open-source implementation. In this paper, we demonstrate how it can be ...

Added: June 24, 2021

CEUR Workshop Proceedings (Proceedings of the International Conference "Internet and Modern Society" IMS-2020, 17-20 June 2020, ITMO University, St. Petersburg, Russia)

CEUR Workshop Proceedings, 2020.

The International Conference “Internet and Modern Society” (IMS-2020) was initially planned to take place in St. Petersburg, Russia. Due to the spread of COVID-19 and the ban on public events, the conference was held during 17-20 June 2020 in the format of online sessions with a discussion of papers and presentations uploaded in advance. The ...

Added: November 1, 2020

Квантитативные методы в диахронических корпусных исследованиях: конструкции с предикативами и дативным субъектом

Bonch-Osmolovskaya A. A., Компьютерная лингвистика и интеллектуальные технологии 2015 Т. 1 № 14(21) С. 80–95

The paper proposes new approaches to the problem of Russian dative subjects in predicative and adjective constructions. The core idea of the research is to study the distribution of dative subject constructions with predicative and adjective forms that potentially can be used in such constructions. The methodological novelty of the approach is manifested in the ...

Added: April 15, 2015

Труды международной конференции «Корпусная лингвистика-2019».

Издательство Санкт-Петербургского государственного университета, 2019.

Сборник содержит материалы докладов, представленных на Международной научной конференции «Корпусная лингвистика-2019» 24–28 июня 2019 г. в Санкт-Петербурге. Создание корпусов текстов является одним из приоритетных направлений в современной лингвистике. Проведение конференции по данной тематике знакомит ученых с современными разработками и новыми технологическими решениями в этой области, а также способствует обобщению опыта научных исследований по корпусной лингвистике. ...

Added: November 1, 2020

Новый комплекс инструментов автоматической обработки текста для платформыTXM и его апробация на корпусе для анализа экстремистских текстов

Лаврентьев А. М., Соловьев Ф. Н., Суворова М. И. et al., Вестник Новосибирского государственного университета. Серия: Лингвистика и межкультурная коммуникация 2018 Т. 16 № 3 С. 19–31

ПлатформаTXM предоставляет широкие возможности корпусного анализа, такие как анализ соответствий, кластеризация, построение лексических таблиц, поиск сложных лексических конструкций, выделение подкорпу-сов по различным параметрам. По умолчанию платформа работает со словоупотреблениями в качестве структур-ных единиц анализа. Она интегрирована с единственным расширениемTreeTagger, позволяющим проводить лишь морфологический анализ и лемматизацию словоупотреблений. Однако пользователь может сопроводить каждое словоупотребление набором дополнительных характеристик, ...

Added: September 8, 2018

Proceedings of the 7th Workshop on Balto-Slavic Natural Language Processing

Пономарева М. А., Дроганова К. А., Smurov I. et al., Florence: Association for Computational Linguistics, 2019.

This paper provides a comprehensive overview of the gapping dataset for Russian that consists of 7.5k sentences with gapping (as well as 15k relevant negative sentences) and comprises data from various genres: news, fiction, social media and technical texts. The dataset was prepared for the Automatic Gapping Resolution Shared Task for Russian (AGRR-2019) - a ...

Added: September 5, 2019

Computational Linguistics and Intellectual Technologies Papers from the Annual International Conference “Dialogue” (2019)

M.: Russian State University for the Humanitie, 2019.

The book includes 64 papers submitted to the International conference in computer linguistics and intellectual technologies Dialogue 2019 and presents a broad spectrum of theoretical and applied research of natural language description, language simulation, and creation of applied computer technologies. ...

Added: October 16, 2019

Корпус татарского языка "Туган тел"

Arkhangelskiy T., Гильмуллин Р. А., Невзорова О. А. et al., Научно-техническая информация. Серия 2: Информационные процессы и системы 2013

В статье описывается электронный корпус татарского языка, созданный в рамках программы фундаментальных исследований Президиума РАН "Корпусная лингвистика", и методы, использованные авторами для создания этого корпуса. В частности, описываются текстовый состав и жанровая структура корпуса, принятые авторами решения о выделении морфологических характеристик, автоматическая морфологическая разметка текстов с помощью двухуровневой модели морфологии и анализатора PC-KIMMO и размещение ...

Added: October 25, 2013

Труды международной конференции "Корпусная лингвистика - 2019"

СПб.: Издательство Санкт-Петербургского университета, 2019.

Сборние содержит материалы докладов, представленных на Международной научной конференции "Корпусная лингвистика-2019" 24-28 июня 2019 г. в Санкт-Петербурге. ...

Added: July 8, 2019

Computational Linguistics and Intelligent Text Processing, Lecture Notes in Computer Science

Springer, 2015.

16th International Conference, CICLing 2015, Cairo, Egypt, April 14-20, 2015, Proceedings, Part I ISBN: 978-3-319-18110-3 (Print) 978-3-319-18111-0 (Online) ...

Added: April 23, 2015

Speech and Computer. 21st International Conference, SPECOM 2019, Istanbul, Turkey, August 20–25, 2019, Proceedings

Switzerland: Springer, 2019.

This volume contains a collection of submitted papers presented at the conference, which were thoroughly reviewed by members of the Program Committee consisting of more than 100 top specialists, as well as an invited paper by Prof. Scharenborg. Each paper was reviewed, single blind, by two to four committee members (three reviewers on the average) and then discussed by ...

Added: October 29, 2019

Машина в духе. Гуссерлевское обоснование и ограничение искусственного интеллекта.

Холенштайн Э., Логос 2007 Т. 63 № 6 С. 176–196

В статье предпринимается попытка применить феноменологическуюй психологию Эдмунда Гуссерля к разрешению проблемы соотношения естественного и искусственного интеллекта. ...

Added: July 7, 2015

Computational Linguistics and Intellectual Technologies

M.: Russian State University for the Humanitie, 2019.

The book includes 61 reports of the International conference on computer and intellectual technology "Dialogue-2019", representing a wide range of theoretical and applied research in the field of natural language description, modeling of language processes, creating practically applicable computer linguistic technologies. For specialists in the field of theoretical and applied linguistics and intellectual technologies. ...

Added: June 12, 2019

Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог» (Москва, 29 мая — 1 июня 2019 г.)

М.: Издательский центр «Российский государственный гуманитарный университет», 2019.

Added: October 16, 2019

Количественная оценка грамматической неоднозначности некоторых европейских языков

Klyshinskiy E., Логачёва В. К., Карпик О. В. et al., Вестник Новосибирского государственного университета. Серия: Лингвистика и межкультурная коммуникация 2020 Т. 18 № 1 С. 5–21

The grammatical ambiguity (multiple sets of grammatical features for one word form or coinciding surface forms of different words) can be of different types. We describe six classes of grammatical ambiguity: unambiguous, ambiguous by grammatical features, by part of speech, by lemma, by lemma and part of speech, and out-of-vocabulary words. These classes are presented ...

Added: December 11, 2019

Proceedings of the 7th evaluation campaign of Natural Language Processing and Speech tools for Italian (EVALITA 2020)

CEUR-WS.org, 2020.

Added: October 7, 2020

Dark personalities on Facebook: Harmful online behaviors and language

Bogolyubova O., Panicheva P., Tikhonov R. et al., Computers in Human Behavior 2018 Vol. 78 P. 151–159

*Реализация соц. сети Facebook запрещена на территории России по основаниям осуществления экстремистской деятельности. The goal of this paper was to assess the connection between dark personality traits and engagement in harmful online behaviors in a sample of Russian Facebook users, and to describe the language they use in online communication. A total of 6724 individuals participated ...

Added: February 18, 2019

Anaphoric annotation and corpus-based anaphora resolution: An experiment

Alexeeva S. V., Protopopova E. V., Bodrova A. A. et al., Компьютерная лингвистика и интеллектуальные технологии 2014 P. 562–571

The paper describes the noun phase and anaphora annotation in OpenCorpora and compares it to that in other corpora. We discuss the choice of representative texts for anaphoric annotation and the basic principles of syntactic annotation. In case of noun phrase annotation we followed the scheme introduced earlier for morphological annotation: it was carried out ...

Added: October 8, 2014

RUSSE2018: a Shared Task on Word Sense Induction for the Russian Language

Panchenko A., Lopukhina A., Ustalov D. et al., Компьютерная лингвистика и интеллектуальные технологии 2018 No. 17 P. 547–564

The paper describes the results of the first shared task on word sense induction (WSI) for the Russian language. While similar shared tasks were conducted in the past for some Romance and Germanic languages, we explore the performance of sense induction and disambiguation methods for a Slavic language that shares many features with other Slavic ...

Added: June 7, 2018

Computational Linguistics and Intellectual Technologies. Papers from the Annual International Conference “Dialogue” (2015)

M.: Russian State University for the Humanitie, 2015.

Added: April 28, 2015

Insights into the web based English learning projects

Frolova N., Frolov E. S., The Kazakh-American Free University Academic Journal 2017 P. 179–184

The article reflects the practical experience of enhancing the process of Academic English Writing teaching to undergraduate students by means of web tools. Along with theoretical analysis of the integration scheme of blended learning into the curriculum the article features empirical survey to confirm the efficiency of the project. The article contains a detailed description ...

Added: June 5, 2018

Exploring the Effectiveness of Methods for Persona Extraction

Konstantin Zaitsev, / Series Computer Science "arxiv.org". 2024.

The paper presents a study of methods for extracting information about dialogue participants and evaluating their performance in Russian. To train models for this task, the Multi-Session Chat dataset was translated into Russian using multiple translation models, resulting in improved data quality. A metric based on the F-score concept is presented to evaluate the effectiveness ...

Added: September 26, 2024

Онтологические модели ситуаций в задачах компьютерного контроля знаний иностранного языка

Demkin V. M., Sosnin A., Сусманова С. С., Онтология проектирования 2014 № 3(13) С. 63–76

Discussed in the paper are modern approaches to the design of complicated intellectual computer systems assessing foreign language proficiency, e.g. checking students’ academic progress in a higher educational establishment. The paper provides insight into the means to develop ontology-based situation models in the tasks requiring that a person’s command of English be assessed, which is ...

Added: October 24, 2012

Proceedings of the Eleventh International Conference on Computational Creativity

Coimbra: Association for Computational Creativity, 2020.

Added: September 29, 2020