?
Proceedings of The 12th Language Resources and Evaluation Conference
Vol. 12.
European Language Resources Association (ELRA), 2020.
Welcome to the 12th edition of LREC . . . that should have been in Marseille, first time in France! Unfortunately not now, in May 2020. Now my welcome is completely virtual, to all of you authors of these Proceedings papers and to the colleagues who will look at these. Virtual but not less sincere. This LREC would have also been an occasion to celebrate the 25th anniversary of ELRA. We are proud that ELRA is becoming a mature association. And LREC too. LREC started in 1998, 22 years ago. We hope to welcome you in a non-virtual way next year in Marseille. We will enjoy together not only the conference but also the special “light” of Marseille and the wonderful view of the Mediterranean and the city from the Palais du Pharo.
Shavrina T., Emelyanov A., Fenogenova A. et al., , in : Proceedings of The 12th Language Resources and Evaluation Conference. Vol. 12.: European Language Resources Association (ELRA), 2020. P. 2276-2284.
Artificial General Intelligence (AGI) is showing growing performance in numerous applications - beating human performance in Chess and Go, using knowledge bases and text sources to answer questions (SQuAD) and even pass human examination (Aristo project). In this paper, we describe the results of AI Journey, a competition of AI-systems aimed to improve AI performance ...
Added: June 15, 2020
Krotova I., Aksenov S., Artemova E., , in : Proceedings of The 12th Language Resources and Evaluation Conference. Vol. 12.: European Language Resources Association (ELRA), 2020. P. 4410-4417.
Applications such as machine translation, speech recognition, and information retrieval require efficient handling of noun compounds as they are one of the possible sources for out of vocabulary words. In-depth processing of noun compounds requires not only splitting them into smaller components (or even roots) but also the identification of instances that should remain unsplitted ...
Added: June 21, 2020
Logacheva V., Teslenko D., Shelmanov A. et al., , in : Proceedings of The 12th Language Resources and Evaluation Conference. Vol. 12.: European Language Resources Association (ELRA), 2020. P. 5943-5952.
Disambiguation of word senses in context is easy for humans, but is a major challenge for automatic approaches. Sophisticated supervised and knowledge-based models were developed to solve this task. However, (i) the inherent Zipfian distribution of supervised training instances for a given word and/or (ii) the quality of linguistic knowledge representations motivate the development of ...
Added: June 21, 2020
Khakhmovich A., Pavlova S., Kirillova K. et al., , in : Proceedings of The 12th Language Resources and Evaluation Conference. Vol. 12.: European Language Resources Association (ELRA), 2020. P. 4247-4255.
Out-of-vocabulary words are still a challenge in cross-lingual Natural Language Processing tasks, for which transliteration from source to target language or script is one of the solutions. In this study, we collect a personal name dataset in 445 Wikidata languages (37 scripts), train Transformer-based multilingual transliteration models on 6 high- and 4 less-resourced languages, compare ...
Added: October 9, 2020
Smirnova K. V., Korotaev N. A., Panikratova Y. R. et al., , in : Proceedings of The 12th Language Resources and Evaluation Conference. Vol. 12.: European Language Resources Association (ELRA), 2020. Ch. 25. P. 195-203.
In modern linguistics and psycholinguistics speech disfluencies in real fluent speech are a well-known phenomenon. But it’s not still clear which components of brain systems are involved into its comprehension in a listener’s brain. In this paper we provide a pilot neuroimaging study of the possible neural correlates of speech disfluencies perception, using a combination ...
Added: November 20, 2020
Tyers F. M., Keleg A., Pirinen T., , in : Proceedings of The 12th Language Resources and Evaluation Conference. Vol. 12.: European Language Resources Association (ELRA), 2020. P. 3842-3850.
Morphological analysis is one of the tasks that have been studied for years. Different techniques have been used to develop models for performing morphological analysis. Models based on finite state transducers have proved to be more suitable for languages with low available resources. In this paper, we have developed a method for weighting a morphological ...
Added: April 20, 2021
Meyer J., Rauchenstein L., Eisenberg J., , in : Proceedings of The 12th Language Resources and Evaluation Conference. Vol. 12.: European Language Resources Association (ELRA), 2020. P. 6462-6468.
We describe the creation of the Artie Bias Corpus, an English dataset of expert-validated <audio, transcript> pairs with demographic tags for age, gender, accent. We also release open software which may be used with the Artie Bias Corpus to detect demographic bias in Automatic Speech Recognition systems, and can be extended to other speech technologies. ...
Added: April 20, 2021
Shavrina T., Emelyanov A., Fenogenova A. et al., , in : Proceedings of The 12th Language Resources and Evaluation Conference. Vol. 12.: European Language Resources Association (ELRA), 2020. P. 2276-2284.
Artificial General Intelligence (AGI) is showing growing performance in numerous applications - beating human performance in Chess and Go, using knowledge bases and text sources to answer questions (SQuAD) and even pass human examination (Aristo project). In this paper, we describe the results of AI Journey, a competition of AI-systems aimed to improve AI performance ...
Added: June 15, 2020
Starchenko A., Kazakevich L., Lyashevskaya O., / НИУ ВШЭ. Series WP BRP "Linguistics". 2018. No. 76.
The poetic texts pose a challenge to full morphological tagging and lemmatization since the authors seek to extend the vocabulary, employ morphologically and semantically deficient forms, go beyond standard syntactic templates, use non-projective constructions and non-standard word order, among other techniques of the creative language game. In this paper we evaluate a number of probabilistic ...
Added: December 12, 2018
Toldova S.Ju., Roytberg A., Nedoluzhko А. et al., , in : Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной Международной конференции «Диалог» (Бекасово, 4 — 8 июня 2014 г.). Вып. 13(20).: М. : Изд-во РГГУ, 2014. P. 681-695.
The paper reports on the recent forum RU-EVAL ‒ a new initiative for evaluation of Russian NLP resources, methods and toolkits. The first two events were devoted to morphological and syntactic parsing correspondingly. The third event is devoted to anaphora and coreference resolution. Seven participating IT companies and academic institutions submitted their results for anaphora ...
Added: October 6, 2014
Springer, 2021
This book constitutes the proceedings of the 19th Russian Conference on Artificial Intelligence, RCAI 2021, held in Moscow, Russia, in October 2021.
The 19 full papers and 7 short papers presented in this volume were carefully reviewed and selected from 80 submissions. The conference deals with a wide range of topics, categorized into the following topical ...
Added: October 28, 2021
Razzhigaev A., Nikolay Arefyev, Panchenko A., , in : Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021). : Association for Computational Linguistics, 2021. P. 157-162.
In this paper, we present a system for the solution of the cross-lingual and multilingual word-in-context disambiguation task. Task organizers provided monolingual data in several languages, but no cross-lingual training data were available. To address the lack of the officially provided cross-lingual training data, we decided to generate such data ourselves. We describe a simple ...
Added: September 23, 2021
I. K. Kusakin, Fedorets O. V., A. Y. Romanov, Scientific and Technical Information Processing 2023 Vol. 50 No. 3 P. 176-183
This paper discusses modern approaches to natural language processing and the application of machine learning models to the task of classifying short scientific texts in Russian. This study is devoted to the analysis of methods for vectorization of textual information, selection of a model for scientific paper clas- sification, and training of linguistic model BERT ...
Added: November 4, 2023
Baklanova V., Kurkin A., Teplova T., China Finance Review International 2023
Purpose – The primary objective of this research is to provide a precise interpretation of the constructed
machine learning model and produce definitive summaries that can evaluate the influence of investor sentiment on the overall sales of non-fungible token (NFT) assets. To achieve this objective, the NFT hype
index was constructed as well as several approaches of ...
Added: December 10, 2023
Karpov N., В кн. : Современные проблемы информатизации в анализе и синтезе технологических и программно-телекоммуникационных систем: Сборник трудов. Вып. 17.: Воронеж : Научная книга, 2012. С. 264-266.
Added: November 7, 2012
Фирсанова В. И., International Journal of Open Information Technologies 2021 Vol. 9 No. 12 P. 53-59
The paper presents a study on question answering systems evaluation. The purpose of the study is to determine if human evaluation is indeed necessary to qualitatively measure the performance of a sociomedical dialogue system. The study is based on the data from several natural language processing experiments conducted with a question answering dataset for inclusion of people with autism spectrum disorder and state-of-the-art ...
Added: September 25, 2023
Alimova l., Tutubalina E., Journal of Biomedical Informatics 2020 Vol. 103 P. 1-9
Relation extraction aims to discover relational facts about entity mentions from plain texts. In this work, we focus on clinical relation extraction; namely, given a medical record with mentions of drugs and their attributes, we identify relations between these entities. We propose a machine learning model with a novel set of knowledge-based and BioSentVec embedding ...
Added: October 28, 2020
Luparov A., Panov A. I., Suvorov R. et al., , in : Proceedings of ICPRAM 2015 - 4th International Conference on Pattern Recognition Applications and Methods. Vol. 2.: SciTePress, 2015. P. 270-276.
Dendritic cells (DCs) vaccination is a promising way to contend cancer metastases especially in the case of immunogenic tumors. Unfortunately, it is only rarely possible to achieve a satisfactory clinical outcome in the majority of patients treated with a particular DC vaccine. Apparently, DC vaccination can be successful with certain combinations of features of the ...
Added: November 20, 2015
Ivan P. Yamshchikov, Shibaev V., Nagaev A. et al., , in : Proceedings of the 3rd Workshop on Neural Generation and Translation. : Association for Computational Linguistics, 2019. P. 128-137.
This paper focuses on latent representations that could effectively decompose different aspects of textual information. Using a framework of style transfer for texts, we propose several empirical methods to assess information decomposition quality. We validate these methods with several state-of-the-art textual style transfer methods. Higher quality of information decomposition corresponds to higher performance in terms ...
Added: January 7, 2021
Stroudsburg, PA : Association for Computational Linguistics, 2019
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications ...
Added: October 5, 2020
Switzerland : Springer, 2015
This book constitutes the refereed proceedings of the 6th Conference on Knowledge Engineering and the Semantic Web, KESW 2015, held in Moscow, Russia, in September/October 2015. The 17 revised full papers presented together with 6 short system descriptions were carefully reviewed and selected from 35 submissions. The papers address research issues related to semantic web, ...
Added: September 16, 2015
Kutuzov A. B., Никишина И. А., , in : Analysis of Images, Social Networks and Texts. 8th International Conference, AIST 2019, Lecture Notes in Computer Science, Revised Selected Papers. Vol. 11832.: Cham : Springer, 2019. P. 3-8.
Double-blind peer reviewing has been proved to be a pretty effective and fair way of academic work selection. However, to the best of our knowledge, nobody has yet analysed the effects caused by its introduction at the Russian NLP conferences. We investigate how the double-blind peer reviewing influences gender and location (according to authors’ affiliations) ...
Added: January 20, 2020
Association for Computational Linguistics, 2019
Added: September 15, 2020
P. : European Language Resources Association (ELRA), 2018
Book of abstracts from the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) ...
Added: May 5, 2018
Zakhlebin I. V., В кн. : Электронный бизнес. Управление интернет-проектами. Инновации: Сборник трудов участников студенческой научно-практической конференции, Москва, 12-14 марта 2013 г. : М. : НИУ ВШЭ, 2014. С. 88-91.
The report deals with the methodology of building a system to perform search for specialists satisfying a defined set of competencies. The proposed search method is based on natural language texts analysis. ...
Added: July 11, 2015
Durandin O., Malafeev A., Zolotykh N., , in : Analysis of Images, Social Networks and Texts. 6th International Conference, 2017, Revised Selected Papers. Vol. 10716.: Cham : Springer, 2018. Ch. 4. P. 34-46.
The paper deals with Google’s universal parser SyntaxNet. The system was used to analyze the Universal Dependencies linguistic corpora. We conducted an error analysis of the output of the parser to reveal to what extent the error types are connected with or preconditioned by the language types. In particular, we carried out several experiments, clustering ...
Added: December 1, 2017
Davletov A., Gordeev D., Nikolay Arefyev et al., , in : Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021). : Association for Computational Linguistics, 2021. P. 1249-1254.
This work describes our approach for subtasks of SemEval-2021 Task 8: MeasEval: Counts and Measurements which took the official first place in the competition. To solve all subtasks we use multi-task learning in a question-answering-like manner. We also use learnable scalar weights to weight subtasks’ contribution to the final loss in multi-task training. We fine-tune ...
Added: September 23, 2021
М. : Издательский центр «Российский государственный гуманитарный университет», 2019
The book includes 64 papers submitted to the International conference in computer linguistics and intellectual technologies Dialogue 2019 and presents a broad spectrum of theoretical and applied research of natural language description, language simulation, and creation of applied computer technologies. ...
Added: October 16, 2019
Berlin : Springer, 2014
This book constitutes the proceedings of the Third International Conference on Analysis of Images, Social Networks and Texts, AIST 2014, held in Yekaterinburg, Russia, in April 2014. The 11 full and 10 short papers were carefully reviewed and selected from 74 submissions. They are presented together with 3 short industrial papers, 4 invited papers and ...
Added: November 13, 2014
Karpov I., Крылова Т. В., Timoshenko S., Scando-Slavica 2022 P. 1-20
In this paper we describe the difference between informal comments, posted on social networks, and internet journalistic style texts, which tend to be written in a Codified Literary Russian. We performed a quantitative analysis of more than0 graphic, morphological, syntactic features, and supplied statistically significant features with the linguistic interpretation. The article concluded that the ...
Added: October 31, 2021
Tikhonova M., Elina Telesheva, Mirzoev S. et al., , in : 2021 International Conference Engineering and Telecommunication (En&T). : IEEE, 2022. P. 1-6.
Style transfer is an important and a rapidly developing of Natural Language Processing. This days more and more methods and models are proposed which allow us to generate text in predefined style. In this paper we propose a framework for style transfer of “Friends” TV series. The trained models are able to mimic one of ...
Added: May 21, 2022