Do Data-based Curricula Work?
Сурков М. К., Мосин В. Д., Yamshchikov I. P.
Dublin: Association for Computational Linguistics, 2022
, , , Do Data-based Curricula Work? / Cornell University. Series Computer Science "arxiv.org". 2021.
Current state-of-the-art NLP systems use large neural networks that require lots of computational resources for training. Inspired by human knowledge acquisition, researchers have proposed curriculum learning, — sequencing of tasks (task-based curricula) or ordering and sampling of the datasets (data-based curricula) that facilitate training. This work investigates the benefits of data-based curriculum learning for large modern ...
Added: January 17, 2022
, , , in: Database and Expert Systems Applications. 27th International Conference, DEXA 2016, Porto, Portugal, September 5-8, 2016, Proceedings. Ч. I. Т. 9827.: Дордрехт, Лондон, Хайдельберг, Нью-Йорк, Хам: Springer, 2016.. P. 416-430.
During roughly the last seven years, an increase of interest in semantic parsing of instructions in natural language (NL) could be observed. The principal applications of developed algorithms are NL-interfaces for interaction with robots and the personages of videogames, navigation in virtual space, and for developing programs by means of NL. However, the known algorithms ...
Added: October 18, 2016
Association for Computational Linguistics, 2019
Added: September 15, 2020
Assessment of Dendritic Cell Therapy Effectiveness Based on the Feature Extraction from Scientific Publications
, , et al., , in: Proceedings of ICPRAM 2015 - 4th International Conference on Pattern Recognition Applications and Methods. Vol. 2.: SciTePress, 2015.. P. 270-276.
Dendritic cells (DCs) vaccination is a promising way to contend cancer metastases especially in the case of immunogenic tumors. Unfortunately, it is only rarely possible to achieve a satisfactory clinical outcome in the majority of patients treated with a particular DC vaccine. Apparently, DC vaccination can be successful with certain combinations of features of the ...
Added: November 20, 2015
, , , , in: Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH). .: Osaka: [б.и.], 2016.. P. 26-34.
We present an approach to detect differences in lexical semantics across English language registers, using word embedding models from distributional semantics paradigm. Models trained on register-specific subcorpora of the BNC corpus are employed to compare lists of nearest associates for particular words and draw conclusions about their semantic shifts depending on register in which they ...
Added: November 12, 2016
LIORI at SemEval-2021 Task 2: Span Prediction and Binary Classification approaches to Word-in-Context Disambiguation
, , et al., , in: Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021). .: Association for Computational Linguistics, 2021.. P. 780-786.
This paper presents our approaches to SemEval-2021 Task 2: Multilingual and Cross-lingual Word-in-Context Disambiguation task. The first approach attempted to reformulate the task as a question answering problem, while the second one framed it as a binary classification problem. Our best system, which is an ensemble of XLM-R based binary classifiers trained with data augmentation, ...
Added: September 23, 2021
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
P.: European Language Resources Association (ELRA), 2018
Book of abstracts from the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) ...
Added: May 5, 2018
, , et al., , in: The NeurIPS '18 Competition: From Machine Learning to Intelligent Conversations. .: Springer, 2020.. P. 295-315.
Added: February 20, 2021
Analysis of Images, Social Networks and Texts Third International Conference, AIST 2014, Yekaterinburg, Russia, April 10-12, 2014, Revised Selected Papers
Berlin: Springer, 2014
This book constitutes the proceedings of the Third International Conference on Analysis of Images, Social Networks and Texts, AIST 2014, held in Yekaterinburg, Russia, in April 2014. The 11 full and 10 short papers were carefully reviewed and selected from 74 submissions. They are presented together with 3 short industrial papers, 4 invited papers and ...
Added: November 13, 2014
, , , Scando-Slavica 2022 P. 1-20
In this paper we describe the difference between informal comments, posted on social networks, and internet journalistic style texts, which tend to be written in a Codified Literary Russian. We performed a quantitative analysis of more than0 graphic, morphological, syntactic features, and supplied statistically significant features with the linguistic interpretation. The article concluded that the ...
Added: October 31, 2021
Switzerland: Springer, 2015
This book constitutes the refereed proceedings of the 6th Conference on Knowledge Engineering and the Semantic Web, KESW 2015, held in Moscow, Russia, in September/October 2015. The 17 revised full papers presented together with 6 short system descriptions were carefully reviewed and selected from 35 submissions. The papers address research issues related to semantic web, ...
Added: September 16, 2015
This book constitutes the proceedings of the 19th Russian Conference on Artificial Intelligence, RCAI 2021, held in Moscow, Russia, in October 2021. The 19 full papers and 7 short papers presented in this volume were carefully reviewed and selected from 80 submissions. The conference deals with a wide range of topics, categorized into the following topical ...
Added: October 28, 2021
, , in: Artificial Intelligence and Natural Language, 7th International Conference, AINL 2018, St. Petersburg, Russia, October 17–19, 2018, Proceedings. Issue 930.: Switzerland: Springer, 2018.. P. 107-112.
Semantic information has been deemed a valuable resource for solving the task of coreference resolution by many researchers. Unfortunately, not much has been done in the direction of using this data when working with Russian data. This work describes the first step of a research, attempting to create a coreference resolution system for Russian based on semantic data, concerned with ...
Added: September 5, 2018
, , Chekhov's Gun Recognition / Cornell University. Series Computer Science "arxiv.org". 2021.
Chekhov's gun is a dramatic principle stating that every element in a story must be necessary, and irrelevant elements should be removed. This paper presents a new natural language processing task — Chekhov's gun recognition or (CGR) — recognition of entities that are pivotal for the development of the plot. Though similar to classical Named Entity Recognition ...
Added: December 3, 2021
, , , , in: Analysis of Images, Social Networks and Texts. 6th International Conference, 2017, Revised Selected Papers. Vol. 10716.: Cham: Springer, 2018.. Ch. 4. P. 34-46.
The paper deals with Google’s universal parser SyntaxNet. The system was used to analyze the Universal Dependencies linguistic corpora. We conducted an error analysis of the output of the parser to reveal to what extent the error types are connected with or preconditioned by the language types. In particular, we carried out several experiments, clustering ...
Added: December 1, 2017
, , et al., , in: 2021 International Conference Engineering and Telecommunication (En&T). .: IEEE, 2022..
Style transfer is an important and a rapidly developing of Natural Language Processing. This days more and more methods and models are proposed which allow us to generate text in predefined style. In this paper we propose a framework for style transfer of “Friends” TV series. The trained models are able to mimic one of ...
Added: May 21, 2022
, В кн.: Электронный бизнес. Управление интернет-проектами. Инновации: Сборник трудов участников студенческой научно-практической конференции, Москва, 12-14 марта 2013 г.. .: М.: НИУ ВШЭ, 2014.. С. 88-91.
The report deals with the methodology of building a system to perform search for specialists satisfying a defined set of competencies. The proposed search method is based on natural language texts analysis. ...
Added: July 11, 2015
, В кн.: Современные проблемы информатизации в анализе и синтезе технологических и программно-телекоммуникационных систем: Сборник трудов. Вып. 17.: Воронеж: Научная книга, 2012.. С. 264-266.
Added: November 7, 2012
, , et al., , in: Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021). .: Association for Computational Linguistics, 2021.. P. 1249-1254.
This work describes our approach for subtasks of SemEval-2021 Task 8: MeasEval: Counts and Measurements which took the official first place in the competition. To solve all subtasks we use multi-task learning in a question-answering-like manner. We also use learnable scalar weights to weight subtasks’ contribution to the final loss in multi-task training. We fine-tune ...
Added: September 23, 2021
Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог» (Москва, 29 мая — 1 июня 2019 г.)
М.: Издательский центр «Российский государственный гуманитарный университет», 2019
The book includes 64 papers submitted to the International conference in computer linguistics and intellectual technologies Dialogue 2019 and presents a broad spectrum of theoretical and applied research of natural language description, language simulation, and creation of applied computer technologies. ...
Added: October 16, 2019
, , Journal of Biomedical Informatics 2020 Vol. 103 P. 1-9
Relation extraction aims to discover relational facts about entity mentions from plain texts. In this work, we focus on clinical relation extraction; namely, given a medical record with mentions of drugs and their attributes, we identify relations between these entities. We propose a machine learning model with a novel set of knowledge-based and BioSentVec embedding ...
Added: October 28, 2020
Использование принципов «регуляторной гильотины» и методов вычислительного права для анализа требований к качеству высшего образования
, , , Вопросы государственного и муниципального управления 2022 № 1 С. 78-100
Now Russia is undergoing a reform of the control and supervisory activity of the “regulatory guillotine”, which is designed to signifi cantly reduce the number of mandatory requirements in the legislation, leaving only those that are necessary and should be controlled among them. In the presented article, the principles of this reform are applied to the Federal State Educational Standards (FSES). Russian legislation understands the quality of education ...
Added: April 6, 2022
SkoltechNLP at SemEval-2021 Task 2: Generating Cross-Lingual Training Data for the Word-in-Context Task
, , , , in: Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021). .: Association for Computational Linguistics, 2021.. P. 157-162.
In this paper, we present a system for the solution of the cross-lingual and multilingual word-in-context disambiguation task. Task organizers provided monolingual data in several languages, but no cross-lingual training data were available. To address the lack of the officially provided cross-lingual training data, we decided to generate such data ourselves. We describe a simple ...
Added: September 23, 2021
, , et al., , in: Proceedings of the 3rd Workshop on Neural Generation and Translation. .: Association for Computational Linguistics, 2019.. P. 128-137.
This paper focuses on latent representations that could effectively decompose different aspects of textual information. Using a framework of style transfer for texts, we propose several empirical methods to assess information decomposition quality. We validate these methods with several state-of-the-art textual style transfer methods. Higher quality of information decomposition corresponds to higher performance in terms ...
Added: January 7, 2021