NAS-Bench-NLP: Neural Architecture Search Benchmark for Natural Language Processing

arXiv, 2020.

Klyuchnikov N., Trofimov I., Artemova E., Salnikov M., Fedorov M., Burnaev E. V.

Neural Architecture Search (NAS) is a promising and rapidly evolving research area. Training a large number of neural networks requires an exceptional amount of computational power, which puts NAS out of reach for researchers who have limited or no access to high-performance clusters and supercomputers. A few benchmarks with precomputed performance of neural architectures have recently been introduced to overcome this problem and ensure more reproducible experiments. However, these benchmarks cover only the computer vision domain and are therefore built from image datasets and convolution-derived architectures. In this work, we step outside the computer vision domain by leveraging the language modeling task, which is at the core of natural language processing (NLP). Our main contributions are as follows: we provide a search space of recurrent neural networks on text datasets and train 14k architectures within it; we conduct both intrinsic and extrinsic evaluation of the trained models using datasets for semantic relatedness and language understanding evaluation; finally, we test several NAS algorithms to demonstrate how the precomputed results can be utilized. We believe that our results have high potential for use by both the NAS and NLP communities.

Priority areas: IT and mathematics

Language: English

Publication based on the results of:

Development of Mathematical Models and Methods for Recommender Systems and Natural Language Processing (2020)

Computational Linguistics and Intellectual Technologies: Proceedings of the Annual International Conference "Dialogue" (Moscow, May 29 - June 1, 2019). Issue 18 (25)

Moscow: Russian State University for the Humanities Publishing Center, 2019

The collection includes 27 papers from the international conference on computational linguistics and intellectual technologies "Dialogue 2019" that were not included in the yearbook "Computational Linguistics and Intellectual Technologies" but were recommended by the Program Committee for presentation at the conference. It is intended for specialists in theoretical and applied linguistics and intellectual technologies. ...

Added: December 10, 2019

A Parallel Algorithm to Detect Structural Breaks in Time Series

Furmanov K. K., Nikol'skii I. M., Computational Mathematics and Modeling 2016 Vol. 27 No. 2 P. 247-253

Added: December 22, 2016

Measurement of the Drell-Yan triple-differential cross section in pp collisions at $\sqrt{s} = 8$ TeV

Sapronov A. A., Aaboud M., Journal of High Energy Physics 2017 Vol. 2017 No. 12 P. 1-78

This paper presents a measurement of the triple-differential cross section for the Drell-Yan process $Z/\gamma^* \to \ell^+\ell^-$, where $\ell$ is an electron or a muon. The measurement is performed for invariant masses of the lepton pairs, $m_{\ell\ell}$, between 46 and 200 GeV using a sample of 20.2 fb$^{-1}$ of pp collision data at a centre-of-mass energy of $\sqrt{s} = 8$ TeV collected by the ATLAS detector at the LHC ...

Added: February 26, 2018

Representational dissimilarity component analysis (ReDisCA)

Ossadtchi A., Semenkov I., Zhuravleva A. et al., / Cold Spring Harbor Laboratory. Series http://dx.doi.org/ "BioRxiv". 2024.

Added: March 15, 2024

Proceedings of the NI Academic Days 2017 Conference, Moscow, April 13-14, 2017

Moscow: National Instruments Russia, 2017

The collection comprises papers presenting previously unpublished results of original research and technical solutions. We hope that the collection will be useful to specialists working in various fields of science and technology, to a wide range of teachers, graduate students, and university students, and to teachers at secondary schools and technical colleges. ...

Added: May 10, 2017

On the Choice of Software Tools for Cognitive Computer Visualization

Baibikova T., Domoratsky E., Vestnik Moskovskogo finansovo-yuridicheskogo universiteta 2017 No. 1 P. 200-206

This paper considers several questions of scientific visualization. It also discusses the peculiarities of applying cognitive computer graphics and singles out a range of scientific visualization tasks. The paper gives a brief overview of modern tools for program visualization, their development trends, and their main characteristics. A module ...

Added: June 10, 2017

Mathematical Analysis. Basic Concepts

Shagin V. L., Sokolov A. V., Moscow: Urait, 2016

This textbook covers the fundamentals of mathematical analysis. It explains in accessible form the origin and essence of the fundamental concepts on which the theory is built: limit, continuity, derivative, and integral. Methods for investigating functions and plotting graphs are examined in detail. The theoretical material is accompanied by illustrative examples and by numerous problems and questions that let readers assess how well they have mastered the material. The proposed textbook ...

Added: October 12, 2016

Complex forecasting scheme for surface meteorological values

Bagrov A. N., Gordin V. A., Bykov P. L., Russian Meteorology and Hydrology 2014 No. 5 P. 283-291

Evaluations of the forecasts of surface air temperature and precipitation for the period July 2010 - June 2013 are presented. Forecasts of surface air temperature for 5 days and of precipitation for 3 days are considered. Our complex statistical scheme uses the results of the best foreign global schemes and the regional scheme COSMO-RU7. The joint ...

Added: December 7, 2013

Sixth All-Russian Scientific and Practical Conference on Simulation Modeling and Its Application in Science and Industry "Simulation Modeling. Theory and Practice". Conference Materials. Collected Papers

Kazan: Fen Publishing House of the Academy of Sciences of the Republic of Tatarstan, 2013

Materials and papers of the Sixth All-Russian Scientific and Practical Conference on Simulation Modeling and Its Application in Science and Industry. ...

Added: December 14, 2013

"Aerospace Technologies" (AKT-2014): Abstracts of the First Round of the XV All-Russian Scientific and Technical Conference and School for Young Scientists, Graduate Students, and Students

Elist Firm LLC, 2014

The book presents abstracts of papers from the first round of the XV All-Russian Scientific and Technical Conference and School for Young Scientists, Graduate Students, and Students. ...

Added: October 17, 2014

Methodology for Self-Recovery of a Distributed System for Monitoring and Control of Technical Objects Based on Decision Theory Methods

Vishnekov A., Erokhin V., Ivanova E., Datchiki i Sistemy 2018 Vol. 221 No. 1 P. 18-24

The article discusses the recovery of distributed control systems for technical objects. Design options for recovery subsystems after a hardware or software fault in the sensor system are investigated. The development of an integrated recovery subsystem is proposed on the basis of a decision-making system to develop the most rational control actions by ...

Added: January 27, 2018

Measures of uncertainty in market network analysis

Kalyagin V.A., Koldanov A.P., Koldanov P.A. et al., Physica A: Statistical Mechanics and its Applications 2014 Vol. 413 No. 1 P. 59-70

A general approach to measure statistical uncertainty of different filtration techniques for market network analysis is proposed. Two measures of statistical uncertainty are introduced and discussed. One is based on conditional risk for multiple decision statistical procedures and another one is based on average fraction of errors. It is shown that for some important cases ...

Added: July 19, 2014

Database on the Bandgap of Inorganic Substances and Materials

Kiselyova N. N., Dudarev V.A., Korzhuev M. A., Inorganic Materials: Applied Research 2016 Vol. 7 No. 1 P. 34-39

A database (DB) on the bandgap of inorganic substances available via the Internet (http://bg.imetdb.ru) was developed for the information service of specialists in the sphere of inorganic chemistry and materials science. The DB is integrated with other information systems on the properties of inorganic substances and materials, which provides the search of a wide range ...

Added: February 23, 2016

Pre-experiments on Annotation of Russian Coreference Corpus

Toldova S., Azerkovich I., Grishina Yu. et al., / NRU HSE. Series WP BRP "Linguistics". 2015.

Building benchmark corpora in the domain of coreference and anaphora resolution is an important task for developing and evaluating NLP systems and models. Our study is aimed at assessing the feasibility of enhancing corpora with information about coreference relations. The annotation procedure includes identification of text segments that are subject to annotation (markables), marking their ...

Added: December 15, 2015

Sheath parameters for non-Debye plasmas: Simulations and arc damage

Morozov I., Norman G. E., Insepov Z. et al., Physical Review Special Topics - Accelerators and Beams 2012 Vol. 15 P. 053501

This paper describes the surface environment of the dense plasma arcs that damage rf accelerators, tokamaks, and other high gradient structures. We simulate the dense, nonideal plasma sheath near a metallic surface using molecular dynamics (MD) to evaluate sheaths in the non-Debye region for high density, low temperature plasmas. We use direct two-component MD simulations ...

Added: October 28, 2013

Increasing the performance of a Mobile Ad-hoc Network using a game-theoretic approach to drone positioning

Blakeway S., Gromov D., Gromova E. et al., Vestnik Sankt-Peterburgskogo Universiteta, Prikladnaya Matematika, Informatika, Protsessy Upravleniya 2019 Vol. 15 No. 1 P. 22-38

We describe a novel game-theoretic formulation of the optimal mobile agents’ placement problem which arises in the context of Mobile Ad-hoc Networks (MANETs). This problem is modelled as a sequential multistage game. The definitions of both the Nash equilibrium and cooperative solution are given. A modification was proposed to ensure the existence of a Nash ...

Added: March 13, 2020

Proceedings of 2011 Fourth International Conference on Information Management, Innovation Management and Industrial Engineering (ICIII 2011). 26-27 November 2011, Shenzhen, China

Los Alamitos : IEEE CS Pre, 2011

Information Management, Innovation Management and Industrial Engineering are becoming increasingly interesting to both the academic researchers and management practitioners. It is essential to explore enterprise management system from the theoretical viewpoint; it is also absolutely essential to the survival, growth and prosperity of any company to have some means to manage innovation in the process ...

Added: July 19, 2012

On One-Dimensional Projections of Polytopes of Discrete Optimization Problems

Vyalyi M., Diskretnaya Matematika 1991 Vol. 3 No. 3 P. 35-45

Added: October 17, 2014

On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era

Shuranov E., / Cornell University. Series Computer Science "arxiv.org". 2021.

Text encodings from automatic speech recognition (ASR) transcripts and audio representations have shown promise in speech emotion recognition (SER) ever since. Yet, it is challenging to explain the effect of each information stream on the SER systems. Further, more clarification is required for analysing the impact of ASR's word error rate (WER) on linguistic emotion ...

Added: February 14, 2023

EDULEARN12 4th International Conference on Education and New Learning Technologies Publications

Barcelona : International Association of Technology, Education and Development, 2012

Added: September 11, 2012

Normal approximation and smoothness for sums of means of lattice-valued random variables

Decrouez G. G., Hall P., Bernoulli: a journal of mathematical statistics and probability 2013 Vol. 19 No. 4 P. 1268-1293

Motivated by a problem arising when analysing data from quarantine searches, we explore properties of distributions of sums of independent means of independent lattice-valued random variables. The aim is to determine the extent to which approximations to those sums require continuity corrections. We show that, in cases where there are only two different means, the ...

Added: September 29, 2014

Improving the Teaching of Mathematical Disciplines Based on Invariants Required for Teaching the Course "Econometrics" to Bachelor's Students in Economics

Kotelnikova M. V., Aistov A., Vestnik Nizhegorodskogo universiteta im. N.I. Lobachevskogo. Seriya: Sotsial'nye nauki 2019 Vol. 55 No. 3 P. 183-189

The article describes a method for improving the content of disciplines of the mathematical cycle by dividing them into invariant (general) and variable parts. The invariants were identified for such disciplines as «Linear algebra», «Mathematical analysis», and «Probability theory and mathematical statistics» taught to bachelor's students of economics at several universities. Based on ...

Added: January 28, 2020

Formation of Control Structures in Static Swarms

Karpov V. E., Karpova I. P., Procedia Engineering 2015 Vol. 100 P. 1459-1468

Work solutions are proposed for problems of leader definition and role distribution in homogeneous groups of robots. It is shown that transition from a swarm to a collective of robots with hierarchical organization is possible using exclusively local interaction. The local revoting algorithm is central to the procedure for choice of leader while redistribution of roles can ...

Added: March 14, 2015

Operating Systems. Textbook and Workbook

Gostev I. M., Moscow: Urait, 2016

Computer science is developing rapidly today. New versions of operating systems appear every eighteen months to two years, so it was decided to include in this book material that will not become outdated. The textbook covers some of the most general principles of operating system design, which were developed more than 50 years ago and have remained practically unchanged since. ...

Added: October 13, 2009