An adaptive multiclass nearest neighbor classifier

N. Puchkin; V. Spokoiny

doi:10.1051/ps/2019021

Publications

?

An adaptive multiclass nearest neighbor classifier

ESAIM: Probability and Statistics. 2020. Vol. 24. P. 69–99.

Puchkin N., Spokoiny V.

We consider a problem of multiclass classification, where the training sample Sn={(Xi,Yi)}ni=1 is generated from the model ℙ(Y=m|X=x)=ηm(x), 1≤m≤M, and η1(x),…,ηM(x) are unknown α-Holder continuous functions.Given a test point X, our goal is to predict its label. A widely used 𝗄-nearest-neighbors classifier constructs estimates of η1(X),…,ηM(X) and uses a plug-in rule for the prediction. However, it requires a proper choice of the smoothing parameter 𝗄, which may become tricky in some situations. In our solution, we fix several integers n1,…,nK, compute corresponding nk-nearest-neighbor estimates for each m and each nk and apply an aggregation procedure. We study an algorithm, which constructs a convex combination of these estimates such that the aggregated estimate behaves approximately as well as an oracle choice. We also provide a non-asymptotic analysis of the procedure, prove its adaptation to the unknown smoothness parameter α and to the margin and establish rates of convergence under mild assumptions.

Priority areas: IT and mathematics

Language: English

DOI

Text on another site

Keywords: aggregation procedures агрегация multiclass learning многоклассовая классификация

Publication based on the results of:

Uncertainty quantification in high-dimensional models (2020)

Manipulability of Aggregation Procedures in Impartial Anonymous Culture

Aleskerov F. T., Ivanov A., Karabekyan D. et al., Procedia Computer Science 2015 No. 55 P. 1250–1257

Aleskerov et al. [1] and [2] estimated the degree of manipulability for the case of multi-valued choice (without using any tie-breaking rule) and for Impartial Culture (IC). In our paper, we address the similar question for the multi-valued choice and for Impartial Anonymous Culture (IAC). We use Nitzan-Kelly's (NK) index to estimate the degree of manipulability, which is calculated ...

Added: December 4, 2015

A Parallel Algorithm to Detect Structural Breaks in Time Series

Furmanov K. K., Nikol'skii I. M., Computational Mathematics and Modeling 2016 Vol. 27 No. 2 P. 247–253

Added: December 22, 2016

Sheath parameters for non-Debye plasmas: Simulations and arc damage

Morozov I., Norman G. E., Insepov Z. et al., Physical Review Special Topics - Accelerators and Beams 2012 Vol. 15 P. 053501

This paper describes the surface environment of the dense plasma arcs that damage rf accelerators, tokamaks, and other high gradient structures. We simulate the dense, nonideal plasma sheath near a metallic surface using molecular dynamics (MD) to evaluate sheaths in the non-Debye region for high density, low temperature plasmas. We use direct two-component MD simulations ...

Added: October 28, 2013

Representational dissimilarity component analysis (ReDisCA)

Ossadtchi A., Semenkov I., Zhuravleva A. et al., / Cold Spring Harbor Laboratory. Series http://dx.doi.org/ "BioRxiv". 2024.

Added: March 15, 2024

Сборник трудов конференции NI Academic Days 2017, Москва 13-14 апреля 2017 г.

М.: National Instruments Russia, 2017.

Содержание сборника составляют доклады с результатами оригинальных исследований и технических решений, ранее не публиковавшиеся. Мы надеемся, что предлагаемый сборник окажется полезным для специалистов, работающих в различных областях науки и техники, для широкого круга преподавателей, аспирантов и студентов ВУЗов, а также для преподавателей средних школ и технических колледжей. ...

Added: May 10, 2017

О выборе программных средств когнитивной компьютерной визуализации

Baibikova T., Domoratsky E., Вестник Московского финансово-юридического университета 2017 № 1 С. 200–206

Some questions of scientific visualization are under consideration in this paper. This article also discusses the peculiarities of application of cognitive computer graphics, singles out a range of tasks of scientific visualization. The paper gives a brief overview of modern support tools for program visualization, tendencies of their development and their main characteristics. A module ...

Added: June 10, 2017

Математический анализ. Базовые понятия

Shagin V. L., Соколов А. В., М.: Юрайт, 2016.

Учебное пособие посвящено основам математического анализа. В нем в доход- чивой форме объясняется происхождение и существо фундаментальных понятий, на которых строится теория: предел, непрерывность, производная, интеграл; подробно рассматриваются методы исследования функций и построения графиков. Изложение теоретических вопросов сопровождается иллюстрирующими примерами, а также многочисленными задачами и вопросами, позволяющими оценить степень усвоения материала. Предлагаемое учебное пособие следует ...

Added: October 12, 2016

Complex forecasting scheme for surface meteorological values

Bagrov A. N., Gordin V. A., Bykov P. L., Russian Meteorology and Hydrology 2014 No. 5 P. 283–291

The evaluations of the forecasts of surface air temperature and precipitation for the period July 2010 - June 2013 are presented. The forecasting of surface air temperature at 5 days and precipitation at 3 days are considered. Our complex statistical scheme uses the results of the best foreign global schemes, regional scheme COSMO-RU7. The joint ...

Added: December 7, 2013

Шестая Всероссийская научно-практическая конференция по имитационному моделированию и его применению в науке и промышленности «Имитационное моделирование. Теория и практика» Материалы конференции. Сборник докладов

Каз.: Издательство «Фэн» Академии наук Республики Татарстан, 2013.

Материалы и доклады Шестой Всероссийской научно-практической конференции по имитацонному моделированию и его применению в науке и промышленности. ...

Added: December 14, 2013

"Авиакосмические технологии" (АКТ-2014): Тезисы I тура XV Всероссийской научно-технической конференции и школы молодых ученых, аспирантов и студентов

ООО Фирма "Элист", 2014.

В книге представлены тезисы докладов I тура XV Всероссийской научно-технической конференции и школы молодых ученых, аспирантов и студентов. ...

Added: October 17, 2014

Методика самовосстановления распределенной системы контроля и управления техническими объектами на основе методов теории принятия решений

Vishnekov A., Erokhin V., Ivanova E., Датчики и системы 2018 Т. 221 № 1 С. 18–24

The article discusses the recovery of distributed control systems of technical objects. The options for the design of subsystems recovery after hardware or software fault of the sensor system are investigated. The development of an integrated subsystems recovery is proposed on the basis of decision-making system to develop the most rational control actions by ...

Added: January 27, 2018

Measures of uncertainty in market network analysis

Kalyagin V.A., Koldanov A.P., Koldanov P.A. et al., Physica A: Statistical Mechanics and its Applications 2014 Vol. 413 No. 1 P. 59–70

A general approach to measure statistical uncertainty of different filtration techniques for market network analysis is proposed. Two measures of statistical uncertainty are introduced and discussed. One is based on conditional risk for multiple decision statistical procedures and another one is based on average fraction of errors. It is shown that for some important cases ...

Added: July 19, 2014

Database on the Bandgap of Inorganic Substances and Materials

Kiselyova N. N., Dudarev V.A., Korzhuev M. A., Inorganic Materials: Applied Research 2016 Vol. 7 No. 1 P. 34–39

A database (DB) on the bandgap of inorganic substances available via the Internet (http://bg.imetdb.ru) was developed for the information service of specialists in the sphere of inorganic chemistry and materials science. The DB is integrated with other information systems on the properties of inorganic substances and materials, which provides the search of a wide range ...

Added: February 23, 2016

Pre-experiments on Annotation of Russian Coreference Corpus

Toldova S., Azerkovich I., Гришина Ю. et al., / НИУ ВШЭ. Series WP BRP "Linguistics". 2015.

Building benchmark corpora in the domain of coreference and anaphora resolution is an important task for developing and evaluating NLP systems and models. Our study is aimed at assessing the feasibility of enhancing corpora with information about coreference relations. The annotation procedure includes identification of text segments that are subjects to annotation (markables), marking their ...

Added: December 15, 2015

Measurement of the Drell-Yan triple-differential cross section in pp collisions at $$ \sqrt{s}=8 $$ TeV

Сапронов А. А., Aaboud M., Journal of High Energy Physics 2017 Vol. 2017 No. 12 P. 1–78

This paper presents a measurement of the triple-differential cross section for the Drell-Yan process Z/γ * → ℓ + ℓ − where ℓ is an electron or a muon. The measurement is performed for invariant masses of the lepton pairs, m ℓℓ , between 46 and 200 GeV using a sample of 20.2 fb−1of pp collisions data at a centre-of-mass energy of s√=8s=8 TeV collected by the ATLAS detector at the LHC ...

Added: February 26, 2018

Increasing the performance of a Mobile Ad-hoc Network using a game-theoretic approach to drone positioning

Blakeway S., Gromov D., Gromova E. et al., Vestnik Sankt-Peterburgskogo Universiteta, Prikladnaya Matematika, Informatika, Protsessy Upravleniya 2019 Vol. 15 No. 1 P. 22–38

We describe a novel game-theoretic formulation of the optimal mobile agents’ placement problem which arises in the context of Mobile Ad-hoc Networks (MANETs). This problem is modelled as a sequential multistage game. The definitions of both the Nash equilibrium and cooperative solution are given. A modification was proposed to ensure the existence of a Nash ...

Added: March 13, 2020

Proceedings of 2011 Fourth International Conference on Information Management, Innovation Management and Industrial Engineering (ICIII 2011). 26-27 November 2011, Shenzhen, China

Los Alamitos: IEEE CS Pre, 2011.

Information Management, Innovation Management and Industrial Engineering are becoming increasingly interesting to both the academic researchers and management practitioners. It is essential to explore enterprise management system from the theoretical viewpoint; it is also absolutely essential to the survival, growth and prosperity of any company to have some means to manage innovation in the process ...

Added: July 19, 2012

Об одномерных проекциях многогранников задач дискретной оптимизации

Vyalyi M., Дискретная математика 1991 Т. 3 № 3 С. 35–45

Added: October 17, 2014

On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era

Shuranov E., / Cornell University. Series Computer Science "arxiv.org". 2021.

Text encodings from automatic speech recognition (ASR) transcripts and audio representations have shown promise in speech emotion recognition (SER) ever since. Yet, it is challenging to explain the effect of each information stream on the SER systems. Further, more clarification is required for analysing the impact of ASR's word error rate (WER) on linguistic emotion ...

Added: February 14, 2023

EDULEARN12 4th International Conference on Education and New Learning Technologies Publications

Barcelona: International Association of Technology, Education and Development , 2012.

Added: September 11, 2012

Normal approximation and smoothness for sums of means of lattice-valued random variables

Decrouez G. G., Hall P., Bernoulli: a journal of mathematical statistics and probability 2013 Vol. 19 No. 4 P. 1268–1293

Motivated by a problem arising when analysing data from quarantine searches, we explore properties of distributions of sums of independent means of independent lattice-valued random variables. The aim is to determine the extent to which approximations to those sums require continuity corrections. We show that, in cases where there are only two different means, the ...

Added: September 29, 2014

Совершенствование преподавания дисциплин математического цикла на основе инвариантов, необходимых для преподавания курса «Эконометрика» экономистам-бакалаврам

Kotelnikova M. V., Aistov A., Вестник Нижегородского университета им. Н.И. Лобачевского. Серия: Социальные науки 2019 Т. 55 № 3 С. 183–189

The article describes a method that allows to improve the content of disciplines of the mathematical cycle by dividing them into invariant (general) and variable parts. The invariants were identified for such disciplines as «Linear algebra», «Mathematical analysis», «Probability theory and mathematical statistics» delivered to Bachelors program students of economics at several universities. Based on ...

Added: January 28, 2020

Formation of Control Structures in Static Swarms

Karpov V. E., Karpova I. P., Procedia Engineering 2015 Vol. 100 P. 1459–1468

Work solutions are proposed for problems of leader definition and role distribution in homogeneous groups of robots. It is shown that transition from a swarm to a collective of robots with hierarchical organization is possible using exclusively local interaction. The local revoting algorithm is central to the procedure for choice of leader while redistribution of roles can ...

Added: March 14, 2015

Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог» (Москва, 29 мая — 1 июня 2019 г.). Вып. 18 (25)

М.: Издательский центр «Российский государственный гуманитарный университет», 2019.

Сборник включает 27 докладов международной конференции по компьютерной лингвистике и интеллектуальным технологиям «Диалог 2019», не вошедшие в ежегодник «Компьютерная лингвистика и интеллектуальные технологии», но рекомендованные Программным Комитетом к представлению на конференции. Для специалистов в области теоретической и прикладной лингвистики и интеллектуальных технологий. ...

Added: December 10, 2019