DEDPUL: Difference-of-Estimated-Densities-based Positive-Unlabeled Learning

Dmitry Ivanov

doi:10.1109/ICMLA51294.2020.00128

Publications

?

DEDPUL: Difference-of-Estimated-Densities-based Positive-Unlabeled Learning

P. 782-790.

Dmitry Ivanov

Positive-Unlabeled (PU) learning is an analog to supervised binary classification for the case when only the positive
sample is clean, while the negative sample is contaminated with latent instances of positive class and hence can be considered as an unlabeled mixture. The objectives are to classify the unlabeled sample and train an unbiased positive-negative classifier, which generally requires to identify the mixing proportions of positives and negatives first. Recently, unbiased risk estimation framework has achieved state-of-the-art performance in PU learning. This approach, however, exhibits two major bottlenecks. First, the mixing proportions are assumed to be identified, i.e. known in the domain or estimated with additional methods. Second, the approach relies on the classifier being a neural network. In this paper, we propose DEDPUL, a method that solves PU Learning without the aforementioned issues. The mechanism behind DEDPUL is to apply a computationally cheap postprocessing procedure to the predictions of any classifier trained to distinguish positive and unlabeled data. Instead of assuming the proportions to be identified, DEDPUL estimates them alongside with classifying unlabeled sample. Experiments show that DEDPUL
outperforms the current state-of-the-art in both proportion estimation and PU Classification and is flexible in the choice of the classifier.

Keywords: density estimation Semi-supervised learning Positive-Unlabeled Classification Mixture Proportions Estimation

Publication based on the results of:

Allocation and social choice mechanisms: axioms, incentives, algorithms (2020)

In book

2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA 2020)

Miami : IEEE, 2020

2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA 2020)

Miami : IEEE, 2020

Added: October 15, 2020

Density deconvolution under general assumptions on the distribution of measurement errors

Belomestny D., Goldenshluger A., Annals of Statistics 2021 Vol. 49 No. 2 P. 615-649

In this paper we study the problem of density deconvolution under general assumptions on the measurement error distribution. Typically deconvolution estimators are constructed using Fourier transform techniques, and it is assumed that the characteristic function of the measurement errors does not have zeros on the real line. This assumption is rather strong and is not fulfilled in many cases of interest. ...

Added: April 7, 2020

Refining the ONCE Benchmark With Hyperparameter Tuning

Maksim Golyadkin, Alexander Gambashidze, Nurgaliev I. et al., IEEE Access 2024 Vol. 12 P. 3805-3814

In response to the growing demand for 3D object detection in applications such as autonomous driving, robotics, and augmented reality, this work focuses on the evaluation of semi-supervised learning approaches for point cloud data. The point cloud representation provides reliable and consistent observations regardless of lighting conditions, thanks to advances in LiDAR sensors. Data annotation ...

Added: March 13, 2024

In-sample forecasting of local linear survival densities

Mammen E., Hiabu M., Mart ́ınez Miranda M. D. et al., Biometrika 2016

In this paper, in-sample forecasting is defined as forecasting a structured density to sets where it is unobserved. The structured density consists of one-dimensional in-sample components that identify the density on such sets. We focus on the multiplicative density structure, which has recently been seen as the underlying structure of non-life insurance forecasts. In non-life ...

Added: October 12, 2016

Asymptotics for in-sample density forecasting

Lee Y. K., Mammen E., Nielsen J. et al., Annals of Statistics 2015 No. 43 P. 620-645

This paper generalizes recent proposals of density forecasting models and it develops theory for this class of models. In density forecasting, the density of observations is estimated in regions where the density is not observed. Identification of the density in such regions is guaranteed by structural assumptions on the density that allows exact extrapolation. In ...

Added: December 12, 2014

Об оценке плотности распределения с помощью ряда Фурье

Belomestny D., Iosipoi L., Управление большими системами: сборник трудов 2019 № 82 С. 28-43

In this paper, we consider the classical statistical problem of probability density estimation based on a sample from this distribution. This problem naturally arises in many applications when one aims at investigation of a probability structure in a random process. For instance, it is possible to identify some structure in a complex system using density ...

Added: October 21, 2019

Sobolev-Hermite versus Sobolev nonparametric density estimation on R

Belomestny D., Comte F., Genon-Catalot V., Annals of the Institute of Statistical Mathematics 2019 Vol. 71 No. 1 P. 29-62

In this spaper, our aim is to revisit the nonparametric estimation of a square integrable density f on R, by using projection estimators on a Hermite basis. These estimators are studied from the point of view of their mean integrated squared error on R. A model selection method is described and proved to perform an ...

Added: May 5, 2018

Social media mining for ideation: Identification of sustainable solutions and opinions

Ozcan S., Suloglu M., Sakar C. O. et al., Technovation 2021 Vol. 107 No. September 2021 P. 1-12

The availability of social media-based data creates opportunities to obtain information about consumers, trends, companies and technologies using text mining techniques. However, the quality of the data is a significant concern for social media-based analyses. The aim of this study was to mine tweets (microblogs) to explore trends and retrieve ideas for various purposes such ...

Added: December 12, 2021

Generalized Post–Widder inversion formula with application to statistics

Belomestny D., Mai H., Schoenmakers J., Journal of Mathematical Analysis and Applications 2017 No. 455 P. 89-104

In this work we derive an inversion formula for the Laplace transform of a density observed on a curve in the complex domain, which generalizes the well known Post– Widder formula. We establish convergence of our inversion method and derive the corresponding convergence rates for the case of a Laplace transform of a smooth density. ...

Added: September 22, 2017

Semi-Conditional Normalizing Flows for Semi-Supervised Learning

Atanov A., Volokhova A., Ashukha A. et al., Workshop on Invertible Neural Nets and Normalizing Flows, International Conference on Machine Learning 2019 P. 1-9

This paper proposes a semi-conditional normalizing flow model for semi-supervised learning. The model uses both labeled and unlabeled data to learn an explicit model of joint distribution over objects and labels. Semi-conditional architecture of the model allows us to efficiently compute a value and gradients of the marginal likelihood for unlabeled objects. The conditional part ...

Added: July 11, 2019

A Parametrix Approach for some Degenerate Stable Driven SDEs

Lorik Huang, Menozzi S., Annales de l’Institut Henri Poincaré 2016 Vol. 52 No. 4 P. 1925-1975

We consider a stable driven degenerate stochastic differential equation, whose coefficients satisfy a kind of weak Hörmander condition. Under mild smoothness assumptions we prove the uniqueness of the martingale problem for the associated generator under some dimension constraints. Also, when the driving noise is scalar and tempered, we establish density bounds reflecting the multi-scale behavior ...

Added: October 14, 2015

On the prediction loss of the lasso in the partially labeled setting

Bellec P., Dalalyan A., Grappin E. et al., Electronic journal of statistics 2018 Vol. 12 No. 2 P. 3443-3472

In this paper we revisit the risk bounds of the lasso estimator in the context of transductive and semi-supervised learning. In other terms, the setting under consideration is that of regression with random design under partial labeling. The main goal is to obtain user-friendly bounds on the off-sample prediction risk. To this end, the simple ...

Added: November 9, 2018