Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs

Mazur D.; Egiazarian V.; Morozov S.; A. Babenko

?

Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs

P. 1–11.

Mazur D., Egiazarian V., Morozov S., Babenko A.

Learning useful representations is a key ingredient to the success of modern ma- chine learning. Currently, representation learning mostly relies on embedding data into Euclidean space. However, recent work has shown that data in some domains is better modeled by non-euclidean metric spaces, and inappropriate geometry can result in inferior performance. In this paper, we aim to eliminate the inductive bias imposed by the embedding space geometry. Namely, we propose to map data into more general non-vector metric spaces: a weighted graph with a shortest path distance. By design, such graphs can model arbitrary geometry with a proper configuration of edges and weights. Our main contribution is PRODIGE: a method that learns a weighted graph representation of data end-to-end by gradient descent. Greater generality and fewer model assumptions make PRODIGE more powerful than existing embedding-based approaches. We confirm the superiority of our method via extensive experiments on a wide range of tasks, including classification, compression, and collaborative filtering.

Language: English

Full text

Text on another site

Keywords: Representation learning

In book

Advances in Neural Information Processing Systems 32 (NeurIPS 2019)

[б.и.], 2019.

Breaking Sticks and Ambiguities with Adaptive Skip-gram

Bartunov S., Кондрашкин Д. А., Osokin A. et al., / Series arXiv:1502.07257 "Computation and language". 2015.

Recently proposed Skip-gram model is a powerful method for learning high-dimensional word representations that capture rich semantic relationships between words. However, Skip-gram as well as most prior work on learning word representations does not take into account word ambiguity and maintain only single representation per word. Although a number of Skip-gram modifications were proposed to ...

Added: November 5, 2015

Manifold Learning in Data Mining Tasks

Kuleshov A. P., Bernstein A., , in: Machine Learning and Data Mining in Pattern RecognitionVol. 8556.: Springer, 2014. P. 119–133.

Many Data Mining tasks deal with data which are presented in high dimensional spaces, and the ‘curse of dimensionality’ phenomena is often an obstacle to the use of many methods for solving these tasks. To avoid these phenomena, various Representation learning algorithms are used as a first key step in solutions of these tasks to ...

Added: September 30, 2014