Breaking Sticks and Ambiguities with Adaptive Skip-gram

S. Bartunov; A. Osokin; D. Vetrov

?

Breaking Sticks and Ambiguities with Adaptive Skip-gram

2015.

Bartunov S., Кондрашкин Д. А., Osokin A., Vetrov D.

Recently proposed Skip-gram model is a powerful method for learning high-dimensional word representations that capture rich semantic relationships between words. However, Skip-gram as well as most prior work on learning word representations does not take into account word ambiguity and maintain only single representation per word. Although a number of Skip-gram modifications were proposed to overcome this limitation and learn multi-prototype word representations, they either require a known number of word meanings or learn them using greedy heuristic approaches. In this paper we propose the Adaptive Skip-gram model which is a nonparametric Bayesian extension of Skip-gram capable to automatically learn the required number of representations for all words at desired semantic resolution. We derive efficient online variational learning algorithm for the model and empirically demonstrate its efficiency on word-sense induction task.

Research target: Computer Science

Priority areas: IT and mathematics mathematics

Language: English

Text on another site

Keywords: машинное обучение natural language processing text mining machine learning обработка текстов на естественном языке Representation learning bayesian nonparametric methods непараметрические байесовские методы bayesian nonparametrics обучение представлений

Analysis of Images, Social Networks and Texts Third International Conference, AIST 2014, Yekaterinburg, Russia, April 10-12, 2014, Revised Selected Papers

Berlin: Springer, 2014.

This book constitutes the proceedings of the Third International Conference on Analysis of Images, Social Networks and Texts, AIST 2014, held in Yekaterinburg, Russia, in April 2014. The 11 full and 10 short papers were carefully reviewed and selected from 74 submissions. They are presented together with 3 short industrial papers, 4 invited papers and ...

Added: November 13, 2014

Texterra: инфраструктура для анализа текстов

Денис Турдаков, Астраханцев Н. А., Недумов Я. Р. et al., Труды Института системного программирования РАН 2014 Т. 26 С. 421–438

he paper presents a framework for fast text analytics developed during the Texterra project. Texterra is a technology for multilingual text mining based on novel text processing methods that exploit knowledge extracted from user-generated content. It delivers a fast scalable solution for text mining without the expensive customization. Depending on use-cases Texterra could be utilized ...

Added: November 6, 2017

Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning

Berlin: Association for Computational Linguistics, 2016.

The 2016 Conference on Computational Natural Language Learning is the twentieth in the series of annual meetings organized by SIGNLL, the ACL special interest group on natural language learning. CoNLL 2016 will be held on August 11-12, 2016, and is co-located with the 54th annual meeting of the Association for Computational Linguistics (ACL) in Berlin, ...

Added: November 12, 2016

Тезисы докладов 12-й Международной конференции Интеллектуализация обработки информации

М.: Торус Пресс, 2018.

The volume contains the abstracts of the 12th International Conference "Intelligent Data Processing: Theory and Applications". The conference is organized by the Russian Academy of Sciences, the Federal Research Center "Informatics and Control" of the Russian Academy of Sciences and the Scientific and Coordination Center "Digital Methods of Data Mining". The conference has being held biennially since 1989. It is one ...

Added: October 9, 2018

Intelligent Data Processing 11th International Conference, IDP 2016, Barcelona, Spain, October 10–14, 2016, Revised Selected Papers

Switzerland: Springer, 2019.

This book constitutes the refereed proceedings of the 11th International Conference on Intelligent Data Processing, IDP 2016, held in Barcelona, Spain, in October 2016. The 11 revised full papers were carefully reviewed and selected from 52 submissions. The papers of this volume are organized in topical sections on machine learning theory with applications; intelligent data processing in life ...

Added: February 8, 2020

Тезисы докладов 11-й конференции Интеллектуализация обработки информации

М.: Торус Пресс, 2016.

This proceedings contains the abstracts of papers accepted to IDP-11 ...

Added: November 12, 2016

Математические основы машинного обучения и прогнозирования

V'yugin V., М.: МЦНМО, 2013.

Книга предназначена для первоначлаьного знакомства с математическими основами современной теории машинного обучения (Machine Learning) и теории игр на предсказания. В первой части излагаются основы статистической теории машинного обучения, рассматриваются задачи классификации и регрессии с опорными векторами, теория обобщения и алгоритмы построения разделяющих гиперплоскостей. Во второй и третьей частях рассматриваются задачи адаптивного прогнозирования в нестохастических теоретико-игровой ...

Added: July 9, 2014

Analysis of Images, Social Networks and Texts. 4th International Conference, AIST 2015, Yekaterinburg, Russia, April 9–11, 2015, Revised Selected Papers

Switzerland: Springer, 2015.

This book constitutes the proceedings of the Fourth International Conference on Analysis of Images, Social Networks and Texts, AIST 2015, held in Yekaterinburg, Russia, in April 2015. The 24 full and 8 short papers were carefully reviewed and selected from 140 submissions. The papers are organized in topical sections on analysis of images and videos; ...

Added: October 12, 2015

Proceedings of the Fifth Workshop on Experimental Economics and Machine Learning at the National Research University Higher School of Economics co-located with the Seventh International Conference on Applied Research in Economics (iCare7)

Aachen: CEUR Workshop Proceedings, 2019.

Workshop concentrates on an interdisciplinary approach to modelling human behavior incorporating data mining and expert knowledge from behavioral sciences. Data analysis results extracted from clean data of laboratory experiments will be compared with noisy industrial datasets from the web e.g. Insights from behavioral sciences will help data scientists. Behavior scientists will see new inspirations to ...

Added: November 19, 2019

Proceedings of the 2016 Future Technologies Conference

IEEE, 2017.

Added: November 20, 2017

Concept Learning from Triadic Data

Zhuk R., Ignatov D. I., Konstantinova N., Procedia Computer Science 2014 Vol. 31 P. 928–938

We propose extensions of the classical JSM-method and the Na ̈ıve Bayesian classifier for the case of triadic relational data. We performed a series of experiments on various types of data (both real and synthetic) to estimate quality of classification techniques and compare them with other classification algorithms that generate hypotheses, e.g. ID3 and Random ...

Added: June 9, 2014

Supplementary Proceedings ICFCA 2019 Conference and Workshops

CEUR Workshop Proceedings, 2019.

Added: October 31, 2019

8th Russian Summer School in Information Retrieval (RuSSIR 2014)

Braslavski P., Karpov Nikolay, Worring M. et al., ACM SIGIR Forum 2014 Vol. 48 No. 2 P. 105–110

The 8th Russian Summer School in Information Retrieval (RuSSIR 2014) was held on August 18-22, 2014 in Nizhniy Novgorod, Russia.1 The school was co-organized by the National Research University Higher School of Economics2 and the Russian Information Retrieval Evaluation Seminar (ROMIP) ...

Added: August 22, 2015

14th International Conference on Formal Concept Analysis - Supplementary Proceedings

University Rennes 1, 2017.

This volume is the supplementary volume of the 14th International Conference on Formal Concept Analysis (ICFCA 2017), held from June 13th to 16th 2017, at IRISA, Rennes. The ICFCA conference series is one of the major venues for researches from the field of Formal Concept Analysis and related areas to present and discuss their recent ...

Added: June 19, 2017

Analysis of Images, Social Networks and Texts. 5th International Conference, AIST 2016, Yekaterinburg, Russia, April 7-9, 2016, Revised Selected Papers. Communications in Computer and Information Science

Switzerland: Springer, 2017.

This book constitutes the proceedings of the 5th International Conference on Analysis of Images, Social Networks and Texts, AIST 2016, held in Yekaterinburg, Russia, in April 2016. The 23 full papers, 7 short papers, and 3 industrial papers were carefully reviewed and selected from 142 submissions. The papers are organized in topical sections on machine ...

Added: October 19, 2016

Использование метода главных компонент для анализа надежности цепей поставок

Kuznetsov V. O., Логистика и управление цепями поставок 2018 № 4 (87) С. 27–33

One of the options for a more flexible approach to analyzing the reliability of supply chains is the principal component analysis (PCA). With a large number of variables describing supply chain, it is a difficult task to analyze the structure of variables in two-dimensional space. Within the analysis of the variables dependencies PCA allows to ...

Added: November 29, 2018

Supplementary Proceedings of the 3rd International Conference on Analysis of Images, Social Networks and Texts (AIST 2014)

Ekaterinburg: CEUR Workshop Proceedings, 2014.

AIST'2014 is an international data science conference on Analysis of Images, Social Networks, and Texts. Traditionally, the conference is held annually in Yekaterinburg, Russia. The conference is intended for computer scientists and practitioners whose research interests involve Internet mathematics and other related fields of data science. LIST OF TOPICS (NON EXHAUSTIVE) Applications of Data Mining and Machine ...

Added: August 28, 2014

Методы построения социо-демографических профилей пользователей сети Интернет

С.Д. Кузнецов, Гомзин А. Г., Труды Института системного программирования РАН 2015 Т. 27 № 4 С. 129–144

he paper is devoted to methods for construction of socio-demographic profile of Internet users. Gender, age, political and religion views, region, relationship status are examples of demographic attributes. This work is a survey of methods that detect demographic attributes from user’s profile and messages. The most of surveyed works are devoted to gender detection. Age, ...

Added: January 23, 2018

Learning Representations in Directed Networks

Bartunov S., Ivanov O., , in: Analysis of Images, Social Networks and Texts. 4th International Conference, AIST 2015, Yekaterinburg, Russia, April 9–11, 2015, Revised Selected PapersVol. 542: Series: Communications in Computer and Information Science.: Switzerland: Springer, 2015.

We propose a probabilistic model for learning continuous vector representations of nodes in directed networks. These representations could be used as high quality features describing nodes in a graph and implicitly encoding global network structure. The usefulness of the representations is demonstrated on link prediction and graph visualization tasks. Using representations learned by our method ...

Added: November 5, 2015

Lecture Notes in Computer Science

Springer, 2011.

Added: March 14, 2016

Применение методов машинного обучения для прогноза или замещения недостающих каротажных данных

Ахметсафин Р. Д., Akhmetsafina R., Известия высших учебных заведений. Приборостроение 2021 Т. 64 № 7 С. 532–541

Nine machine learning methods (ANN, ANFIS, ELM, FM, SVM, GPR, RF, RT, k-NN) are compared using the example of predicting acoustic logging data. With machine learning, the solution to the regression problem can be used not only for predicting geophysical fields, but also for filing in missing data. The constructed curve T(Р) of the P-wave ...

Added: July 25, 2021

Alexander Kotov, Elena Treshcheva, Leonid Bessonov, Dmitry I. Ignatov, Yana Volkovich, Maria Eskevich, Pavel Braslavski: 10th Russian Summer School in Information Retrieval (RuSSIR 2016)

Kotov A., Treshcheva E., Bessonov L. et al., SIGIR Forum (ACM Special Interest Group on Information Retrieval) 2016 Vol. 50 No. 2 P. 28–35

This paper provides the reader with a report on 10th Russian Summer School in Information Retrieval (RuSSIR 2016). ...

Added: February 27, 2017

Faster variational inducing input Gaussian process classification

Izmailov P., Kropotov D., Journal of machine learning and data analysis 2017 Vol. 3 No. 1 P. 20–35

Background: Gaussian processes (GP) provide an elegant and effective approach to learning in kernel machines. This approach leads to a highly interpretable model and allows using the Bayesian framework for model adaptation and incorporating the prior knowledge about the problem. The GP framework is successfully applied to regression, classification, and dimensionality reduction problems. Unfortunately, the ...

Added: December 6, 2018

Machine Learning approach to γ/π0 separation in the LHCb calorimeter

Ratnikov F., Viktoria Chekalina, Journal of Physics: Conference Series 2018 Vol. 1085 P. 1–5

Reconstruction and identification of particles in calorimeters of modern High Energy Physics experiments is a complicated task. Solutions are usually driven by a priori knowledge about expected properties of reconstructed objects. Such an approach is also used to distinguish single photons in the electromagnetic calorimeter of the LHCb detector at the LHC from overlapping photons ...

Added: October 18, 2018