Learning Word Embeddings without Context Vectors

A. Zobnin; Elistratova E.

doi:10.18653/v1/W19-4329

P. 244–249.

Most word embedding algorithms, such as word2vec or fastText, construct two sorts of vectors: one for words and one for contexts. Naive use of vectors of only one sort leads to poor results. We suggest using an indefinite inner product in the skip-gram negative sampling (SGNS) algorithm. This allows us to use only one sort of vectors without loss of quality. Our “context-free” cf algorithm performs on par with SGNS on word similarity datasets.
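The idea in the abstract can be illustrated with a minimal sketch. It assumes the indefinite inner product has the form sum of s_i * u_i * v_i with fixed signs s_i in {+1, -1}; the signature used here (two positive and two negative coordinates), the toy vectors, and the function names are hypothetical and are not taken from the paper. The sketch shows that a single vector table can yield a negative "self-product", which an ordinary dot product cannot, and how such a product plugs into a negative-sampling loss.

```python
import math


def indefinite_dot(u, v, signs):
    """Indefinite inner product: sum of s_i * u_i * v_i with s_i in {+1, -1}."""
    return sum(s * a * b for s, a, b in zip(signs, u, v))


def log_sigmoid(x):
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def sgns_pair_loss(word, context, negatives, signs):
    """Negative-sampling loss for one (word, context) pair.

    All vectors are assumed to come from the same embedding table;
    there is no separate table of context vectors.
    """
    loss = -log_sigmoid(indefinite_dot(word, context, signs))
    for neg in negatives:
        loss -= log_sigmoid(-indefinite_dot(word, neg, signs))
    return loss


# Hypothetical signature: first two coordinates positive, last two negative.
signs = [1, 1, -1, -1]
w = [0.5, -0.2, 0.9, 0.1]   # toy vector for the target word
c = [0.4, 0.3, -0.1, 0.2]   # toy vector for an observed context word
n = [-0.3, 0.8, 0.5, -0.6]  # toy vector for a sampled negative word

# With an ordinary dot product this value could never be negative.
self_product = indefinite_dot(w, w, signs)
loss = sgns_pair_loss(w, c, [n], signs)
```

With all signs set to +1 the function reduces to the ordinary dot product, which is the "naive" single-table setup that the abstract reports as performing poorly.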

Keywords: SVD word2vec word embeddings

In book

Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019)

Issue W19-43. Association for Computational Linguistics, 2019.

Improving Distributional Semantic Models Using Anaphora Resolution during Linguistic Preprocessing

Kutuzov A. B., Kozlova O. S., in: Computational Linguistics and Intellectual Technologies: Proceedings of the Annual International Conference "Dialogue" (Moscow, July 1–4, 2016). Issue 15. Moscow: RSUH Publishing House, 2016. P. 288–300.

In natural language processing, distributional semantic models are known as an efficient data-driven approach to word and text representation, which allows computing meaning directly from large text corpora into word embeddings in a vector space. This paper addresses the role of linguistic preprocessing in enhancing the performance of distributional models, and particularly studies pronominal anaphora ...

Added: November 12, 2016

Rotations and Interpretability of Word Embeddings: The Case of the Russian Language

Zobnin A., in: Analysis of Images, Social Networks and Texts. 6th International Conference, 2017, Revised Selected Papers. Vol. 10716. Cham: Springer, 2018. Ch. 11 P. 116–128.

Consider a continuous word embedding model. Usually, the cosines between word vectors are used as a measure of similarity of words. These cosines do not change under orthogonal transformations of the embedding space. We demonstrate that, using some canonical orthogonal transformations from SVD, it is possible both to increase the meaning of some components and ...
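The invariance claim in this abstract can be checked numerically. The sketch below is only an illustration with made-up two-dimensional vectors and an arbitrary rotation angle, not the SVD-based transformations the paper describes: it rotates both vectors by the same orthogonal transformation and compares the cosine before and after.

```python
import math


def cosine(u, v):
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.hypot(*u) * math.hypot(*v))


def rotate2d(v, angle):
    """Apply a 2D rotation (an orthogonal transformation) to v."""
    c, s = math.cos(angle), math.sin(angle)
    return [c * v[0] - s * v[1], s * v[0] + c * v[1]]


# Toy "word vectors" and an arbitrary rotation angle (both hypothetical).
u = [1.0, 2.0]
v = [3.0, -1.0]
angle = 0.7

before = cosine(u, v)
after = cosine(rotate2d(u, angle), rotate2d(v, angle))
```

The individual coordinates change under the rotation while the cosine does not, which is why a rotation can be chosen freely to make components more interpretable without affecting similarity scores.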

Added: November 26, 2017

An Unsupervised Method for Weighting Finite-state Morphological Analyzers

Tyers F. M., Keleg A., Pirinen T., in: Proceedings of The 12th Language Resources and Evaluation Conference. Vol. 12. European Language Resources Association (ELRA), 2020. P. 3842–3850.

Morphological analysis is one of the tasks that have been studied for years. Different techniques have been used to develop models for performing morphological analysis. Models based on finite state transducers have proved to be more suitable for languages with low available resources. In this paper, we have developed a method for weighting a morphological ...

Added: April 20, 2021

Data-driven models and computational tools for neurolinguistics: a language technology perspective

Ekaterina Artemova, Bakarov A., Artemov A. et al., Journal of Cognitive Science 2020 Vol. 1 No. 21 P. 15–52

In this paper, our focus is the connection and influence of language technologies on the research in neurolinguistics. We present a review of brain imaging-based neurolinguistics studies with a focus on natural language representations, such as word embeddings and pre-trained language models. Mutual enrichment of neurolinguistics and language technologies leads to development of brain-aware natural ...

Added: January 17, 2020

Identifying emerging trends and hot topics through intelligent data mining: the case of clinical psychology and psychotherapy

Sokolova A., Lobanova P., Kuzminov I., Foresight 2024 Vol. 26 No. 1 P. 155–180

Purpose: The purpose of the paper is to present an integrated methodology for identifying trends in a particular subject area based on a combination of advanced text mining and expert methods. The authors aim to test it in an area of clinical psychology and psychotherapy in 2010–2019. Design/methodology/approach: The authors demonstrate the way of applying text-mining and the ...

Added: October 12, 2023

WORD VECTOR MODELS AS AN OBJECT OF LINGUISTIC RESEARCH

Shavrina T., in: Computational Linguistics and Intellectual Technologies: Proceedings of the Annual International Conference "Dialogue" (Moscow, May 29 – June 1, 2019). Issue 18(25). [s.n.], 2019. P. 576–588.

This article launches a series of studies in which popular vector word2vec models are considered not as an element of the architecture of an NLP application, but as an independent object of linguistic research. The linguist's view on the surrogate of contexts on the corpus, as which vector models can be considered, makes it possible ...

Added: September 5, 2019

Exploration of register-dependent lexical semantics using word embeddings

Kutuzov A. B., Kuzmenko E., Marakasova A., in: Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH). Osaka: [s.n.], 2016. P. 26–34.

We present an approach to detect differences in lexical semantics across English language registers, using word embedding models from distributional semantics paradigm. Models trained on register-specific subcorpora of the BNC corpus are employed to compare lists of nearest associates for particular words and draw conclusions about their semantic shifts depending on register in which they ...

Added: November 12, 2016

Text classification with deep learning neural networks

Voronkov Ilia, Amajd M., Kaimuldenov Z., in: Actual Problems of System and Software Engineering 2017. Proceedings of the 5th International Conference on Actual Problems of System and Software Engineering Supported by Russian Foundation for Basic Research. Project #17-07-20565 Moscow, Russia, November 14-16, 2017, 408 P. Vol. 1989. Aachen: CEUR Workshop Proceedings, 2017. P. 362–370.

In this paper, we analyze the use of different neural networks for the text classification task. The accuracy of the studied text classifiers can be changed by a small number of previously classified texts. This is important due to the fact that in many applications of text classification a large number of unlabeled texts are easily accessible, while ...

Added: August 16, 2018

Automated Word Sense Frequency Estimation for Russian Nouns

Lopukhina A., Lopukhin K. A., Nosyrev G. V., in: Quantitative approaches to the Russian language. Abingdon: Routledge, 2018. P. 79–94.

According to G. K. Zipf’s observation, there is a strong correlation between word frequency and polysemy. Yet word sense frequency distribution is a neglected area in computational linguistics. Furthermore, the study of sense frequency has theoretical interest and practical applications for lexicography and word sense disambiguation. Although WordNet and SemCor contain some information about sense frequency ...

Added: October 11, 2016

Automated defect identification for cell phones using language context, linguistic and smoke-word models

Muhammad Z. Y., Malik M. S., Ignatov D. I., Expert Systems with Applications 2023 Vol. 227 Article 120236

Product defects are a widespread concern for manufacturers when conducting quality and customer relationship management. Prior approaches addressed many electronic products; however, cell phones are still unexplored. Moreover, prior work mainly focused on the lexicon, probabilistic graphic, failure mode, and effect analysis models, but the utilization of word embeddings and language models is not explored. State-of-the-art contextual word embeddings and language models generate automated features and ...

Added: June 13, 2023

A Dataset for Noun Compositionality Detection for a Slavic Language

Puzyrev D., Shelmanov A., Panchenko A. et al., in: Proceedings of the 7th Workshop on Balto-Slavic Natural Language Processing, 2019, Florence, Italy, Association for Computational Linguistics. Association for Computational Linguistics, 2019. P. 56–62.

This paper presents the first gold-standard resource for Russian annotated with compositionality information of noun compounds. The compound phrases are collected from the Universal Dependency treebanks according to part of speech patterns, such as ADJ+NOUN or NOUN+NOUN, using the gold-standard annotations. Each compound phrase is annotated by two experts and a moderator according to the following ...

Added: October 30, 2019

You shall know a piece by the company it keeps. Chess plays as a data for word2vec models

Orekhov B. / Series Computer Science "arxiv.org". 2024.

In this paper, I apply linguistic methods of analysis to non-linguistic data, chess plays, metaphorically equating one with the other and seeking analogies. Chess game notations are also a kind of text, and one can consider the records of moves or positions of pieces as words and statements in a certain language. In this article ...

Added: August 8, 2024

Scalable and language-independent embedding-based approach for plagiarism detection considering obfuscation type: no training phase

Gharavi E., Veisi H., Rosso P., Neural Computing and Applications 2020 Vol. 32 No. 14 P. 10593–10607

The efficiency and scalability of plagiarism detection systems have become a major challenge due to the vast amount of available textual data in several languages over the Internet. Plagiarism occurs in different levels of obfuscation, ranging from the exact copy of original materials to text summarization. Consequently, designed algorithms to detect plagiarism should be robust ...

Added: October 29, 2020

On Singular Value Decomposition and Polar Decomposition in Geometric Algebras

Shirokov D., in: Advances in Computer Graphics: 40th Computer Graphics International Conference, CGI 2023, Shanghai, China, August 28 – September 1, 2023, Proceedings, Part IV* 4. Vol. 14498. Springer, 2024. P. 391–401.

This paper is a brief note on the natural implementation of singular value decomposition (SVD) and polar decomposition of an arbitrary multivector in nondegenerate real (Clifford) geometric algebras of arbitrary dimension and signature. We naturally define these and other related structures (operation of Hermitian conjugation, Euclidean space, and Lie groups) in geometric algebras. The results ...

Added: December 25, 2023

Automated Analysis of Discourse Coherence in Schizophrenia: Approximation of Manual Measures

Ryazanskaya G., Khudyakova M., in: Proceedings of the LREC 2020 Workshop on: Resources and Processing of Linguistic, Para-linguistic and Extra-linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments (RaPID-3). European Language Resources Association (ELRA), 2020. P. 98–107.

Disorganized, or incoherent, speech is one of the important criteria for diagnosing schizophrenia. However, there is still a lack of a rather quick objective method of measuring speech coherence. Automated discourse analysis is a possible solution to this problem. We analyzed discourse coherence in a set of spoken narratives by people with schizophrenia and neurotypical speakers ...

Added: February 2, 2021

Extracting social networks from literary text with word embedding tools

Wohlgenannt G., Artemova E., Ilvovsky D., in: Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH). Osaka: [s.n.], 2016. Ch. 4 P. 18–26.

In this paper, a social network is extracted from a literary text. The social network shows how frequently the characters interact and how similar their social behavior is. Two types of similarity measures are used: the first applies co-occurrence statistics, while the second exploits cosine similarity on different types of word embedding vectors. The results ...

Added: March 6, 2017

CONSTRUCTING THE IMAGE OF A CITY IN OFFICIAL AND EVERYDAY COMMUNICATION: A COMPARATIVE ANALYSIS (BASED ON SOCIAL MEDIA)

Matkin N., Communications. Media. Design 2024

The article offers an analysis and visualization of Russian city images that emerge in the comments of urban community subscribers and posts from administrative press services. The city image is regarded as a frame structure that develops through political and interpersonal communication in the network. The social component of the city image is identified as ...

Added: November 15, 2023

Extraction of Hypernyms from Dictionaries with a Little Help from Word Embeddings

Karyaeva M., Braslavski P., Kiselev Y., in: Analysis of Images, Social Networks and Texts. 7th International Conference AIST 2018. Springer, 2018. P. 76–87.

The paper investigates several techniques for hypernymy extraction from a large collection of dictionary definitions in Russian. First, definitions from different dictionaries are clustered, then single words and multiwords are extracted as hypernym candidates. A classification-based approach on pre-trained word embeddings is implemented as a complementary technique. In total, we extracted about 40K unique hypernym ...

Added: March 11, 2019

How to detect propaganda from social media? Exploitation of semantic and fine-tuned language models

Malik M. S., Imran T., Mona Mamdouh J., PeerJ Computer Science 2023 Vol. 9 Article e1248

Online propaganda is a mechanism to influence the opinions of social media users. It is a growing menace to public health, democratic institutions, and public society. The present study proposes a propaganda detection framework as a binary classification model based on a news repository. Several feature models are explored to develop a robust model such ...

Added: September 4, 2023

Distributed representations of rare Russian words that take into account the vectors of same-root words

Malafeev A., Maltina L. P., Nauchno-Tekhnicheskaya Informatsiya. Seriya 2: Informatsionnye Protsessy i Sistemy 2021 No. 1

The paper proposes algorithms that perform automatic morphemic analysis of words and methods of distributed representations of words that indirectly use information about the morphemic composition through the averaging of vectors of same-root words. Morphemic analysis models for the Russian language are evaluated on samples of common and rare words. Several methods are proposed for obtaining ...

Added: November 9, 2020

Mutual information spectrum for selection of event-related spatial components. Application to eloquent motor cortex mapping

Ossadtchi A., Pronko P. K., Baillet S. et al., Frontiers in Neuroinformatics 2014 Vol. 7 No. 53 P. 1–11

Spatial component analysis is often used to explore multidimensional time series data whose sources cannot be measured directly. Several methods may be used to decompose the data into a set of spatial components with temporal loadings. Component selection is of crucial importance, and should be supported by objective criteria. In some applications, the use of ...

Added: January 19, 2014

Study on precoding optimization algorithms in massive MIMO system with multi-antenna users

Bobrov E., Kropotov D., Troshin S. et al., Optimization Methods and Software 2022 P. 1–16

The paper studies the multi-user precoding problem as a non-convex optimization problem for wireless multiple inputs and multiple outputs (MIMO) systems. In our work, we approximate the target Spectral Efficiency function with a novel computationally simpler function. Then, we reduce the precoding problem to an unconstrained optimization task using a special differential projection method and ...

Added: October 26, 2022

An efficient complex structure-preserving algorithm for the Autonne-Takagi decomposition of quaternion matrices

Wang G., Siberian Electronic Mathematical Reports 2024

Given the importance of the Autonne-Takagi decomposition in fields such as quantum computing and signal processing, and the research gap in this decomposition algorithm in quaternionic mechanics, this paper investigates the algorithm for computing the Autonne-Takagi decomposition of quaternion matrices, that is, special singular value decomposition algorithms for η-Hermitian quaternion matrices. Effective algorithms for rotation ...

Added: December 16, 2024

How much does a word weight? Weighting word embeddings for word sense induction

Arefyev N., Ermolaev P., Panchenko A., in: Computational Linguistics and Intellectual Technologies. International Conference "Dialogue 2018" Proceedings. M.: Conference Proceedings Editorial board, 2018. P. 68–84.

The paper describes our participation in the first shared task on word sense induction and disambiguation for the Russian language RUSSE'2018 [Panchenko et al., 2018]. For each of several dozens of ambiguous words, the participants were asked to group text fragments containing it according to the senses of this word, which were not provided beforehand, ...

Added: October 9, 2020