The Database of Cross-Linguistic Colexifications, reproducible analysis of cross-linguistic polysemies
Advances in computer-assisted linguistic research are greatly influencing and reshaping linguistic investigation. With the increasing availability of interconnected datasets created and curated by researchers, more and more interwoven questions can now be investigated. Such advances, however, are bringing high requirements in terms of rigorousness for preparing and curating datasets. In this work we present CLICS, a Database of Cross-Linguistic Colexifications which aims to both tackle interdisciplinary and interconnected research questions as well as showcasing best practices in preparing data for cross-linguistic research. This is done by addressing shortcomings of an earlier version of the database, CLICS2, and supplying an updated version with CLICS3 which massively increases the size and scope of the project. We provide tools and guidelines for this purpose and discuss insights resulting from organizing student tasks for database updates.
Word sense disambiguation (WSD) methods are useful for many NLP tasks that require semantic interpretation of input. Furthermore, such methods can help estimate word sense frequencies in different corpora, which is important for lexicographic studies and language learning resources. Although previous research on Russian polysemous verbs disambiguation established some important and interesting results, it was mostly focused on reducing ambiguity or determining the most frequent sense, but not on evaluating WSD accuracy. To the best of our knowledge, there is no comprehensively evaluated method that can perform semi-supervised word sense disambiguation for Russian verbs. In this paper we present a WSD method for verbs that is able to reach an average disambiguation accuracy of 75% using only available linguistic resources: examples and collocations from the Active Dictionary of Russian and large unlabeled corpora. We evaluate the method on contexts sampled from the web-based corpus RuTenTen11 for 10 verbs with 100 contexts for each verb. We compare different variations of the method and analyze its limitations. Method’s implementation and labeled contexts are available online.
The paper deals with comparative analysis of two groups of polysemantic words in Russian and in English proving the idea of relativity of the structural scheme of the polysemantic word. The specificity of the lingual unit is always relative and depends on the background against which the comparison is performed.
The collected papers contain articles by famous and young scientists on actual problems of philology (cognitive linguistics, lexical semantics, semiotics, pragmatics, text linguistics, stylistics; poetics, literary criticism; translation, intercultural communication). The issue also presents research on foreign language teaching methods. The edition is addressed to linguists, translators, teachers, postgraduates, students and a wide readership.
These proceedings include papers on subjects from a wide number of areas including theoretical linguistics, translation, computational linguistics, natural language processing, and applied linguistics, focusing on a variety of languages, ranging from familiar Indo-European languages to Mandarin Chinese, Wolof, and Dene Sųɬiné. In order to make the papers available to the wider research community, these proceedings are being published electronically and distributed freely at http://www.meaningtext.net
The monograph presents a scientific and theoretical edition and devoted to the problem of polysemy with a cognitive-semantic context, in particular, complex, not single metaphors and their cognitive heuristic function. The study is based on further development and elaboration of the theory of custody schemes of J. Lakoff and M. Johnson, and considers a metaphor as a tool for systemic understanding of deep conceptual spheres, giving the structural cohesion to the human experience. Cognitive approach to the scientific study assumes that the entire role in the formation of language values belongs to the person as a participant of communication, the observer and the media of knowledge and some experience.
The article is implemented within the cognitive approach and is dedicated to the formation of a substantive core of the polysemantic verb, particularly of the verb of relations in the modern English. The first part of the article presents points of view on the semantic nature - "content plan" - of the word. In the second part of the paper the authors identify the main cognitive mechanisms underlying the formation of meanings of the verb compose, as well as its substantial core that combines all the lexical-semantic variants of a this verb.
The paper discusses evaluation techniques for semantic role labeling in Russian. It has been shown that the quality of FrameNet-style semantic role labeling largely depends on the quantity of roles and may decrease if the inventory of roles in the training set differs from that in the output resource. Our study is the first step towards the ‘smart’ evaluation tool which would introduce linguistically relevant criteria to evaluation; be able to put the mistakes on a scale from minor to critical ones; make evaluation easier in case the grid of roles varies.
We run an experiment based on the data from the Russian FrameBank, a FrameNet-oriented open access database which includes a dictionary of Russian lexical constructions and a corpus of tagged examples. The semantic role is one of the parameters that define the predicate-argument patterns in FrameBank. The inventory of roles is modeled hierarchically and
forms a graph. We explore the cases when the role induced by the system and the answer of the gold standard do not match. We analyze the statistical criteria of distribution of roles in the patterns and the distance between the source and the target in the graph of roles as a mean to assess the goodness of fit.
The paper is focused on the study of reaction of italian literature critics on the publication of the Boris Pasternak's novel "Doctor Jivago". The analysys of the book ""Doctor Jivago", Pasternak, 1958, Italy" (published in Russian language in "Reka vremen", 2012, in Moscow) is given. The papers of italian writers, critics and historians of literature, who reacted immediately upon the publication of the novel (A. Moravia, I. Calvino, F.Fortini, C. Cassola, C. Salinari ecc.) are studied and analised.
In the article the patterns of the realization of emotional utterances in dialogic and monologic speech are described. The author pays special attention to the characteristic features of the speech of a speaker feeling psychic tension and to the compositional-pragmatic peculiarities of dialogic and monologic text.
This article is talking about state management and cultural policy, their nature and content in term of the new tendency - development of postindustrial society. It mentioned here, that at the moment cultural policy is the base of regional political activity and that regions can get strong competitive advantage if they are able to implement cultural policy successfully. All these trends can produce elements of new economic development.