Russian Sentence Corpus: Benchmark measures of eye movements in reading in Russian
In this article we report some new experiments in the area of words clustering for the Russian language. We introduce a new clustering method that distributes words into classes according to their syntactic relations. We used a large untagged corpus (about 7,2 bln of words) to collect a set of such relations. The corpus was processed using a set of finite state automata that extracts syntactically dependent combinations having explicit structure. These automata were used to process only unambiguous text fragments because of combination of these techniques increases the quality of sampled input data. The modification of group average agglomerative clustering was used to separate words between clusters. The sampled set of clusters was tested using one of the semantic dictionaries of the Russian language. The NMI score calculated in this article is equal to 0.457 and F1-score is 0.607.
«Bankruptcy» Concept Within the Legal Linguistics Coordinates: Russian–English–French Approximations
The article addresses the notion of bankruptcy as perceived by speakers of current Russian, English and French languages both lawyers and participants in professional communication from other trades. Semantic structure of the term is identified based on its lexicographic and regulatory definitions.
In this paper we describe the design and development of a multi-touch surface and software that challenges current approaches to the production and consumption of comics. Authorship of the comics involves drawing the ‘top level’ of the story directly onto paper and projecting lower-level narrative elements, such as objects, characters, dialogue, descriptions and/or events onto the paper via a multi-touch interface. In terms of the impact this has upon the experience of reading and writing, the implementation of paper is intended to facilitate the creation of high-level overviews of stories, while the touch surface allows users to generate branches through the addition of artifacts in accordance with certain theories about interactive narratives. This provides the opportunity to participate in the reading and authoring of both traditional, paper-based texts and interactive, digital scenarios. Prototype comics are used to demonstrate this approach to reading and writing top-level and low-level narratives.
The articles cover issues of reading, reading competency, library development, as well as the prevention of abnormal development of the young reader.
In this article we present the results of research into discourse features characterising a lexico-semantic group of synonyms denoting a human being: human being, person, individual, personality and man. The main tool for analysis was language corpora, which made it possible not only to determine more precisely the functional styles the lexemes tend to be used in, but also to describe thematic characteristics of the texts in which the analysed lexical units show the highest frequency of use
The distractive effects on attentional task performance in different paradigms are analyzed in this paper. I demonstrate how distractors may negatively affect (interference effect), positively (redundancy effect) or neutrally (null effect). Distractor effects described in literature are classified in accordance with their hypothetical source. The general rule of the theory is also introduced. It contains the formal prediction of the particular distractor effect, based on entropy and redundancy measures from the mathematical theory of communication (Shannon, 1948). Single- vs dual-process frameworks are considered for hypothetical mechanisms which underpin the distractor effects. Distractor profiles (DPs) are also introduced for the formalization and simple visualization of experimental data concerning the distractor effects. Typical shapes of DPs and their interpretations are discussed with examples from three frequently cited experiments. Finally, the paper introduces hierarchical hypothesis that states the level-fashion modulating interrelations between distractor effects of different classes.
The results of research of different areas of personality of homeless men: values, life attitudes, activity, homelessness area is presents. The data indicate the presence of a number of characteristics inherent in varying degrees all homeless people. The data obtained can be used to build an effective program of psychological re-socialization of homeless people.
The present article continues the investigation of the Soqotri verbal system undertaken by the Russian-Soqotri fieldwork team. The article focuses on the so-called “weak” and “geminated” roots in the basic stem. The investigation is based on the analysis of full paradigms (perfect, imperfect and jussive) of more than 170 “weak” and “geminated” Soqotri verbs.