Russian sentence corpus: benchmark measures of eye movements in reading in cyrillic
In this article we report some new experiments in the area of words clustering for the Russian language. We introduce a new clustering method that distributes words into classes according to their syntactic relations. We used a large untagged corpus (about 7,2 bln of words) to collect a set of such relations. The corpus was processed using a set of finite state automata that extracts syntactically dependent combinations having explicit structure. These automata were used to process only unambiguous text fragments because of combination of these techniques increases the quality of sampled input data. The modification of group average agglomerative clustering was used to separate words between clusters. The sampled set of clusters was tested using one of the semantic dictionaries of the Russian language. The NMI score calculated in this article is equal to 0.457 and F1-score is 0.607.
In this paper we describe the design and development of a multi-touch surface and software that challenges current approaches to the production and consumption of comics. Authorship of the comics involves drawing the ‘top level’ of the story directly onto paper and projecting lower-level narrative elements, such as objects, characters, dialogue, descriptions and/or events onto the paper via a multi-touch interface. In terms of the impact this has upon the experience of reading and writing, the implementation of paper is intended to facilitate the creation of high-level overviews of stories, while the touch surface allows users to generate branches through the addition of artifacts in accordance with certain theories about interactive narratives. This provides the opportunity to participate in the reading and authoring of both traditional, paper-based texts and interactive, digital scenarios. Prototype comics are used to demonstrate this approach to reading and writing top-level and low-level narratives.
«Bankruptcy» Concept Within the Legal Linguistics Coordinates: Russian–English–French Approximations
The article addresses the notion of bankruptcy as perceived by speakers of current Russian, English and French languages both lawyers and participants in professional communication from other trades. Semantic structure of the term is identified based on its lexicographic and regulatory definitions.
The articles cover issues of reading, reading competency, library development, as well as the prevention of abnormal development of the young reader.