The paper presents an open-source morphological processor of Russian texts recently developed and named CrossMorphy. The processor performs lemmatization, morphological tagging of both dictionary and non-dictionary words, contextual and non-contextual morphological disambiguation, generation of word forms, as well as morphemic parsing of words. Besides the extended functionality, emphasis is put on linguistic quality of word processing and easy integration into programming projects. CrossMorphy is fully implemented in C++ programming language on the base of OpenCorpora vocabulary data.
The article is devoted to the analysis of Amos 7:14 in the context of prophetic rhetoric. What does it mean if a personage of ancient Hebrew literature behaves like a prophet and at the same time denies that he is a prophet?
Abstract— In this paper a new multi-agent genetic algorithm for multi-objective optimization (MAGAMO) is presented. The algorithm based on the dynamical interaction of synchronized agents which are interdepended genetic algorithms (GAs) having own separate evolutions of their populations. This approach has some similarities with well known “island model” of GA. In both methods is used a migration of individuals from agents (“islands”) to the main process (“continent”). In contrast, the intelligent agents in MAGAMO are able to decompose the dimensions space to form evolutions of subpopulations (instead of distribution of initial population as in the standard “island model”). In the same time, the main (central) process is responsible for the coordination of agents only and their selection according Pareto rules (without evolution). Intelligent agents seek local suboptimal solutions for a global optimization, which will be completed in the result of the interaction of all agents. In the result of this, the amount of needed recalculating the fitness-functions can be significantly reduced. It is especially important for the multi-objective optimization related to a large-scale problem. Besides, the proposed approximating approach allows solving complex optimization problems for real big systems (like an oil company, plants, corporations, etc.).
The paper presents a supervised machine learning experiment with multiple features for identification of sentences containing verbal metaphors in raw Russian text. We introduce the custom-created training dataset, describe the feature engineering techniques, and discuss the results. The following set of features is applied: distributional semantic features, lexical and morphosyntactic co-occurrence frequencies, flag words, quotation marks, and sentence length. We combine these features into models of varying complexity; the results of the experiment demonstrate that fairly simple models based on lexical, morphosyntactic and semantic features are able to produce competitive results.
The natural language texts (NL-texts) from the newspapers, e-mail lists, various blogs, etc. are the important sources of information being able to stimulate the elaboration of a new plan of actions. The paper describes a new formal approach to developing multilingual algorithms of semantic-syntactic analysis of NL-texts. It is a part of the theory of K-representations - a new theory of designing semantic-syntactic analyzers of NL-texts with the broad use of formal means for representing input, intermediary, and output data. The current version of the theory is set forth in a monograph published by Springer in 2010. One of the principal constituents of this theory is a complex, strongly structured algorithm SemSynt1 carrying out semantic-syntactic analysis of texts from some practically interesting sublanguages of the English, German, and Russian languages. An important feature of this algorithm is that it doesn’t construct any syntactic representation of the inputted NL-text but directly finds semantic relations between text units. The other distinguished feature is that the algorithm is completely described with the help of formal means, that is why it is problem independent and doesn’t depend on a programming system. The peculiarities and some central procedures of the algorithm SemSynt1 are analyzed.
We present an efficient equivalence-checking algorithm for a propositional model of programs with semantics based on (what we call) progressive monoids on the finite set of statements generated by relations of a specific form. We consider arbitrary set of relations for commutativity (relations of the form ab=ba for statements a, b) and left absorption (relations of the form ab=b for statements a, b) properties. The main results are a polynomial-time decidability for the equivalence problem in the considered case, and an explicit description of an equivalence-checking algorithm which terminates in time polynomial in size of programs.
The paper sets forth a new way of considering impressionism under the frame of cognitonics. It is a new scientific discipline aimed at compensating the negative shifts in the cognitive-emotional development of personality and society caused by stormy progress of information and communication technologies (ICT) and globalization processes An original algorithm of transforming the negative emotions (caused by the messages received from social networks) into the positive ones is proposed. This algorithm considers the possible reactions of a human (including the recommended reactions) to the emotional attacks via social networks. A new look at impressionism underpins this algorithm. The algorithm is a part of an original interdisciplinary course “Foundations of secure living in information society”.
—Ramification in complete discrete valuation fields is studied. For the case of a perfect residue field, there is a well-developed theory of ramification groups. Hyodo introduced the concept of ramification depth associated with the different of an extension and obtained an inequality that combines the concept of ramification depth in a degree p2 cyclotomic extension with the concept of ramification depth in a degree p subextension. The paper gives a detailed consideration of the structure of degree p2 extensions that can be obtained by a composite of two degree p extensions. In this case, it is not required that the residue field be perfect. Using the concepts of wild and ferocious extensions and the defect of the main unit, degree p2 extensions are classified and more accurate estimates for the ramification depth are obtained. In a number of cases, exact formulas for ramification depth are presented.
In the past decade, a lot of researchers have focused on either management or measurement of intellectual capital components on macro (government), meso (local communities and industries) and micro (single organization itself) level. While it’s been a current issue for commercial organizations, higher education institutions, and private schools, a number of research projects about the implementation of intellectual capital concept (both parts) is scarce. Russian secondary education system is facing a necessity to become more transparent and client‐oriented, the existing reporting and management systems are not capable of dealing with this range of problems. Our research goal is to investigate the current status quo of intellectual capital in secondary schools (Do we have an established language for Intellectual Capital in educational institutions in St Petersburg Russia?). The level of perception and representation of IC metrics is analyzed by semantic analysis (what are the main IC components that value in a strategic planning of schools) and preliminary insights of the research are reported.
In this paper we propose two novel methods for analyzing data collected from online social networks. In particular we will do analyses on Vkontake data (Russian online social network). Using biclustering we extract groups of users with similar interests and find communities of users which belong to similar groups. With triclustering we reveal users’ interests as tags and use them to describe Vkontakte groups. After this social tagging process we can recommend to a particular user relevant groups to join or new friends from interesting groups which have a similar taste. We present some preliminary results and explain how we are going to apply these methods on massive data repositories.
The paper looks at the way intertextual elements in films are translated. The significance of the research is explained by the fact that despite intertextuality being a popular topic, there is still no common strategy how to handle intertextual fragments when it comes to translating them into another language. This task becomes even more difficult when movie scripts are analysed due to the challenge of dubbing the original with the translation within the same length of time. First, the paper looks at the way the concept intertext is interpreted in linguistics. Second, the paper describes the research which involves scripts for the series Suits (seasons 1 and 2). The author singles out intertextual elements in scripts, analyses their translations, identifies the translation strategy and discusses possible reasons for choosing the strategy and offers recommendations how to improve the translation.
In our previous work we have proposed the universal classification of RNA structure elements; the database of spatial structures of RNA based on this classification was developed. In this work we propose classifications of pseudoknots and multiplets, i.e. interactions between RNA helices and study prevalence of various types of structures in known RNA structures. Classification of multiplets is based on their graph representation; the vertices correspond to helices, the edges reflect H-bonds between helices.
The purpose of this paper is to investigate the performance of the protocol of interaction between the reader and the tags. The subject of the work is automatic vehicle identification system (AVI) making use of passive ultra-high frequency radio frequency identification (UHF RFID) under EPC Class 1 Generation 2 protocol. In the thesis, protocol specifications that affect the probability of successful identification are considered and a simulation model of the communication protocol between RFID tags and the reader is given. The purpose of the thesis is the analysis and modeling of the radio frequency identification protocol standard EPC Class 1 Generation 2. During the diploma work, the EPC Class 1 Generation 2 standard was studied, which describes the technology of passive radio frequency identification, in particular, the communication protocol between tags with the reader, the anticollisional procedure for interrogating tags, access to various areas of tag memory, the reader's operating parameters, security issues. To simplify the description and logical separation of the functional presented in the standard, two levels are introduced: physical and logical. The description of these levels is given in the thesis. For the performance analysis, a model of a radio frequency identification system in Python 3 was developed. The software allows to simulate the procedure of interrogating tags and build and analyze the dependencies of the number of successfully read tags on the settings of the reader based on the model described above.
The temperature-current rise in modern (up to 100-150 microns wide) PCB traces is simulated using three software tools ANSYS, HyperLynxThermal and ELCUT. The results are compared with the IR measurements in PCB copper traces with different sizes and substrate materials. It is shown that ANSYS correctly describes the thermal behavior for all tests, other tools have some limitations for small size traces.
The structure of the applied theory of the logistics systems design and planning continues to evolve in the circumstances of supply chain integration. The synchronization of material, service, information, and intellectual flows offers new opportunities for cross-disciplinary analysis. The article considers the development of conceptual framework, as well tactical and operational levels (e.g. existing terms and definitions, economic indicators, methods and models) that are applied in the research on the creation of value nets in terms of their flexibility, speed and reliability.
The article estimates Trends and factors of small business development in Russiawere analyzed in the article. By means of multivariate statistical procedures the RF regions were classified by groups, which differ in common features and problems in the development of small entrepreneurship. The results of the study show that the program to support and stimulate small business should take into account the existing cluster structure and the revealed factors.