Proceedings of the 2016 Future Technologies Conference
In the given paper the aggregated randomized indices method is modified for credit scoring. Coefficients of the modified method can be calibrated on a massive training set in comparison with a standard version. Different credit scoring models are analyzed, i.e. with a binary scale and a continuous one. The Monte Carlo method is applied to measure the efficiency of models.
The paper deals with the problems of creating and tuning a system of automated anaphora resolution for Russian. Such a system is introduced, combining rule-based and machine learning approaches. It shows F-measure from 0.51 to 0.59. Freeling serves as an underlying morphological layer and an account of its quality is given, with its influence on anaphora resolution workflow. The anaphora resolution system itself is available to download and use, coming with online demo.
The volume contains the abstracts of the 12th International Conference "Intelligent Data Processing: Theory and Applications". The conference is organized by the Russian Academy of Sciences, the Federal Research Center "Informatics and Control" of the Russian Academy of Sciences and the Scientific and Coordination Center "Digital Methods of Data Mining". The conference has being held biennially since 1989. It is one of the most recognizable scientific forums on data mining, machine learning, pattern recognition, image analysis, signal processing, and discrete analysis. The Organizing Committee of IDP-2018 is grateful to Forecsys Co. and CFRS Co. for providing assistance in the conference preparation and execution. The conference is funded by RFBR, grant 18-07-20075. The conference website http://mmro.ru/en/.
This research work deals with the problem formulation of control of complex organizational structures. The mechanism of functioning of such systems is described by example of a vertically integrated company (VIC). The problems of strategic and operative control of VIC are considered. The methods for solving such problems based on genetic algorithms and neural networks are suggested. A new iterative procedure for coordination of strategic and operative control goals based on the estimation of imbalance between shareholder value and net profit distributed for payment of dividends to shareholders is suggested.
The considered system is a double criterion optimization problem with complex multiparameter restrictions.
This proceedings publication is a compilation of selected contributions from the “Third International Conference on the Dynamics of Information Systems” which took place at the University of Florida, Gainesville, February 16–18, 2011. The purpose of this conference was to bring together scientists and engineers from industry, government, and academia in order to exchange new discoveries and results in a broad range of topics relevant to the theory and practice of dynamics of information systems. Dynamics of Information Systems: Mathematical Foundation presents state-of-the art research and is intended for graduate students and researchers interested in some of the most recent discoveries in information theory and dynamical systems. Scientists in other disciplines may also benefit from the applications of new developments to their own area of study.
In an effort to make reading more accessible, an automated readability formula can help students to retrieve appropriate material for their language level. This study attempts to discover and analyze a set of possible features that can be used for single-sentence readability prediction in Russian. We test the influence of syntactic features on predictability of structural complexity. The readability of sentences from SynTagRus corpus was marked up manually and used for evaluation.
In this work is presented a new approach to the designing of intelligent systems of the control of the shareholder value for the vertical-integrated Financial Corporation (VIFK). Developed system based on using of system-dynamics methods for the simulation of the synergic interaction between different business directions of VIFK for the target of shareholder value maximization. Note, the described system has been successfully introduced in biggest Russian banking groups and it is used for the preparing of strategic decisions.
This paper is an overview of the current issues and tendencies in Computational linguistics. The overview is based on the materials of the conference on computational linguistics COLING’2012. The modern approaches to the traditional NLP domains such as pos-tagging, syntactic parsing, machine translation are discussed. The highlights of automated information extraction, such as fact extraction, opinion mining are also in focus. The main tendency of modern technologies in Computational linguistics is to accumulate the higher level of linguistic analysis (discourse analysis, cognitive modeling) in the models and to combine machine learning technologies with the algorithmic methods on the basis of deep expert linguistic knowledge.
The scope of the conference is to gather researchers from different areas and disciplines to present results and participate in discussions under the common theme of intelligent systems and computing. These interactions will facilitate a better understanding of the diversity of the different approaches as well as of their similarities. In addition it will open the way for applying approaches that have been successful in one area to problem solving in different areas and applications.
This book constitutes the refereed proceedings of the 12th Industrial Conference on Data Mining, ICDM 2012, held in Berlin, Germany in July 2012. The 22 revised full papers presented were carefully reviewed and selected from 97 submissions. The papers are organized in topical sections on data mining in medicine and biology; data mining for energy industry; data mining in traffic and logistic; data mining in telecommunication; data mining in engineering; theory in data mining; theory in data mining: clustering; theory in data mining: association rule mining and decision rule mining.
A form for an unbiased estimate of the coefficient of determination of a linear regression model is obtained. It is calculated by using a sample from a multivariate normal distribution. This estimate is proposed as an alternative criterion for a choice of regression factors.