Proceedings of the Institute for System Programming of the RAS
Proceedings of ISP RAS is a double-blind peer-reviewed journal publishing scientific articles in the areas of system programming, software engineering, and computer science. The journal's goal is to develop a respected network of knowledge in these areas by publishing high-quality open-access articles. The journal is intended for researchers, students, and practitioners.
The volume is dedicated to Boris Mirkin on the occasion of his 70th birthday. In addition to his startling PhD results in abstract automata theory, Mirkin’s groundbreaking contributions in various fields of decision making and data analysis have marked the fourth quarter of the 20th century and beyond. Mirkin has done pioneering work in group choice, clustering, data mining, and knowledge discovery aimed at finding and describing non-trivial or hidden structures (first of all, clusters, orderings, and hierarchies) in multivariate and/or network data.
This volume contains a collection of papers reflecting recent developments rooted in Mirkin's fundamental contributions to the state of the art in group choice, ordering, clustering, data mining, and knowledge discovery. Researchers, students, and software engineers will benefit from new knowledge discovery techniques and application directions.
Measuring the value of IT is always a challenge for investors. The market share of service-oriented information systems (IS) is constantly growing, which creates demand for methods of measuring the value of SOA-based IS projects. This research aims to adapt existing IT project assessment methods to this demand. The work proposes a method that exploits the fact that the deployment and evolution of an SOA-based IS can be split into separate flows, one per service. This allows an individual discount rate to be used for each service, since project risk differs from service to service. It should make project value assessment more accurate than existing methods, which use a single flow for the entire project. This research also proposes using Real Options to calculate the flexibility component of the value. The developed method was verified using the authors' own simulation model. Both the method and the simulation model were applied to the value assessment of a real-world project.
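The idea of one flow per service, each with its own discount rate, can be illustrated with a minimal sketch. It is not the authors' method: the service names, cash flows, and rates below are invented for illustration, and the Real Options component is left out entirely.

```python
from typing import Dict, List, Tuple


def npv(cash_flows: List[float], rate: float) -> float:
    """Net present value of cash flows occurring at periods 0, 1, 2, ..."""
    return sum(cf / (1.0 + rate) ** t for t, cf in enumerate(cash_flows))


def project_value(services: Dict[str, Tuple[List[float], float]]) -> float:
    """Sum of per-service NPVs, each discounted at that service's own rate."""
    return sum(npv(flows, rate) for flows, rate in services.values())


# Hypothetical services: (cash flows per period, risk-adjusted discount rate).
services = {
    "billing": ([-100.0, 60.0, 60.0], 0.08),    # assumed lower-risk service
    "reporting": ([-50.0, 40.0, 40.0], 0.25),   # assumed higher-risk service
}

per_service = project_value(services)

# Single-flow baseline: pool all cash flows and apply one assumed rate.
pooled = [sum(period) for period in zip(*(f for f, _ in services.values()))]
single_flow = npv(pooled, 0.15)

print(f"per-service value: {per_service:.2f}")
print(f"single-flow value: {single_flow:.2f}")
```

The two figures differ because the pooled calculation applies one rate to flows whose risks are not the same.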
A new computer architecture, called object-attribute, is proposed in the article. Computers of this architecture have all the properties necessary for Artificial Intelligence: abstraction of data and programs, high concurrency, isomorphism of data and programs (i.e., the possibility of painlessly changing program and data structures), training and self-training of the computer system, dataflow, integration of data and programs, generation of object descriptions from simple to complex, and implementation of distributed computer systems.
Companies from various domains record their operational behavior in the form of event logs. These event logs can be analyzed, and relevant process models representing the companies' real behavior can be discovered. One of the main advantages of process discovery methods is that they commonly produce models in the form of graphs, which can be easily visualized and give an intuitive view of the executed processes. Moreover, the graph-based representation opens new and challenging perspectives for applying graph comparison methods to find and explicitly visualize differences between discovered process models (representing real behavior) and reference process models (representing expected behavior). Another important area where graph comparison algorithms can be used is the recognition of process modeling patterns. Unfortunately, exact graph comparison algorithms are computationally expensive. In this paper, we adapt an inexact tabu search algorithm to find differences between BPMN (Business Process Model and Notation) models. The tabu search and greedy algorithms were implemented within the BPMNDiffViz tool and were tested on BPMN models discovered from synthetic and real-life event logs. It was experimentally shown that the inexact tabu search algorithm finds a solution close to the optimal one in most cases. At the same time, its computational complexity is significantly lower than that of the exact A* search algorithm investigated earlier.
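To give a feel for inexact graph comparison by tabu search, here is a minimal sketch. It is not the BPMNDiffViz implementation. It assumes two labeled directed graphs with the same number of nodes, a simplified edit cost (label mismatches plus unmatched edges), and node-swap moves; the function names, the cost, and the tenure value are all illustrative choices.

```python
from itertools import combinations


def mapping_cost(perm, labels1, edges1, labels2, edges2):
    """Edit cost of mapping node i of graph 1 to node perm[i] of graph 2."""
    label_cost = sum(labels1[i] != labels2[perm[i]] for i in range(len(perm)))
    mapped_edges = {(perm[u], perm[v]) for u, v in edges1}
    # Count edges that exist in only one graph under this mapping.
    return label_cost + len(mapped_edges ^ edges2)


def tabu_search(labels1, edges1, labels2, edges2, iterations=50, tenure=3):
    """Search node mappings (permutations) for a low-cost match."""
    n = len(labels1)
    current = list(range(n))
    best = current[:]
    best_cost = mapping_cost(current, labels1, edges1, labels2, edges2)
    tabu_until = {}  # swap (i, j) -> last iteration at which it is forbidden
    for it in range(iterations):
        if best_cost == 0:
            break
        candidates = []
        for i, j in combinations(range(n), 2):
            neighbour = current[:]
            neighbour[i], neighbour[j] = neighbour[j], neighbour[i]
            cost = mapping_cost(neighbour, labels1, edges1, labels2, edges2)
            is_tabu = tabu_until.get((i, j), -1) >= it
            # Aspiration: a tabu move is allowed if it beats the best so far.
            if not is_tabu or cost < best_cost:
                candidates.append((cost, (i, j), neighbour))
        if not candidates:
            break
        # Take the best admissible move even if it worsens the current cost.
        cost, move, current = min(candidates, key=lambda c: (c[0], c[1]))
        tabu_until[move] = it + tenure
        if cost < best_cost:
            best, best_cost = current[:], cost
    return best, best_cost


# Toy example: the same four-step process with two node ids exchanged.
labels_a = ["start", "check", "pay", "end"]
edges_a = {(0, 1), (1, 2), (2, 3)}
labels_b = ["start", "pay", "check", "end"]
edges_b = {(0, 2), (2, 1), (1, 3)}

best_perm, best_perm_cost = tabu_search(labels_a, edges_a, labels_b, edges_b)
print(best_perm, best_perm_cost)
```

The tabu list prevents the search from immediately undoing a recent swap, which lets it leave local minima where a purely greedy descent would stop.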
The article describes the project-based approach to learning as a way of forming professional competence in software engineering. Since this field is quite different from engineering as such, learning and competence formation are organized in a special way: training projects are selected on the basis of the competencies that need to be mastered.
The article describes the use of guidelines for individual and collective (team) software development processes (Personal Software Process, PSP, and Team Software Process, TSP), developed by the Software Engineering Institute (SEI) at Carnegie Mellon University (CMU), to help understand what concrete actions, skills, and knowledge are necessary for the development of specific competencies. The benefits of distinguishing competency areas are highlighted. The basic features of educational projects and the positive characteristics of project-based learning are formulated in terms of increased demand for specialists in the labor market.
This book constitutes the refereed proceedings of the 12th Industrial Conference on Data Mining, ICDM 2012, held in Berlin, Germany in July 2012. The 22 revised full papers presented were carefully reviewed and selected from 97 submissions. The papers are organized in topical sections on data mining in medicine and biology; data mining for energy industry; data mining in traffic and logistic; data mining in telecommunication; data mining in engineering; theory in data mining; theory in data mining: clustering; theory in data mining: association rule mining and decision rule mining.
A model for organizing cargo transportation between two node stations connected by a railway line with a certain number of intermediate stations is considered. Cargo moves in one direction only. Such a situation may occur, for example, if one of the node stations is located in a region that produces raw material for a manufacturing industry located in another region, where the other node station is situated. Freight traffic is organized by means of a number of technologies. These technologies determine the rules for taking on cargo at the initial node station, the rules of interaction between neighboring stations, and the rule for distributing cargo at the final node station. The process of cargo transportation follows a set control rule. For such a model, one must determine the possible modes of cargo transportation and describe their properties. The model is described by a finite-dimensional system of differential equations with nonlocal linear restrictions. The class of solutions satisfying the nonlocal linear restrictions is extremely narrow. This creates the need for a "correct" extension of the solutions of the system of differential equations to a class of quasi-solutions whose distinctive feature is gaps at a countable number of points. Using the fourth-order Runge–Kutta method, it was possible to construct these quasi-solutions numerically and determine their rate of growth. Note that, technically, the main difficulty was obtaining quasi-solutions that satisfy the nonlocal linear restrictions. Furthermore, we investigated how the quasi-solutions and, in particular, the sizes of the gaps (jumps) depend on a number of model parameters characterizing the control rule, the cargo transportation technologies, and the intensity of cargo supply at a node station.
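For readers unfamiliar with the numerical scheme named above, the following is a generic sketch of the classical fourth-order Runge–Kutta method for a system y' = f(t, y). It does not reproduce the transportation model: the equations, the nonlocal restrictions, and the construction of quasi-solutions are not given here, so the right-hand side used below is only a toy decay equation chosen for illustration.

```python
import math


def rk4_step(f, t, y, h):
    """One classical RK4 step for the system y' = f(t, y); y is a list."""
    k1 = f(t, y)
    k2 = f(t + h / 2, [yi + h / 2 * ki for yi, ki in zip(y, k1)])
    k3 = f(t + h / 2, [yi + h / 2 * ki for yi, ki in zip(y, k2)])
    k4 = f(t + h, [yi + h * ki for yi, ki in zip(y, k3)])
    return [
        yi + h / 6 * (a + 2 * b + 2 * c + d)
        for yi, a, b, c, d in zip(y, k1, k2, k3, k4)
    ]


def integrate(f, t0, y0, t_end, steps):
    """Integrate from t0 to t_end with a fixed number of equal steps."""
    h = (t_end - t0) / steps
    t, y = t0, list(y0)
    for _ in range(steps):
        y = rk4_step(f, t, y, h)
        t += h
    return y


def decay(t, y):
    """Toy right-hand side y' = -y (an assumption, not the paper's model)."""
    return [-y[0]]


approx = integrate(decay, 0.0, [1.0], 1.0, 100)[0]
print(approx, math.exp(-1.0))
```

A solution with jumps would need the integration restarted after each discontinuity with updated initial values; how those values are chosen depends on the restrictions of the model and is not attempted here.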
Event logs collected by modern information and technical systems usually contain enough data for automated process model discovery. A variety of algorithms have been developed for process model discovery, conformance checking, log-to-model alignment, comparison of process models, etc.; nevertheless, quick analysis of ad-hoc selected parts of a log has not yet received a full-fledged implementation. This paper describes a ROLAP-based method of multidimensional event log storage for process mining. The result of the log analysis is visualized as a directed graph representing the union of all possible event sequences, ranked by their occurrence probability. Our implementation allows the analyst to discover process models for sublogs defined by an ad-hoc selection of criteria and an occurrence probability value.
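A rough sketch of the kind of result described above: a directed graph built from a sublog, with edges weighted by how often one activity directly follows another. It works on an in-memory list rather than ROLAP storage, and it assumes that "occurrence probability" means the conditional frequency of a transition; the field names `case`, `activity`, and `timestamp` and both function names are hypothetical.

```python
from collections import Counter, defaultdict


def discover_graph(events, keep=lambda e: True):
    """Return {(a, b): probability that b directly follows a} for a sublog.

    `keep` is an ad-hoc filter that selects the events of the sublog.
    """
    traces = defaultdict(list)
    for e in sorted(events, key=lambda e: (e["case"], e["timestamp"])):
        if keep(e):
            traces[e["case"]].append(e["activity"])
    counts = Counter()
    for activities in traces.values():
        counts.update(zip(activities, activities[1:]))
    outgoing = Counter()
    for (a, _), c in counts.items():
        outgoing[a] += c
    return {(a, b): c / outgoing[a] for (a, b), c in counts.items()}


def prune(graph, min_probability):
    """Keep only edges whose probability reaches the chosen threshold."""
    return {edge: p for edge, p in graph.items() if p >= min_probability}


# Invented toy log with two cases.
log = [
    {"case": "c1", "activity": "A", "timestamp": 1},
    {"case": "c1", "activity": "B", "timestamp": 2},
    {"case": "c1", "activity": "C", "timestamp": 3},
    {"case": "c2", "activity": "A", "timestamp": 1},
    {"case": "c2", "activity": "C", "timestamp": 2},
]

graph = discover_graph(log)
for edge, p in sorted(graph.items()):
    print(edge, p)
```

Passing a different `keep` predicate (for example, one that excludes an activity or restricts a time window) yields the model of another sublog without touching the stored events.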
The geographic information system (GIS) is based on the first and only Russian Imperial Census of 1897 and the First All-Union Census of the Soviet Union of 1926. The GIS features vector data (shapefiles) of all provinces of the two states. For the 1897 census, there is information about linguistic, religious, and social estate groups. The part based on the 1926 census features nationality. Both shapefiles include information on gender and on rural and urban population. The GIS allows users to produce any maps needed for individual studies of the period that require administrative boundaries and demographic information.
Existing approaches suggest that IT strategy should be a reflection of business strategy. In practice, however, organisations often do not follow their business strategy even if it is formally declared. Under these conditions, IT strategy can be viewed not as a plan but as an organisation's shared view of the role of information systems. This approach generally reflects only a top-down perspective of IT strategy, so it can be supplemented by a strategic behaviour pattern (i.e., a more or less standard response to changes, formed as a result of previous experience) to implement a bottom-up approach. Two components that can help to establish an effective reaction to new IT initiatives are proposed here: a model of IT-related decision making, and an efficiency metric for estimating the maturity of business processes and the corresponding IT. The use of the proposed tools is demonstrated in practical cases.