In book
M. : IEEE, 2022
Kuznetsov S. D., Poskonin A. V., Proceedings of the Institute for System Programming of the RAS 2013 Vol. 24 P. 327-258
Many modern applications (such as large-scale websites, social networks, research projects, business analytics, etc.) have to deal with very large data volumes (also referred to as “big data”) and high read/write loads. These applications require underlying data management systems to scale well in order to accommodate data growth and increasing workloads. High throughput, low latencies ...
Added: January 30, 2018
Zudin S., Gnatyshak D. V., Ignatov D. I., , in : Proceedings of the Twelfth International Conference on Concept Lattices and Their Applications Clermont-Ferrand, France, October 13-16, 2015. Vol. 1466.: Clermont-Ferrand : CEUR Workshop Proceedings, 2015. P. 47-58.
In our previous work, an efficient one-pass online algorithm for triclustering of binary data (triadic formal contexts) was proposed. This algorithm is a modified version of the basic algorithm for the OAC-triclustering approach; it has linear time and memory complexities. In this paper we parallelise it via the map-reduce framework in order to make it suitable for big datasets. The results of ...
Added: October 23, 2015
Egurnov D., Ignatov D. I., Tochilkin D. S., / Springer. Series LNCS "Lecture Notes in Computer Science". 2020.
In this paper, we describe versions of triclustering algorithms adapted for efficient calculations in distributed environments with the MapReduce model or the parallelisation mechanisms provided by modern programming languages. The OAC family of triclustering algorithms shows good parallelisation capabilities due to the independent processing of triples of a triadic formal context. We provide the time and space complexity of ...
Added: November 10, 2020
Klemenkov P. A., Kuznetsov S. D., Proceedings of the Institute for System Programming of the RAS 2012 Vol. 23 P. 143-158
Big data has challenged traditional storage and analysis systems in several new ways. In this paper we try to figure out how to overcome these challenges and why it is not possible to do so efficiently, and we describe three modern approaches to big data handling: NoSQL, MapReduce and real-time stream processing. The first section of the paper is ...
Added: October 31, 2017
Shugurov I., Mitsyuk A. A., Proceedings of the Institute for System Programming of the RAS 2016 Vol. 28 No. 3 P. 103-122
Process mining is a relatively new research field offering methods for the analysis and improvement of business processes, based on studying their execution history (event logs). Conformance checking is one of the main sub-fields of process mining. Conformance checking algorithms aim to assess how well a given process model, typically represented by a Petri ...
Added: September 12, 2016
Tyryshkina Y., Tumkovskiy S., Information and Control Systems 2022 No. 5(120) P. 2-11
Added: May 31, 2022
Leokhin, Y., Myagkov, A., Panfilov, P., , in : 26th DAAAM International Symposium on Intelligent Manufacturing and Automation 2015. Vol. 1.: NY : Curran Associates, Inc., 2015. P. 0656 - 0662.
In this paper, we present the results of a computational evaluation of the goMapReduce parallel programming model approach for solving distributed data processing problems. In some applications, particularly data center problems, including text processing, the programming models can aggregate a significant number of parallel processes. We first discuss the implementation of these approaches using both Linux and Plan9 ...
Added: November 26, 2016
Egurnov D., Tochilkin D. S., Ignatov D. I., , in : Complex Data Analytics with Formal Concept Analysis. : Springer, 2022. P. 239-258.
In this paper, we describe versions of triclustering algorithms adapted for efficient calculations in distributed environments with the MapReduce model or the parallelisation mechanisms provided by modern programming languages. The OAC family of triclustering algorithms shows good parallelisation capabilities due to the independent processing of triples of a triadic formal context. We provide time and space complexity of the ...
Added: November 1, 2022
Tyryshkina Y., , in : International Seminar on Electron Devices Design and Production, SED 2021. : [s.n.], 2021.
In this paper, we consider the problem of reducing the cost of computer time by developing and implementing a method for accelerating the operation of connecting distributed data arrays according to a given criterion. The following tasks were solved: a study was conducted on the architecture of distributed data storages and parallel computing algorithms; on ...
Added: June 2, 2022
Ahmed Munna M. T., International Journal of Engineering and Technology 2018 Vol. 7 No. 8 P. 16-21
MapReduce has become a popular programming model for processing and running large-scale data sets with a parallel, distributed paradigm on a cluster. Hadoop MapReduce is needed especially for large scale data like big data processing. In this paper, we work to modify the Hadoop MapReduce Algorithm and implement it to reduce processing time. ...
Added: October 29, 2019
Leokhin Y. L., Myagkov A. S., Informatization of Education and Science 2014 Vol. 24 No. 4 P. 111-118
This article considers an implementation of the MapReduce parallel programming model, which is used in distributed computation. The results of a scalability study of the implementation running on the Plan9 operating system are shown. In addition, we point out the main directions for further development of the selected prototype. ...
Added: October 23, 2014
Tyryshkina Y., , in : International Scientific and Practical Conference «BIG DATA and Advanced Analytics». : Minsk : [s.n.], 2022.
This paper considers the problem of reducing the cost of computer time by developing and implementing a method for accelerating the operation of joining distributed data arrays according to a given criterion. The following tasks were solved: a study was conducted on the architecture of distributed data storages and parallel computing algorithms; based on this study, the limiting stages that slow down processing were identified; a method that eliminates the identified limiting stages was developed; ...
Added: May 31, 2022
Tyryshkina Y., , in : International Scientific and Practical Conference «Information Innovative Technologies», 2022. : [s.n.], 2022.
In this paper, we consider the problem of reducing the cost of computer time by developing and implementing a method for accelerating the operation of connecting distributed data arrays according to a given criterion. The following tasks were solved: a study was conducted on the architecture of distributed data storages and parallel computing algorithms; on ...
Added: May 31, 2022
T.M. Pribylev, Zaytsev M. N., O.L. Vikentyeva, Proceedings of the Institute for System Programming of the RAS 2022 Vol. 34 No. 2 P. 57-65
This paper investigates the feasibility of an actor-oriented approach to modelling the business processes of analytical systems development. The study analyzes existing management challenges in analytical systems development processes, identifies key business process modeling approaches, and proposes a modeling approach based on an actor-oriented approach with high flexibility and enhanced control over business artifacts. The article also describes examples of ...
Added: September 5, 2022