Automated creation of virtual Apache Spark clusters in the Openstack cloud environment

Proceedings of the Institute for System Programming of the RAS. 2014. Vol. 26. No. 4. P. 33–44.

Kuznetsov S. D., Turdakov D. Y., Borisenko O. D.

This article is devoted to automating the creation and management of clusters for Apache Spark, a MapReduce implementation, in Openstack environments. The project resulted in an open-source (Apache 2.0 license) toolchain for on-demand creation of virtual clusters in Openstack environments. The article gives an overview of the solutions for cluster automation in cloud environments that existed at the start of 2014. It also gives a brief overview of issues in the Openstack Heat project, which provides a compatibility layer for the Amazon EC2 API. The final implementation presented in the article is an almost straightforward port, with some improvements, of the existing toolchain that automates Apache Spark cluster creation in the Amazon EC2 environment. A prepared base-system virtual machine image for Openstack is also provided. Plans for further work are connected with the Ansible project. Using Ansible for this problem would make it possible to implement a versatile, environment-agnostic solution that works with any cloud computing provider, a set of Docker containers, or bare-metal clusters, without depending on a prepared operating system image. The current work does not use Ansible because key features were missing when the project started. The solution presented in this article has already been tested in a production environment for a graph theory research article.

Research target: Computer Science

Priority areas: IT and mathematics

Language: Russian

Creating virtual Apache Spark clusters in cloud environments using orchestration systems

Borisenko O. D., Pastukhov R. K., Kuznetsov S. D., Proceedings of the Institute for System Programming of the RAS 2016 Vol. 28 No. 6 P. 111–120

Apache Spark is a framework providing fast computations on Big Data using the MapReduce model. With cloud environments, Big Data processing becomes more flexible, since they allow virtual clusters to be created on demand. One of the most powerful open-source cloud environments is Openstack. The main goal of this project is to provide an ability to create virtual ...

Added: January 25, 2018

Development of a scalable software infrastructure for data storage and processing in computational biology problems

Kuznetsov S. D., Turdakov D. Y., Borisenko O. D. et al., Proceedings of the Institute for System Programming of the RAS 2014 Vol. 26 No. 4 P. 45–54

This article is an overview of a scalable infrastructure for storing and processing genome data in genetics problems. The overview describes the technologies used and the organization of unified access to the genome processing APIs of different underlying services. The article also covers methods for supporting scalable and cloud computing technologies. The first service in virtual genome ...

Added: November 26, 2017

A methodology for quantitative assessment of information security risk for an organization's cloud infrastructure

Tsaregorodtsev A. V., Makarenko E. V., Национальные интересы: приоритеты и безопасность 2014 No. 44 P. 30–41

Almost all of the technologies that are now part of the cloud paradigm existed before, but until recently the market had no offerings that brought these emerging technologies together in a single commercially attractive solution. However, in the last decade public cloud services have appeared, through which these technologies are, on the one hand, available to ...

Added: March 26, 2015

Building a hybrid secure cloud environment for an organization's IT infrastructure

Tsaregorodtsev A. V., Los A., Sorokin A., Промышленные АСУ и контроллеры 2015 No. 11 P. 26–31

Cloud computing is becoming one of the most common IT technologies for deploying applications, thanks to its key features: flexible solutions, available on request, and a good price / performance ratio. Migrating to the cloud-based architecture allows organizations to reduce the total cost of implementation and support infrastructure, and reduce development time for new business ...

Added: March 15, 2016

A fog computing platform based on wireless sensor networks

Dvornikov A. A., Качество. Инновации. Образование 2014 No. 8 P. 64–70

Wireless sensor networks are proposed as a platform for fog computing. This paradigm describes a computation process that runs on a new structural layer situated between cloud computing and end devices. The paper analyzes the problems that arise when a wireless sensor network is used for fog computing and proposes a research direction for solving them. ...

Added: October 9, 2014

Ensuring information security of cloud computing

Isaev E., Dumsky D., Samodurov V. et al., Математическая биология и биоинформатика 2015 Vol. 10 No. 2 P. 567–579

The rapid development of information technology in today's society sets new requirements for data security technologies, for methods of remote access and data processing, and for an overall reduction in the cost of working with information. In recent years, cloud computing has been widely proposed as the ideal solution to all these problems. This ...

Added: December 19, 2015

An approach to information security risk assessment in cloud environments

Tsaregorodtsev A. V., Malyuk A. A., Makarenko E. V., Безопасность информационных технологий 2014 No. 4 P. 68–74

Because cloud computing brings new challenges in the field of information security, it is imperative for organizations to control the process of information risk management in the cloud. This paper proposes a risk assessment approach for assessing the potential damage from the attack on the implementation of components of ...

Added: March 26, 2015

A model for assessing information security risks of information systems based on cloud computing

Tsaregorodtsev A. V., Ermoshkin G. N., Национальная безопасность / nota bene 2013 No. 6 P. 46–54

Widespread acceptance and adoption of cloud computing calls for adaptation and development of existing risk assessment models of information systems. The approach suggested in this article can be used to assess the risks of information systems based on cloud computing technology and to assess the effectiveness of security measures. ...

Added: March 17, 2014

A formalized security model for workflows of information and telecommunication systems based on cloud computing technology

Tsaregorodtsev A. V., Нелинейный мир 2013 Vol. 11 No. 9 P. 610–621

Use of cloud computing applications and services requires review and adaptation of existing formal models for informational telecommunication systems security. It is necessary to consider the benefits of cloud deployment models and provide the procedure for allocating process among components of cloud computing environment for achieving confidentiality and data protection. ...

Added: March 26, 2015

Implementation of a service for running Apache Spark jobs and creating Apache Spark clusters based on Openstack Sahara

S. Kuznetsov, Borisenko O. D., Aleksiyants A. V. et al., Proceedings of the Institute for System Programming of the RAS 2015 Vol. 27 No. 5 P. 35–48

In this paper the problem of creating virtual clusters in clouds for big data analysis with Apache Hadoop and Apache Spark is discussed. Existing methods for Apache Spark clusters creation are described in this work. Also the implemented solution for building Apache Spark clusters and Apache Spark jobs execution in Openstack environment is described. The ...

Added: January 23, 2018

A methodology for quantitative risk assessment in the information security of an organization's cloud infrastructure

Tsaregorodtsev A. V., Makarenko E. V., Дайджест-финансы 2015 No. 1(233) P. 56–67

Added: March 15, 2016

Optimization of the architecture of a hybrid cloud computing environment by the total cost of ownership criterion

Tsaregorodtsev A. V., Makarenko E. V., Безопасность информационных технологий 2014 No. 4 P. 59–67

Achieving the goals of information security is a key factor in the decision to outsource information technology and, in particular, to decide on the migration of organizational data, applications, and other resources to the infrastructure, based on cloud computing. And the key issue in the selection of optimal architecture and the subsequent migration of business ...

Added: March 26, 2015

On the existence of provably secure cloud computing systems

Zakharov V., Varnovsky N. P., Shokurov A. V., Вестник Московского университета. Серия 15: Вычислительная математика и кибернетика 2016 No. 2 P. 32–38

We study a formal model of cloud computing systems supplied with auxiliary cryptoservers. Assuming the existence of a secure threshold somewhat homomorphic public-key cryptosystem, we show how to build a secure cloud computing system in the framework of this model. ...

Added: October 13, 2016

An approach to building an organization's information infrastructure based on a hybrid cloud environment

Tsaregorodtsev A. V., Mukhin I. N., Boridko S. I., Информация и безопасность 2015 Vol. 18 No. 3 P. 400–403

Because cloud computing brings new challenges in the field of information security, it is imperative for the organization to control the process of information security management in the cloud. The level of confidence in the services provided can vary significantly depending on the goals of the organization, the structure of ...

Added: March 15, 2016

Data security risk assessment in information and telecommunication systems based on cloud computing

Tsaregorodtsev A. V., Lavrinenko M. M., Lapenkova N. V., Безопасность информационных технологий 2014 No. 1 P. 36–40

Cloud computing will be one of the most common IT technologies to deploy applications, due to its key features: on-demand network access to a shared pool of configurable computing resources, flexibility and good quality/price ratio. Migrating to cloud architecture enables organizations to reduce the overall cost of implementing and maintaining the infrastructure and reduce development ...

Added: March 26, 2015

A method for modeling routes for distributing the processing of critical data in a hybrid cloud computing environment based on modified Petri nets

Tsaregorodtsev A. V., Derbin E. A., Mukhin I. N., Информация и безопасность 2015 Vol. 18 No. 3 P. 408–411

Using cloud computing to build an organization's IT infrastructure means that the organization gives up direct control over security aspects. The problem of data privacy therefore needs to be solved when designing an architecture based on cloud computing technology. In the article the simulation method of data processing using Petri ...

Added: March 15, 2016

Basic principles for building an information security goal tree for a cloud computing environment

Tsaregorodtsev A. V., Ermoshkin G. N., Национальная безопасность / nota bene 2013 No. 5 P. 69–79

The shift of the security perimeter and the move of organizations' critical assets out of internal control, followed by the migration of these assets to a cloud environment, have brought the problem of managing the information security of corporate systems based on cloud computing technology to the fore. All this ...

Added: March 26, 2015

An integrated approach to building secure information and telecommunication systems based on a hybrid cloud environment

Tsaregorodtsev A. V., Los A., Sorokin A., Национальная безопасность / nota bene 2015

The article examines issues of ensuring information security in cloud computing. Information and telecommunication systems based on cloud computing technology have recently become increasingly widespread owing to the constantly growing need to process and store large volumes of data, which confirms the relevance of the issues considered in the article. The key point in the use of cloud computing ...

Added: October 20, 2015

A methodology for building secure information and telecommunication systems based on a hybrid cloud environment

Tsaregorodtsev A. V., Mukhin I. N., Bely A. F., Информация и безопасность 2015 Vol. 18 No. 3 P. 404–407

The widespread use of cloud computing calls for adaptation and refinement of existing approaches to the construction of information and telecommunication systems. Migrating data to a cloud-based architecture reduces the total cost of implementing and maintaining infrastructure and shortens development time for new business applications. At the same time, the question of information security remains open. ...

Added: March 15, 2016

Building a hybrid secure cloud environment for an organization's IT infrastructure

Tsaregorodtsev A. V., Los A., Sorokin A., Промышленные АСУ и контроллеры 2015 No. 11 P. 26–31

Added: October 20, 2015

An approach to building a hybrid secure cloud environment

Tsaregorodtsev A. V., Kachko A. K., Lavrinenko M. M., Безопасность информационных технологий 2014 No. 1 P. 22–27

In response to the ever-growing need to store and process data, information and telecommunication systems based on cloud computing have taken a leading position. In this case, the key point in the use of cloud computing is the problem of information security. This article is primarily intended to cover the ...

Added: March 26, 2015

Building goal trees to identify security requirements for a cloud computing environment

Tsaregorodtsev A. V., Национальная безопасность / nota bene 2013 No. 5 P. 51–68

The need to improve the core principles of information security management in a cloud environment, and to make them more efficient, leads to the area of multidimensional "systemic" properties. Application of technology and methods of structural synthesis of formal information security management systems (ISMS) in the cloud, connecting different structure hierarchies requirements would more ...

Added: March 17, 2014

Simplifying the Use of Clouds for Scientific Computing with Everest

Volkov S., Sukhoroslov O. V., Procedia Computer Science 2017 Vol. 119 P. 112–120

Cloud computing has emerged as a new paradigm for on-demand access to a vast pool of computing resources that provides an alternative to using on-premises resources. This paper discusses the challenges related to using cloud computing infrastructures for scientific computing. An approach based on the Everest platform that addresses these challenges is presented along with the ...

Added: August 30, 2018

Designing an IoT platform for energy resource management of smart buildings

Kychkin A., Deryabin A. I., Vikentyeva O. et al., Прикладная информатика 2018 Vol. 13 No. 4 P. 29–41

The paper considers the problem of designing a cyber-physical system used as a service for controlling smart buildings with Internet of Things (IoT) technologies. Such software platforms are part of Building Energy Management Systems (BEMS) and are an instrument for implementing energy savings in buildings. IoT servers and ...

Added: September 5, 2018