Early Performance Evaluation of Supervised Graph Anomaly Detection Problem Implemented in Apache Spark

Mazeev A.; A. Semenov; Dmitry D.; Timur Y.

?

Early Performance Evaluation of Supervised Graph Anomaly Detection Problem Implemented in Apache Spark

P. 84–91.

Mazeev A., Semenov A., Dmitry D., Timur Y.

Apache Spark is one of the most popular Big Data frameworks. Performance evaluation of Big Data frameworks is a topic of interest due to the increasing number and importance of data analytics applications within the context of HPC and Big Data convergence. In the paper we present early performance evaluation of a typical supervised graph anomaly detection problem implemented using GraphX and MLlib libraries in Apache Spark on a cluster.

Language: English

Full text

Text on another site

Keywords: machine learning Spark performance evaluation graph processing supervised anomaly detection MLlib

In book

Proceedings of the 3rd Ural Workshop on Parallel, Distributed, and Cloud Computing for Young Scientists

Vol. 1990: Proceedings of the 3rd Ural Workshop on Parallel, Distributed, and Cloud Computing for Young Scientists. , CEUR Workshop Proceedings, 2017.

Unsupervised Graph Anomaly Detection Algorithms Implemented in Apache Spark

Semenov A., Mazeev A., Dmitry D. et al., Lobachevskii Journal of Mathematics 2018 Vol. 39 No. 9 P. 1262–1269

The graph anomaly detection problem occurs in many application areas and can be solved by spotting outliers in unstructured collections of multi-dimensional data points, which can be obtained by graph analysis algorithms. We implement the algorithm for the small community analysis and the approximate LOF algorithm based on Locality-Sensitive Hashing, apply the algorithms to a ...

Added: June 10, 2019

SimVec: predicting polypharmacy side effects for new drugs

Lukashina N., Kartysheva E., Spjuth O. et al., Journal of Cheminformatics 2022 Vol. 14 Article 49

Polypharmacy refers to the administration of multiple drugs on a daily basis. It has demonstrated effectiveness in treating many complex diseases , but it has a higher risk of adverse drug reactions. Hence, the prediction of polypharmacy side effects is an essential step in drug testing, especially for new drugs. This paper shows that the ...

Added: October 11, 2022

Proceedings of the Fifth International Workshop on Experimental Economics and Machine Learning (EEML 2019),Perm, Russia, September 26, 2019

CEUR Workshop Proceedings, 2019.

Proceedings of the Fifth Workshop on Experimental Economics and Machine Learning at the National Research Univeristy Higher School of Economics co-located with the Seventh International Conference on Applied Research in Economics (iCare7) ...

Added: October 23, 2019

Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track

PMLR, 2022.

Added: July 27, 2022

Application of Artificial Intelligence Methods for Improvement of Strategic Decision-Making in Logistics

Kitzmann H., Strimovskaya A., Serova E., , in: Transfer, Diffusion and Adoption of Next-Generation Digital Technologies. IFIP WG 8.6 International Working Conference on Transfer and Diffusion of IT, TDIT 2023 Nagpur, India, December 15–16, 2023 Proceedings, Part IIVol. 698. Springer, 2024. P. 132–143.

Highly evolving economic environment requires from logistics companies fast response and agile solutions. Recently development of digital technologies gives significant advantages to logistics business. Hence many optimized processes belong to operational management level. At the same time the importance of digital technologies adoption to strategic management level should not be underestimated, as it allows gaining competitive advantages alongside the supply chain. ...

Added: January 12, 2024

Proceedings of the International Workshop "What can FCA do for Artificial Intelligence?" (FCA4AI at ECAI 2014)

Prague: CEUR Workshop Proceedings, 2014.

The first and the second edition of the FCA4AI Workshop showed that many researchers working in Artificial Intelligence are indeed interested by a well-founded method for classi- fication and mining such as Formal Concept Analysis (see http://www.fca4ai.hse.ru/). The first edition of FCA4AI was co-located with ECAI 2012 in Montpellier and published as http://ceur-ws.org/Vol-939/ while the ...

Added: September 12, 2014

Application of the Method of Multivariate Multi-stage Forecasting Based on the LSTM Deep Learning Model for Bitcoin Price Time Series

Natalia Sizykh, Said Dandamaev, Dmitry Sizykh, , in: 16th International Conference Management of large-scale system development (MLSD). IEEE, 2023. P. 1–5.

Forecasting data and research on cryptocurrency price forecasting methods are increasing in importance. So far, methods based on LSTM deep learning architecture have shown the best results in forecasting cryptocurrency prices. In order to improve the accuracy of forecasting data, this paper investigates the application of a multivariate multistep forecasting method based on the LSTM ...

Added: December 22, 2023

Belief Functions: Theory and Applications

Dordrecht, L., Heidelberg, NY: Springer, 2014.

This book constitutes the thoroughly refereed proceedings of the Third International Conference on Belief Functions, BELIEF 2014, held in Oxford, UK, in September 2014. The 47 revised full papers presented in this book were carefully selected and reviewed from 56 submissions. The papers are organized in topical sections on belief combination; machine learning; applications; theory; ...

Added: October 1, 2014

Style transfer in NLP: a framework and multilingual analysis with Friends TV series

Tikhonova M., Elina Telesheva, Mirzoev S. et al., , in: 2021 International Conference Engineering and Telecommunication (En&T). IEEE, 2022. P. 1–6.

Style transfer is an important and a rapidly developing of Natural Language Processing. This days more and more methods and models are proposed which allow us to generate text in predefined style. In this paper we propose a framework for style transfer of “Friends” TV series. The trained models are able to mimic one of ...

Added: May 21, 2022

Advances in Neural Information Processing Systems 38 (NeurIPS 2024)

[б.и.], 2024.

Proceedings of the international conference "Neural Information Processing Systems 2024." (NeurIPS 2024) ...

Added: October 15, 2024

An Algorithm to Satisfy the QoS Requirements in a Heterogeneous LoRaWAN Network

Bankov D., Khorov E., Lyakhov A., , in: 2020 IEEE Symposium on Computers and Communications (ISCC). IEEE, 2020. P. 1–6.

LoRaWAN is a popular low power wide area network technology widely used in many scenarios, such as environmental monitoring and smart cities. Different applications demand various quality of service (QoS), and their service within a single network requires special solutions for QoS provision. We consider the problem of QoS provision in heterogeneous LoRaWAN networks that ...

Added: October 17, 2020

Proceedings of the 2023 CLASP Conference on Learning with Small Data (LSD)

Gothenburg: Association for Computational Linguistics, 2023.

Current deep learning systems require large amounts of data in order to yield optimal results. Despite ever-increasing model and data size, these systems have achieved remarkable success across a wide range of tasks in NLP, and AI in general. However, these systems possess a number of limitations. Firstly, the models require a significant amount of ...

Added: December 5, 2023

Hidden Feedback Loops in Machine Learning Systems: A Simulation Model and Preliminary Results

Anton Khritankov, , in: Software Quality: Future Perspectives on Software Engineering Quality: 13th International Conference, SWQD 2021, Vienna, Austria, January 19–21, 2021, Proceedings. Springer, 2021. P. 54–65.

In this concept paper, we explore some of the aspects of quality of continuous learning artificial intelligence systems as they interact with and influence their environment. We study an important problem of implicit feedback loops that occurs in recommendation systems, web bulletins and price estimation systems. We demonstrate how feedback loops intervene with user behavior ...

Added: September 23, 2021

Learning and Intelligent Optimization. 8th International Conference, Lion 8, Gainesville, FL, USA, February 16-21, 2014. Revised Selected Papers

Springer, 2014.

This book constitutes the thoroughly refereed post-conference proceedings of the 8th International Conference on Learning and Optimization, LION 8, which was held in Gainesville, FL, USA, in February 2014. The 33 contributions presented were carefully reviewed and selected for inclusion in this book. A large variety of topics are covered, such as algorithm configuration; multiobjective ...

Added: September 15, 2014

ПРОЕКТНОЕ ПРЕДЛОЖЕНИЕ: АВТОМАТИЗИРОВАННЫЙ ПОДХОД К РЕКОМЕНДАТЕЛЬНЫМ СИСТЕМАМ

Сендерович М. А., В кн.: Межвузовская научно-техническая конференция студентов, аспирантов и молодых специалистов им. Е.В. Арменского. М.: МИЭМ НИУ ВШЭ, 2019. С. 223–224.

Данная работа посвящена актуальной теме автоматизации в машинном обучении на примере создания универсальной рекомендательной системы. В работе исследуются различные типы рекомендательных систем, акцент делается на подходы коллаборативной фильтрации. Изучаются методы автоматизации машинного обучения, на основе которых будет разработана данная рекомендательная система. ...

Added: October 31, 2020

Исследовательский проект как инструмент обучения методам анализа текста: предсказание класса поста в социальной сети

Suvorova A., Смирнова К. Р., Будин Е. А. et al., Компьютерные инструменты в образовании 2018 № 3 С. 49–64

The article describes a student research project on predicting the class of a post on a social network based on its textual content. The features of the project are discussed as an integral part of the trajectory of teaching data analysis methods, including text analysis methods and tools that are often not included in machine ...

Added: January 28, 2019

WIMS 2020: Proceedings of the 10th International Conference on Web Intelligence, Mining and Semantics

Association for Computing Machinery (ACM), 2020.

On behalf of the conference chairs, we welcome you to the 10th International Conference on Web Intelligence, Mining and Semantics (WIMS'20) hosted by LIUPPA Lab of the University de Pau & Pays de l'Adour, Biarritz-France. The 1st WIMS conference was organized in Sogndal Norway. Since then, it has always been published by ACM. After 10 ...

Added: August 28, 2020

Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics

Кузнецов А. С., Shvechikov P., Grishin A. et al., , in: International Conference on Machine Learning (ICML 2020)Vol. 119. PMLR, 2020. P. 5556–5566.

Added: October 17, 2020

Formal Concept Analysis: 16th International Conference, ICFCA 2021, Strasbourg, France, June 29 – July 2, 2021, Proceedings

Springer, 2021.

This book constitutes the proceedings of the 16th International Conference on Formal Concept Analysis, ICFCA 2021, held in Strasbourg, France, in June/July 2021. The 14 full papers and 5 short papers presented in this volume were carefully reviewed and selected from 32 submissions. The book also contains four invited contributions in full paper length. The research part ...

Added: July 10, 2021

Predictive Analytics Approach for Steel Billets Quality Control System

Belov A. V., Ekaterina A. Melekhova, Vorontsova T., , in: 2022 International Conference on Quality Management, Transport and Information Security, Information Technologies (IT&QM&IS). St. Petersburg: IEEE, 2022. P. 219–223.

The paper deals with the problem of improving the quality of metal products. Nowadays destructive methods of quality control of the steel billets prevail at metallurgical enterprises. This approach to assessing the quality of the steel billets is wasteful, which increases its cost. One of the ways to reduce the cost of production of metal ...

Added: January 28, 2023

Proceedings of the international conference on Uncertainty in Artificial Intelligence (UAI 2018)

[б.и.], 2018.

Proceedings of the international conference on Uncertainty in Artificial Intelligence (UAI 2018) ...

Added: October 29, 2018

Эффективность риск-менеджмента: оценка и ее влияние на инвестиционную привлекательность бизнеса

Makarova V. A., Управление финансовыми рисками 2015 № 04(44) С. 270–287

Methods of companies’ risk management are non-transparent, so the fact of it risk management system often does not increase business attractiveness. The main purpose of this study is to identify the key factors for the effectiveness of the risk management of the company whose management is able to increase its investment attractiveness. This article presents ...

Added: December 7, 2015

МАШИННОЕ ОБУЧЕНИЕ В ИССЛЕДОВАНИЯХ МЕДИКО-БИОЛОГИЧЕСКИХ И СОЦИАЛЬНО-ЭКОНОМИЧЕСКИХ ДАННЫХ

Buzmakov A. V., В кн.: МАШИННОЕ ОБУЧЕНИЕ В ИССЛЕДОВАНИЯХ МЕДИКО-БИОЛОГИЧЕСКИХ И СОЦИАЛЬНО-ЭКОНОМИЧЕСКИХ ДАННЫХ. СПб.: Федеральное государственное автономное образовательное учреждение высшего образования "Санкт-Петербургский политехнический университет Петра Великого", 2020. С. 284–333.

In many practical tasks it is needed to estimate an effect of treatment on individual level. For example, in medicine it is essential to determine the patients that would benefit from a certain medicament. In marketing, knowing the persons that are likely to buy a new product would reduce the amount of spam. In this ...

Added: December 7, 2021