On the efficient application of Aho-Corasick algorithm in process mining

A. Konchagin; A. A. Kalenkova

doi:10.1007/978-3-319-73013-4_34


P. 371–377.

Konchagin A., Kalenkova A. A.

In this paper we present an approach for searching for sub-traces in event logs generated by information systems. Our technique builds on the Aho-Corasick algorithm and extends it to search several event log traces simultaneously. The computational complexity of the proposed approach is estimated. Moreover, the approach was implemented and verified on real-life event logs. The results show that it reduces search time for event logs with a high proportion of similar traces.
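As an illustration of the kind of search the abstract describes, here is a minimal Python sketch of Aho-Corasick matching over traces, where each trace is a list of activity names and each pattern is a sub-trace. The abstract does not say how the simultaneous search over several traces is organized, so the grouping of identical traces in search_log is an assumption made for this sketch and not the authors' method. All function names (build_automaton, search_trace, search_log) are hypothetical.

```python
from collections import defaultdict, deque


def build_automaton(patterns):
    """Build an Aho-Corasick automaton over sequences of activity names."""
    goto = [{}]   # goto[state][activity] -> next state
    fail = [0]    # failure link of each state
    out = [[]]    # indices of the patterns that end at each state
    for idx, pattern in enumerate(patterns):
        state = 0
        for activity in pattern:
            if activity not in goto[state]:
                goto.append({})
                fail.append(0)
                out.append([])
                goto[state][activity] = len(goto) - 1
            state = goto[state][activity]
        out[state].append(idx)
    # Breadth-first traversal: failure links of shallower states come first.
    queue = deque(goto[0].values())
    while queue:
        state = queue.popleft()
        for activity, nxt in goto[state].items():
            queue.append(nxt)
            f = fail[state]
            while f and activity not in goto[f]:
                f = fail[f]
            fail[nxt] = goto[f].get(activity, 0)
            out[nxt] = out[nxt] + out[fail[nxt]]
    return goto, fail, out


def search_trace(trace, automaton):
    """Return (pattern index, end position) for every match in one trace."""
    goto, fail, out = automaton
    state, hits = 0, []
    for pos, activity in enumerate(trace):
        while state and activity not in goto[state]:
            state = fail[state]
        state = goto[state].get(activity, 0)
        for idx in out[state]:
            hits.append((idx, pos))
    return hits


def search_log(log, patterns):
    """Search every trace of a log; identical traces are scanned only once.

    Grouping identical traces is an assumption of this sketch, not the
    technique from the paper.
    """
    automaton = build_automaton(patterns)
    variants = defaultdict(list)
    for trace_id, trace in enumerate(log):
        variants[tuple(trace)].append(trace_id)
    results = {}
    for variant, trace_ids in variants.items():
        hits = search_trace(variant, automaton)
        for trace_id in trace_ids:
            results[trace_id] = hits
    return results


log = [["a", "b", "c", "d"], ["a", "b", "c", "d"], ["b", "c", "b", "c"]]
patterns = [["b", "c"], ["c", "d"]]
results = search_log(log, patterns)
```

In this toy log the first two traces are identical, so the automaton scans that variant only once and the result is shared between them. This is one simple way in which a high proportion of similar traces can reduce search time.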

Keywords: process mining; event logs; finite-state machine; Aho-Corasick algorithm

Publication based on the results of:

Synthesis and analysis of process models (2017)

In book

Analysis of Images, Social Networks and Texts. 6th International Conference, 2017, Revised Selected Papers

Vol. 10716. Cham: Springer, 2018.

Merging Directly-Follows Graphs and Sankey Diagrams for Visualizing Acyclic Processes

Derezovskiy I., Shaimov N., Lomazova I. A. et al., Proceedings of the Institute for System Programming of the RAS 2024 Vol. 36 No. 4 P. 155–168

This paper proposes a method to visualize models of acyclic processes based on merging Directly-Follows Graphs (DFG) and Sankey diagrams. DFG is a popular graphical model to visualize discrete process models, while Sankey diagrams are used to represent flows of any kind. Our approach, based on flow diagrams, allows us to highlight individual cases or ...

Added: October 3, 2024

Discovering hierarchical process models: an approach based on events partitioning

A. K. Begicheva, I. A. Lomazova, R. A. Nesterov, Modeling and Analysis of Information Systems 2024 Vol. 31 No. 3 P. 294–315

Process mining is a field of computer science that deals with the discovery and analysis of process models based on automatically generated event logs. Currently, many companies are using this technology to optimize and improve their business processes. However, a discovered process model may be too detailed, sophisticated, and difficult for experts to understand. In ...

Added: September 14, 2024

Questions of Definability

Semenov A., in: World Congress (June 26–30, 2023, Moscow). Systems Theory, Algebraic Biology, Artificial Intelligence: Mathematical Foundations and Applications: Selected Works. Moscow: [s.n.], 2023. P. 390–405.

Added: March 13, 2024

Searching for Deviations in Trading Systems: Combining Control-Flow and Data Perspectives

Julio C. Carrasquel, Irina A. Lomazova, in: 6th International Conference, TMPA 2021, Tomsk, Russia, November 25–27, 2021, Revised Selected Papers. Tools and Methods of Program Analysis. Vol. 1559: CCIS. Springer, 2024. P. 94–106.

Trading systems are software platforms that support the exchange of securities (e.g., company shares) between participants. In this paper, we present a method to search for deviations in trading systems by checking conformance between colored Petri nets and event logs. Colored Petri nets (CPNs) are an extension of Petri nets, a formalism for modeling of ...

Added: January 31, 2024

Development of a Constructor of Rules for Generating and Processing Event Series

Lyadova L. N., Platunov A. I., Информатизация и связь 2024 No. 1 P. 84–89

The goal of the project is to develop tools for generating and preprocessing event logs for process analysis using process mining methods. The implementation approach is based on low-code principles. Users should be able to develop their own rules for generating and processing event logs that include additional attributes (event series). It is based on ...

Added: January 19, 2024

Business Process Management Workshops. BPM 2023 International Workshops, Utrecht, The Netherlands, September 11–15, 2023, Revised Selected Papers

Switzerland: Springer, 2024.

This book constitutes revised papers from the International Workshops held at the 21st International Conference on Business Process Management, BPM 2023, in Utrecht, The Netherlands, during September 2023. Papers from the following workshops are included: • 7th International Workshop on Artificial Intelligence for Business Process Management (AI4BPM 2023) • 7th International Workshop on Business Processes Meet Internet-of-Things (BP-Meet-IoT ...

Added: January 17, 2024

Development of a Constructor of Event Logs with Additional Attributes

Platunov A. I., Lyadova L. N., in: Tool Development Technologies (TRIS-2023): Conference Proceedings. Taganrog: Southern Federal University Press, 2023. P. 113–122.

The goal of the project is to develop tools for generating and processing event logs with additional attributes for analyzing processes with process mining tools. The implementation is based on low-code principles. This enables non-programmers to develop their own data processing rules for generating and preprocessing event logs with additional attributes. The core of the system ...

Added: December 16, 2023

An Approach to Developing Ontology-Based Tools for Event Series Analysis

Anton Platunov, Lyudmila Lyadova, Matta N. et al., in: IC3K 2023: Proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management. Volume 2: KEOD, Rome, Italy, November 13–15, 2023. Lisbon: SciTePress, 2023. P. 323–330.

Existing process mining methods make it possible to investigate processes in different domains. Besides mandatory event attributes such as identifier, activity, and timestamp, additional event attributes can be present in data sources. Analyzing the dynamics of changes in the values of additional attributes yields important information about the system. The applications must be developed by programmers ...

Added: November 22, 2023

Discovering Process Models from Event Logs of Multi-Agent Systems Using Event Relations

A. A. Sherstyugina, R. A. Nesterov, Proceedings of the Institute for System Programming of the RAS 2023 Vol. 35 No. 3 P. 11–32

The structure of a process model directly discovered from an event log of a multi-agent system often does not reflect the behavior of individual agents and their interactions. We suggest analyzing the relations between events in an event log to localize actions executed by different agents and involved in their asynchronous interaction. Then, a process ...

Added: October 31, 2023

Using Process Mining to Leverage the Development of a Family of Mobile Applications

L.A. Rezunik, A.I. Perevoznikova, D.V. Eremina et al., Proceedings of the Institute for System Programming of the RAS 2023 Vol. 35 No. 3 P. 171–186

Enterprises often provide their services via a family of applications based on various platforms. Applications in such a family can behave differently. Their development processes can differ as well. Moreover, modern development processes are often complex and sometimes vague. This can lead to bugs, defects, and unwanted discrepancies in applications. In this paper, we show that ...

Added: October 30, 2023

Meshcheryakov M. V., Sukharev L. A. Practicum on the Theory of Finite Automata and Formal Languages. Saransk: Mordovia University Press, 2018. 224 p.

Meshcheryakov M. V., Sukharev L. A., Saransk: Mordovia University Press, 2018.

The book is an introductory course on the theory of formal languages and finite automata. It presents the main material of the discipline related to the mathematical foundations of a number of syntactic methods of informatics and programming. The book is intended for undergraduate students in the following fields of study: fundamental computer science and information ...

Added: October 12, 2023

On Interpretations in Büchi Arithmetics

Zapryagaev A. / Series arXiv "math". 2022.

Büchi arithmetics BA_n, n >= 2, are extensions of Presburger arithmetic with a unary functional symbol V_n(x) denoting the largest power of n that divides x. Definability of a set in BA_n is equivalent to its recognizability by a finite automaton receiving numbers in their n-ary expansion. We show that Büchi arithmetics BA_n and BA_m ...

Added: December 5, 2022

Event Series Generation and Analysis Based on Multifaceted Ontology

Zayakin Viktor, Lyadova Lyudmila, Smirnov M. et al., in: 2022 IEEE 16th International Conference on Application of Information and Communication Technologies (AICT). Washington: IEEE, 2022. P. 1–6.

The article presents an approach to analyzing processes in different domains using data from various Internet sources (open databases, news feeds, social networks, etc.). It is suitable for carrying out cross-disciplinary research encompassing processes in various fields (for example, economics, medicine, politics, ecology, etc.) in which events can have mutual effects. The concept ...

Added: October 29, 2022

Analysis of Students' Academic Performance Using Event Logs of an E-Learning Environment

Shaimov N., Lomazova I. A., Mitsyuk A. A. et al., Modeling and Analysis of Information Systems 2022 Vol. 29 No. 4 P. 286–314

Modern educational process involves the use of electronic educational environments. These are special information systems that are both a means for storing educational materials and a tool for conducting tests, collecting homework, keeping a grade book, and working together. Such environments produce a large amount of data containing the recorded behavior of students and teachers ...

Added: October 14, 2022