Preview

Modeling and Analysis of Information Systems

Advanced search

Application of Genetic Algorithms for Finding Edit Distance between Process Models

https://doi.org/10.18255/1818-1015-2018-6-711-725

Abstract

Finding graph-edit distance (graph similarity) is an important task in many computer science areas, such as image analysis, machine learning, chemicalinformatics. Recently, with the development of process mining techniques, it became important to adapt and apply existing graph analysis methods to examine process models (annotated graphs) discovered from event data. In particular, finding graph-edit distance techniques can be used to reveal patterns (subprocesses), compare discovered process models. As it was shown experimentally and theoretically justified, exact methods for finding graph-edit distances between discovered process models (and graphs in general) are computationally expensive and can be applied to small models only. In this paper, we present and assess accuracy and performance characteristics of an inexact genetic algorithm applied to find distances between process models discovered from event logs. In particular, we find distances between BPMN (Business Process Model and Notation) models discovered from event logs by using different process discovery algorithms. We show that the genetic algorithm allows us to dramatically reduce the time of comparison and produces results which are close to the optimal solutions (minimal graph edit distances calculated by the exact search algorithm).

About the Authors

Anna A. Kalenkova
National Research University Higher School of Economics
Russian Federation

senior research fellow, Laboratory of Process-Aware Information Systems

20 Myasnitskaya St., Moscow 101000



Danil A. Kolesnikov
National Research University Higher School of Economics
Russian Federation

student, Faculty of Computer Science

20 Myasnitskaya St., Moscow 101000



References

1. Van der Aalst W.M.P., Process Mining — Data Science in Action, Second Edition, Springer, 2016.

2. Van der Aalst W. M. P., Weijters T., Maruster L., “Workflow Mining: Discovering Process Models from Event Logs”, IEEE Transactions on Knowledge and Data Engineering, 16:9 (2004), 1128–1142.

3. Leemans S. J. J., Fahland D., van der Aalst W. M. P., “Discovering Block-Structured Process Models from Incomplete Event Logs”, Application and Theory of Petri Nets and Concurrency, Lecture Notes in Computer Science, 8489, Springer, 2014, 91–110.

4. Kalenkova A.A., Ageev A.A., Lomazova I.A., van der Aalst W.M.P., “E-Government Services: Comparing Real and Expected User Behavior”, Business Process Management Workshops, Springer International Publishing, 2018, 484–496.

5. Garey M. R., Johnson D. S., Computers and Intractability; A Guide to the Theory of NP-Completeness, W. H. Freeman & Co., 1990, ISBN: 0716710455.

6. Ivanov S. Y., Kalenkova A. A., van der Aalst W. M. P., “BPMNDiffViz: A Tool for BPMN Models Comparison”, Proceedings of the BPM Demo Session 2015 Co-located with the 13th International Conference on Business Process Management (BPM 2015) (Innsbruck, Austria, September 2, 2015), 2015, 35–39.

7. Hart P.E, Nilsson N.J, Raphael B., “A Formal Basis for the Heuristic Determination of Minimum Cost Paths”, IEEE Transactions on Systems Science and Cybernetics, 4:2 (1968), 100–107.

8. Business Process Model and Notation (BPMN), Object Management Group, formal/2013-12-09, 2013.

9. Cross A. D. J., Wilson R. C., Hancock E. R., “Inexact Graph Matching Using Genetic Search”, Pattern Recognition, 30:6 (1997), 953 – 970.

10. Riesen K., Fischer A., Bunke H., “Improving Approximate Graph Edit Distance Using Genetic Algorithms”, Structural, Syntactic, and Statistical Pattern Recognition, Springer Berlin Heidelberg, 2014, 63–72.

11. Kalenkova A. A., van der Aalst W. M. P., Lomazova I. A., Rubin V. A., “Process Mining Using BPMN: Relating Event Logs and Process Models”, Software & Systems Modeling, 16:4 (2017), 1019–1048.

12. Levenshtein V. I., “Binary Codes Capable of Correcting Deletions, Insertions and Reversals”, Soviet Physics Doklady, 10 (1966), 707.

13. Gladkov L. A., Kureichik V. V., Kureichik V. M., Genetic Algorithms, Fizmatlit, Moscow, 2010, 368 pp., (in Russian).


Review

For citations:


Kalenkova A.A., Kolesnikov D.A. Application of Genetic Algorithms for Finding Edit Distance between Process Models. Modeling and Analysis of Information Systems. 2018;25(6):711-725. (In Russ.) https://doi.org/10.18255/1818-1015-2018-6-711-725

Views: 858


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 1818-1015 (Print)
ISSN 2313-5417 (Online)