Improving the performance of process discovery algorithms by instance selection
Author(s) -
Mohammadreza Fani Sani,
Sebastiaan van Zelst,
Aalst van der
Publication year - 2020
Publication title -
computer science and information systems
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.244
H-Index - 24
eISSN - 2406-1018
pISSN - 1820-0214
DOI - 10.2298/csis200127028s
Subject(s) - computer science , event (particle physics) , process (computing) , data mining , business process discovery , selection (genetic algorithm) , sampling (signal processing) , algorithm , business process , machine learning , artificial intelligence , work in process , business process modeling , physics , filter (signal processing) , quantum mechanics , marketing , computer vision , business , operating system
Process discovery algorithms automatically discover process models based on event data that is captured during the execution of business processes. These algorithms tend to use all of the event data to discover a process model. When dealing with large event logs, it is no longer feasible using standard hardware in limited time. A straightforward approach to overcome this problem is to down-size the event data by means of sampling. However, little research has been conducted on selecting the right sample, given the available time and characteristics of event data. This paper evaluates various subset selection methods and evaluates their performance on real event data. The proposed methods have been implemented in both the ProM and the RapidProM platforms. Our experiments show that it is possible to considerably speed up discovery using instance selection strategies. Furthermore, results show that applying biased selection of the process instances compared to random sampling will result in simpler process models with higher quality.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom