An Optimization Approach for Mining of Process Models with Infrequent Behaviors Integrating Data Flow and Control Flow
Author(s) -
Lili Wang,
Xianwen Fang,
Esther Asare,
Huan Fang
Publication year - 2021
Publication title -
scientific programming
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.269
H-Index - 36
eISSN - 1875-919X
pISSN - 1058-9244
DOI - 10.1155/2021/8874316
Subject(s) - computer science , process (computing) , process mining , trace (psycholinguistics) , data mining , event (particle physics) , construct (python library) , flow (mathematics) , control flow , perspective (graphical) , control (management) , noise (video) , variance (accounting) , business process , artificial intelligence , work in process , business process management , engineering , linguistics , operations management , philosophy , physics , programming language , geometry , mathematics , accounting , quantum mechanics , business , image (mathematics) , operating system
Infrequent behaviors of business process refer to behaviors that occur in very exceptional cases, and their occurrence frequency is low as their required conditions are rarely fulfilled. Hence, a strong coupling relationship between infrequent behavior and data flow exists. Furthermore, some infrequent behaviors may reveal very important information about the process. ,us, not all infrequent behaviors should be disregarded as noise, and identifying infrequent but correct behaviors in the event log is vital to process mining from the perspective of data flow. Existing process mining approaches construct a process model from frequent behaviors in the event log, mostly concentrating on control flow only, without considering infrequent behavior and data flow information. In this paper, we focus on data flow to extract infrequent but correct behaviors from logs. For an infrequent trace, frequent patterns and interactive behavior profiles are combined to find out which part of the behavior in the trace occurs in low frequency. And, conditional dependency probability is used to analyze the influence strength of the data flow information on infrequent behavior. An approach for identifying effective infrequent behaviors based on the frequent pattern under data awareness is proposed correspondingly. Subsequently, an optimization approach for mining of process models with infrequent behaviors integrating data flow and control flow is also presented. ,e experiments on synthetic and real-life event logs show that the proposed approach can distinguish effective infrequent behaviors from noise compared with others.,e proposed approaches greatly improve the fitness of the mined process model without significantly decreasing its precision.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom