Collaborative and Interactive Detection and Repair of Activity Labels in Process Event Logs


Abstract

Process mining uses computational techniques for process-oriented data analysis. The use of poor quality input data will lead to unreliable analysis outcomes (garbage in – garbage out), as it does for other types of data analysis. Among the key inputs to process mining analyses are activity labels in event logs which represent tasks that have been performed. Activity labels are not immune from data quality issues. Fixing them is an important but challenging endeavour, which may require domain knowledge and can be computationally expensive. In this paper we propose to tackle this challenge from a novel angle by using a gamified crowdsourcing approach to the detection and repair of problematic activity labels, namely those with identical semantics but different syntax. Evaluation of the prototype with users and a real-life log showed promising results in terms of quality improvements achieved.

Venue

2nd International Conference on Process Mining (ICPM)

Year

2020

Cite as

Sadeghianasl S., ter Hofstede A. H. M., Suriadi S. and Turkay S., “Collaborative and Interactive Detection and Repair of Activity Labels in Process Event Logs,” 2020 2nd International Conference on Process Mining (ICPM), 2020, pp. 41-48, doi: 10.1109/ICPM49681.2020.00017.