Testing Representational Biases
datasetposted on 12.10.2017 by Wil van der Aalst
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
A set of 120 event logs was created to test the effect of the representational bias used and the effect of erratic and infrequent behavior added to highly regular behavior. For each log the underlying model is known. Therefore, such as “gold standard model” serves as a reference for the real process. For real-life event logs there is no such a reference model. Moreover, quality criteria like precision and generalization are still subject to discussion. Therefore, these synthetic event logs were created. The hope is that these event logs contribute to better process discovery tools that can deal with different representational biases and different types of noisy behavior.