Benchmarking logs to test scalability of process discovery algorithms
Datacite citation style:
van der Aalst, Wil (2017): Benchmarking logs to test scalability of process discovery algorithms. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/uuid:1cc41f8a-3557-499a-8b34-880c1251bd6e
Dataset
The event logs in this dataset are intended to support the evaluation of the performance of process discovery algorithms. The largest event logs contain millions of events. If you need even larger datasets, you can generate them yourself using the included CPN Tools source files (*.cpn). Each file has two parameters: nofcases (the number of process instances) and nofdupl (the number of times a process is replicated with unique new names).
history
- 2017-10-12 first online, published, posted
publisher
Eindhoven University of Technology
format
media types: application/pdf, application/vnd.openxmlformats-officedocument.wordprocessingml.document, application/zip, text/csv, text/plain, text/xml
organizations
Eindhoven University of Technology, Department of Mathematics and Computer Science, Data Science Centre Eindhoven
DATA
files (1)
- data.zip (155,001,566 bytes), MD5: a0ceda17a699af1732b8bdaa40e80827
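Because the archive is large, it may be worth checking a download against the size and MD5 checksum listed above before unpacking it. The following is a minimal sketch in Python, not part of the dataset itself; the function names md5_of_file and verify_download are hypothetical, and it assumes the archive was saved locally as data.zip.

```python
import hashlib
from pathlib import Path

# Values copied from the file listing on this page.
EXPECTED_MD5 = "a0ceda17a699af1732b8bdaa40e80827"
EXPECTED_SIZE = 155_001_566  # bytes


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5=EXPECTED_MD5, expected_size=EXPECTED_SIZE):
    """Return True if the file at `path` matches both the expected size and MD5."""
    path = Path(path)
    if path.stat().st_size != expected_size:
        return False
    return md5_of_file(path) == expected_md5
```

Calling verify_download("data.zip") returns True only when both the byte count and the checksum match the listed values. The cheap size check runs first so that a truncated download is rejected without hashing the whole file.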