Process Discovery Contest 2020
doi:10.4121/14626020.v1
The doi above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future.
For a link that will always point to the latest version, please use
doi: 10.4121/14626020
Datacite citation style:
Eric Verbeek (2021): Process Discovery Contest 2020. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/14626020.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite
Dataset
This is the data set that was used for the Process Discovery Contest of 2020 (PDC 2020). The data set contains 192 training logs, 192 corresponding test logs, 192 corresponding ground truth logs, and 96 models. The logs are all stored using the IEEE XES file format (see either https://www.xes-standard.org/ or https://ieeexplore.ieee.org/document/7740858), while the models are workflow nets (a subclass of Petri nets) stored in the PNML file format (see https://www.iso.org/obp/ui/#iso:std:iso-iec:15909:-2:ed-1:v1:en).
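Because the logs are XES files, which are plain XML, they can be inspected without a process mining toolkit. The following is a minimal sketch using only the Python standard library. It assumes the usual XES layout of `trace` elements containing `event` elements, with the activity name stored in a `string` attribute whose key is `concept:name`. The helper names `local_name` and `summarize_xes` are illustrative and not part of the data set.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def local_name(tag):
    """Strip an XML namespace such as '{http://www.xes-standard.org/}' from a tag."""
    return tag.rsplit("}", 1)[-1]


def summarize_xes(source):
    """Count traces, events, and activity names in one XES log.

    `source` is a file path or a binary file-like object.
    Returns (number of traces, number of events, Counter of activity names).
    """
    traces = 0
    events = 0
    activities = Counter()
    for _, elem in ET.iterparse(source, events=("end",)):
        name = local_name(elem.tag)
        if name == "event":
            events += 1
            # Assumption: the activity name is the 'concept:name' string attribute.
            for attr in elem:
                if local_name(attr.tag) == "string" and attr.get("key") == "concept:name":
                    activities[attr.get("value")] += 1
        elif name == "trace":
            traces += 1
            elem.clear()  # release the finished trace to keep memory use low
    return traces, events, activities
```

Calling `summarize_xes` on a log extracted from one of the archives returns the three values for that log. The file names inside the archives are not listed on this page, so any path passed to the function has to be taken from the extracted contents.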
history
- 2021-05-21 first online, published, posted
publisher
4TU.ResearchData
format
IEEE XES
ISO PNML
organizations
Task Force on Process Mining (https://tf-pm.org)
DATA
files (5)
- readme.txt, 2,954 bytes, MD5: a014de4d17e7650f4c1f2b6318f91da0
- Ground Truth Logs.zip, 8,790,677 bytes, MD5: c74abcec5f7326415fd4e5f2f94a7b69
- Models.zip, 555,869 bytes, MD5: 33c642e9edf346b0274a3872f3d8ed22
- Test Logs.zip, 8,257,970 bytes, MD5: 9f5c132179469f8c62722da58ddc1b3c
- Training Logs.zip, 8,116,355 bytes, MD5: 02c965045a5d48adf123c358c70490de
total: 25,723,825 bytes unzipped
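The MD5 checksums in the listing above can be used to check that a download is intact. The following is a minimal sketch using Python's `hashlib`. The name-to-checksum pairing is the one read from the listing, and the helper names `md5_of_file` and `verify_downloads` are illustrative only.

```python
import hashlib
from pathlib import Path

# Checksums as published in the file listing (pairing read from that listing).
EXPECTED_MD5 = {
    "readme.txt": "a014de4d17e7650f4c1f2b6318f91da0",
    "Ground Truth Logs.zip": "c74abcec5f7326415fd4e5f2f94a7b69",
    "Models.zip": "33c642e9edf346b0274a3872f3d8ed22",
    "Test Logs.zip": "9f5c132179469f8c62722da58ddc1b3c",
    "Training Logs.zip": "02c965045a5d48adf123c358c70490de",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory, expected=EXPECTED_MD5):
    """Map each expected file name to True (match), False (mismatch), or None (missing)."""
    results = {}
    for name, checksum in expected.items():
        path = Path(directory) / name
        results[name] = (md5_of_file(path) == checksum) if path.is_file() else None
    return results
```

Running `verify_downloads` on the folder that holds the downloaded files gives one result per file. A `False` entry suggests a corrupted or incomplete download, and `None` means the file was not found in that folder.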