Process Discovery Contest 2022
doi:10.4121/21261402.v1
The doi above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future.
For a link that will always point to the latest version, please use
doi: 10.4121/21261402
doi: 10.4121/21261402
Datacite citation style:
Eric Verbeek (2022): Process Discovery Contest 2022. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/21261402.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite
Dataset
This data set contains the data set as was used for the Process Discovery Contest of 2022 (PDC 2022). The data set contains 480 training logs, 96 corresponding test logs, 96 corresponding ground truth logs, and 96 models. The logs are all stored using the IEEE XES file format (see either https://www.xes-standard.org/ or https://ieeexplore.ieee.org/document/7740858), while the models are workflow nets (a subclass of Petri nets) stored in the PNML file
format (see https://www.iso.org/obp/ui/#iso:std:iso-iec:15909:-2:ed-1:v1:en).
history
- 2022-10-03 first online, published, posted
publisher
4TU.ResearchData
format
IEEE XES
ISO PNML
organizations
Task Force on Process Mining (https://tf-pm.org)
DATA
files (5)
- 3,206 bytesMD5:
24e7e509ba475aff43552f9e3c373097
readme.txt - 1,532,373 bytesMD5:
0bfaf2157312ee45915226856406b472
Base Logs.zip - 1,678,679 bytesMD5:
cf15848c9373b5aef1573a11f64c1647
Ground Truth Logs.zip - 1,531,888 bytesMD5:
f851b8dfed112496d7920b7627c4ce7e
Test Logs.zip - 5,120,236 bytesMD5:
c56e3e175b6132919c7f821daefdcb91
Training Logs.zip -
download all files (zip)
9,866,382 bytes unzipped