Process Discovery Contest 2024
doi:10.4121/3cfcdbb7-c909-4f60-8bec-62c780598047.v1
The doi above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future.
For a link that will always point to the latest version, please use
doi: 10.4121/3cfcdbb7-c909-4f60-8bec-62c780598047
doi: 10.4121/3cfcdbb7-c909-4f60-8bec-62c780598047
Datacite citation style:
Eric Verbeek (2024): Process Discovery Contest 2024. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/3cfcdbb7-c909-4f60-8bec-62c780598047.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite
Dataset
licence
CC0
This data set contains the data set as was used for the Process Discovery Contest of 2024 (PDC 2024).
The data set contains 288 training logs, 96 corresponding test logs and base logs, 96 corresponding
ground truth logs, and 96 models. The logs are all stored using the IEEE XES file format (see either
https://www.xes-standard.org/ or https://ieeexplore.ieee.org/document/7740858), while the models are
workflow nets (a subclass of Petri nets) stored in the PNML fileformat (see
https://www.iso.org/obp/ui/#iso:std:iso-iec:15909:-2:ed-1:v1:en).
history
- 2024-09-23 first online, published, posted
publisher
4TU.ResearchData
format
IEEE XES, ISO PNML
organizations
Task Force on Process Mining (https://tf-pm.org)
DATA
files (6)
- 3,450 bytesMD5:
6a1cac6347fe7501757b4423315d8efb
readme.txt - 2,606,626 bytesMD5:
6bae873769c4c37bfab9bef1360e9c71
Base Logs.zip - 2,757,368 bytesMD5:
131ee95ead821791dc6f7f9fe5307af9
Ground Truth Logs.zip - 497,118 bytesMD5:
b06de0eccc693338e69caa5db3664b2a
Models.zip - 2,607,251 bytesMD5:
19cd100d738cfefff149e952b2faf180
Test Logs.zip - 6,861,194 bytesMD5:
fc511f0aa3c5853c5567ddde894563dc
Training Logs.zip -
download all files (zip)
15,333,007 bytes unzipped