Process Discovery Contest 2023
DOI:10.4121/afd6f608-469e-48f9-977d-875b45840d39.v1
The DOI displayed above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future.
For a link that will always point to the latest version, please use
DOI: 10.4121/afd6f608-469e-48f9-977d-875b45840d39
DOI: 10.4121/afd6f608-469e-48f9-977d-875b45840d39
Datacite citation style
Eric Verbeek; Verbeek, H.M.W. (Eric) (2023): Process Discovery Contest 2023. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/afd6f608-469e-48f9-977d-875b45840d39.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite
Dataset
This data set contains the data set as was used for the Process Discovery Contest of 2023 (PDC 2023).
The data set contains 384 training logs, 96 corresponding test logs and base logs, 96 corresponding
ground truth logs, and 96 models. The logs are all stored using the IEEE XES file format (see either
https://www.xes-standard.org/ or https://ieeexplore.ieee.org/document/7740858), while the models are
workflow nets (a subclass of Petri nets) stored in the PNML fileformat (see
https://www.iso.org/obp/ui/#iso:std:iso-iec:15909:-2:ed-1:v1:en).
History
- 2023-10-04 first online, published, posted
Publisher
4TU.ResearchDataFormat
IEEE XES, ISO PNMLOrganizations
Task Force on Process Mining (https://tf-pm.org)DATA
Files (6)
- 3,217 bytesMD5:
a3f4c604bd3767f9ded0c1624de8c487readme.txt - 4,372,016 bytesMD5:
e50ca7e37445347320bc82b4fa2064bdBase Logs.zip - 4,532,918 bytesMD5:
7998c9286c93892b953dfdd4b94f7ddfGround Truth Logs.zip - 576,945 bytesMD5:
2b281aceff385fe8635ab86b3dc61a15Models.zip - 4,371,117 bytesMD5:
8f7012e7ded6f620344373af392d14cfTest Logs.zip - 17,149,317 bytesMD5:
70cf660efcdd6d4ba32dc6878b93be2bTraining Logs.zip -
download all files (zip)
31,005,530 bytes unzipped





