Process Discovery Contest 2025
DOI:10.4121/7212a73a-1eac-4a08-8c01-973dca020822.v1
The DOI displayed above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future.
For a link that will always point to the latest version, please use
DOI: 10.4121/7212a73a-1eac-4a08-8c01-973dca020822
DOI: 10.4121/7212a73a-1eac-4a08-8c01-973dca020822
Datacite citation style
Eric Verbeek (2025): Process Discovery Contest 2025. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/7212a73a-1eac-4a08-8c01-973dca020822.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite
Dataset
Licence CC0
Interoperability
This folder contains the data set as was used for the Process Discovery Contest of 2025 (PDC 2025).
The data set contains 288 training logs, 96 corresponding test logs and base logs, 96 corresponding
ground truth logs, and 96 models. The logs are all stored using the IEEE XES file format (see either
https://www.xes-standard.org/ or https://ieeexplore.ieee.org/document/7740858), while the models are
workflow nets (a subclass of Petri nets) stored in the PNML fileformat (see
https://www.iso.org/obp/ui/#iso:std:iso-iec:15909:-2:ed-1:v1:en).
History
- 2025-09-08 first online, published, posted
Publisher
4TU.ResearchDataFormat
IEEE XES, ISO PNMLOrganizations
Task Force on Process Mining (https://tf-pm.org)DATA
Files (6)
- 3,463 bytesMD5:
54fb914c330cd0d2a85d3f4c9d01c06b
readme.txt - 3,205,758 bytesMD5:
59918d57b7f13e6f186fcf6899c97389
Base Logs.zip - 3,352,152 bytesMD5:
32a18818fe437c2f6be5c2dec9dd540e
Ground Truth Logs.zip - 653,582 bytesMD5:
db5d0e9cce7cd583f546c2b08a8fefaa
Models.zip - 3,206,564 bytesMD5:
9417b099e0c6d9620ee113f8d42a5d93
Test Logs.zip - 8,530,906 bytesMD5:
445adf204e43ddfa8589c86a36eeb0a6
Training Logs.zip -
download all files (zip)
18,952,425 bytes unzipped